Help Center/
Well-Architected Framework/
Well-Architected Framework and Practices/
Resilience Pillar/
Cloud Service Reliability/
DCS/
Common Faults
Updated on 2025-05-22 GMT+08:00
Common Faults
Excessive CPU, Memory, Bandwidth, or Connection Usage of a DCS Instance
- Check: Use Cloud Eye to check the CPU, memory, bandwidth, or connection usage.
- Recovery:
- Change the specifications to expand resources based on service requirements.
- Enable overload protection for the application layer to keep high-priority services running smoothly. Switch some services that do not require high performance back to the original data source.
Failed to Connect to a Backend DCS Instance
- Check: Network connection failed.
- Recovery:
- If this is a temporary failure, for example, a master/standby switchover is in progress, connect to the DCS instance again at the application layer. For details, see RES09 Retries After Failures.
- If the connection to an overloaded DCS instance failed, rectify the fault by referring to "Excessive CPU, Memory, Bandwidth, or Connection Usage of a DCS Instance."
- For non-temporary failures, the data source at the application layer needs to be switched back to the original one for processing. This prevents service interruption due to cache faults.
Occasionally Failed to Read Data from or Write Data to a DCS Instance
- Check: Data failed to be read or written. Occasional timeout errors are normal in Redis because of network connectivity and client timeout configurations.
- Recovery:
- If this is a temporary failure, for example, the DCS instance is undergoing a master/standby switchover, retry at the application layer. For details, see RES09 Retries After Failures.
- If the connection to an overloaded DCS instance failed, rectify the fault by referring to "Excessive CPU, Memory, Bandwidth, or Connection Usage of a DCS Instance."
- For non-temporary failures, the data source at the application layer needs to be switched back to the original one for processing. This prevents service interruption due to cache faults.
Parent topic: DCS
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
The system is busy. Please try again later.
For any further questions, feel free to contact us through the chatbot.
Chatbot