What Can I Do If an Xid Error Is Displayed in the Message Log When a GPU-accelerated ECS Is Faulty?
Possible Causes
XID |
Description |
---|---|
32 |
Invalid or corrupted push buffer stream |
74 |
NVLINK Error, which indicates that the GPU hardware is faulty and needs to be brought offline for repair. |
79 |
GPU has fallen off the bus, which indicates that the bus is disconnected and needs to be brought offline for repair. |
For details, see https://docs.nvidia.com/deploy/xid-errors/index.html.
Solution
- Run the dmesg | grep –i xid command to check whether there are Xid errors.
- Stop the services, perform service migration, collect fault information by referring to Fault Information Collection, and contact technical support.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot