How Do I Handle the infoROM Error?
Symptom
The error message "WARNING:infoROM is corrupted at gpu 0000:00:0D.0" is displayed when the nvidia-smi command is executed on a Linux ECS, and the services have been affected.
Possible Causes
The health check failed so the GPU driver would not use or trust its content (some content is not used).
Impact
The ECC-related non-volatile data records may be affected. As a result, the GPU memory pages would have been retired are still being used.
Solution
- If services are not affected, no operation is required.
- If services are affected, stop the services, migrate ECSs, collect fault information by referring to Fault Information Collection, and contact technical support.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot