Updated on 2025-07-30 GMT+08:00

How Do I Handle the infoROM Error?

Symptom

The error message "WARNING:infoROM is corrupted at gpu 0000:00:0D.0" is displayed when the nvidia-smi command is executed on a Linux ECS, and the services have been affected.

Possible Causes

The health check failed so the GPU driver would not use or trust its content (some content is not used).

Impact

The ECC-related non-volatile data records may be affected. As a result, the GPU memory pages would have been retired are still being used.

Solution

  1. If services are not affected, no operation is required.
  2. If services are affected, stop the services, migrate ECSs, collect fault information by referring to Fault Information Collection, and contact technical support.