Why Is the GPU Driver Abnormal?
Symptom
When you run the following command on a GPU-accelerated ECS to view the CPU usage, the system displays a message indicating that the specified program cannot be executed or the file path does not exist.
nvidia-smi
Information similar to the following is displayed:
-bash: /bin/nvidia-smi: No such file or directory
or
nvidia-smi: command not found
Possible Causes
The ECS driver is abnormal, not installed, or uninstalled.
Solution
- If the GPU driver is not installed, install it.
For details, see Installing a Driver and Toolkit.
- If the driver has been uninstalled:
Run the history command to check whether an uninstallation has been performed.
Go to the /var/log directory and check whether the nvidia-uninstall.log file exists. If the log exists, the GPU driver has been uninstalled. Reinstall the GPU driver.
- If the driver has been installed but the driver status is abnormal:
- Uninstall the driver.
- Method 1: Run the nvidia-uninstall command to uninstall the driver.
If the system displays a message indicating that the command does not exist, use method 2.
- Method 2: Run the whereis nvidia command to query the version of the driver installed on the ECS.
Figure 1 Installed driver version
Download the driver package of the same version as the obtained one from the NVIDIA official website. (This driver package is required when you uninstall and reinstall the driver.)
For example, if the driver version is nvidia-396.44, run the sh NVIDIA-Linux-x86_64-396.44.run --uninstall command to uninstall the driver.
- Method 1: Run the nvidia-uninstall command to uninstall the driver.
- Reinstall the driver.
For details, see Installing a Driver and Toolkit.
- Uninstall the driver.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot