Environment Constraints for GPU Monitoring
- GPU monitoring is supported only on Linux, and only on some Linux public image versions. For details, see What OSs Does the Agent Support?
- Supported flavors: ECSs of the G6v, G6, P2s, P2v, P2vs, G5, Pi2, Pi1, and P1 series, and BMSs of the P, Pi, G, and KP series.
- The enhanced edition of the Agent is installed. For details, see Installing the Agent. Table 1 describes the differences between the basic and enhanced editions of the Agent.
Table 1 Basic edition and enhanced edition of the Agent

| Edition | Description |
| --- | --- |
| Basic | Provides basic OS monitoring metrics, such as CPU, memory, disk, and NIC metrics, helping you improve system performance. Generally, the version number consists of three dot-separated numbers, for example, 2.7.5. |
| Enhanced | Provides GPU, NPU, and BMS hardware monitoring, in addition to the capabilities provided in the basic edition. Generally, the version number consists of four dot-separated numbers, for example, 2.7.5.1. |
CAUTION: Install the enhanced edition of the Agent only if you need it. It collects more metrics and may therefore consume more server resources.
- GPU metric collection depends on the following driver files. Check that the corresponding driver files exist in your environment.
- Linux driver file
nvmlUbuntuNvidiaLibraryPath = "/usr/lib/x86_64-linux-gnu/libnvidia-ml.so.1"
nvmlCentosNvidiaLibraryPath = "/usr/lib64/libnvidia-ml.so.1"
nvmlCceNvidiaLibraryPath = "/opt/cloud/cce/nvidia/lib64/libnvidia-ml.so.1"
- Windows driver file
DefaultNvmlDLLPath = "C:\\Program Files\\NVIDIA Corporation\\NVSMI\\nvml.dll"
WHQLNvmlDLLPath = "C:\\Windows\\System32\\nvml.dll"
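The driver file check described above can be scripted. The following is a minimal sketch in Python, not part of the Agent itself: the helper names `find_nvml_files` and `candidates_for` are hypothetical, and the paths are copied from the lists above. It only reports which of the listed files exist. Whether the Agent requires one of them or a specific one is not stated here, so the script makes no assumption about that.

```python
import os
import platform

# Driver file paths copied from the lists above, keyed by the names used there.
LINUX_NVML_PATHS = {
    "nvmlUbuntuNvidiaLibraryPath": "/usr/lib/x86_64-linux-gnu/libnvidia-ml.so.1",
    "nvmlCentosNvidiaLibraryPath": "/usr/lib64/libnvidia-ml.so.1",
    "nvmlCceNvidiaLibraryPath": "/opt/cloud/cce/nvidia/lib64/libnvidia-ml.so.1",
}
WINDOWS_NVML_PATHS = {
    "DefaultNvmlDLLPath": "C:\\Program Files\\NVIDIA Corporation\\NVSMI\\nvml.dll",
    "WHQLNvmlDLLPath": "C:\\Windows\\System32\\nvml.dll",
}


def candidates_for(system):
    """Pick the path list for an OS name as returned by platform.system()."""
    return WINDOWS_NVML_PATHS if system == "Windows" else LINUX_NVML_PATHS


def find_nvml_files(candidates, exists=os.path.isfile):
    """Return the entries of `candidates` whose file exists.

    `exists` is injectable so the check can be tested without a GPU host.
    """
    return {name: path for name, path in candidates.items() if exists(path)}


if __name__ == "__main__":
    candidates = candidates_for(platform.system())
    found = find_nvml_files(candidates)
    for name, path in candidates.items():
        status = "found" if name in found else "missing"
        print(f"{status:8} {name}: {path}")
```

Run the script on the server to be monitored. If every entry is reported as missing, the NVIDIA driver is probably not installed in any of the locations listed above.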