What Metrics Are Supported by the Agent?
OS metric: CPU
Metric |
Name |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
cpu_usage |
(Agent) CPU Usage |
Used to monitor CPU usage
|
0-100 |
% |
N/A |
2.4.1 |
1 minute |
cpu_usage_idle |
(Agent) Idle CPU Usage |
Percentage of the time that CPU is idle Unit: Percent
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
cpu_usage_other |
(Agent) Other Process CPU Usage |
Other CPU usage of the monitored object
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
cpu_usage_system |
(Agent) Kernel Space CPU Usage |
Percentage of time that the CPU is used by kernel space
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
cpu_usage_user |
(Agent) User Space CPU Usage |
Percentage of time that the CPU is used by user space
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
cpu_usage_nice |
(Agent) Nice Process CPU Usage |
Percentage of the time that the CPU is in user mode with low-priority processes which can easily be interrupted by higher-priority processes
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
cpu_usage_iowait |
(Agent) iowait Process CPU Usage |
Percentage of time that the CPU is waiting for I/O operations to complete
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
cpu_usage_irq |
(Agent) CPU Interrupt Time |
Percentage of time that the CPU is servicing interrupts
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
cpu_usage_softirq |
(Agent) CPU Software Interrupt Time |
Percentage of time that the CPU is servicing software interrupts
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
OS Metric: CPU Load
Metric |
Name |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
load_average1 |
(Agent) 1-Minute Load Average |
CPU load averaged from the last 1 minute
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
load_average5 |
(Agent) 5-Minute Load Average |
CPU load averaged from the last 5 minutes
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
load_average15 |
(Agent) 15-Minute Load Average |
CPU load averaged from the last 15 minutes
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
OS Metric: Memory
Metric |
Name |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
mem_available |
(Agent) Available Memory |
Amount of memory that is available and can be given instantly to processes
|
≥0 |
GB |
N/A |
2.4.5 |
1 minute |
mem_usedPercent |
(Agent) Memory Usage |
Memory usage of the instance
|
0-100 |
% |
N/A |
2.4.1 |
1 minute |
mem_free |
(Agent) Idle Memory |
Amount of memory that is not being used
|
≥0 |
GB |
N/A |
2.4.5 |
1 minute |
mem_buffers |
(Agent) Buffer |
Amount of memory that is being used for buffers
|
≥0 |
GB |
N/A |
2.4.5 |
1 minute |
mem_cached |
(Agent) Cache |
Amount of memory that is being used for file caches
|
≥0 |
GB |
N/A |
2.4.5 |
1 minute |
total_open_files |
(Agent) Total File Handles |
Total handles used by all processes
|
≥0 |
None |
N/A |
2.4.5 |
1 minute |
OS Metric: Disk

Currently, CES Agent can collect only physical disk metrics and does not support disks mounted using the network file system protocol.
By default, CES Agent will not monitor Docker-related mount points. The prefix of the mount point is as follows:
/var/lib/docker;/mnt/paas/kubernetes;/var/lib/mesos
Metric |
Name |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
disk_free |
(Agent) Available Disk Space |
Free space on the disks
|
≥0 |
GB |
N/A |
2.4.1 |
1 minute |
disk_total |
(Agent) Disk Storage Capacity |
Total disk capacity
|
≥0 |
GB |
N/A |
2.4.5 |
1 minute |
disk_used |
(Agent) Used Disk Space |
Disk's used space
|
≥0 |
GB |
N/A |
2.4.5 |
1 minute |
disk_usedPercent |
(Agent) Disk Usage |
Percentage of used disk space. It is calculated as follows: Disk Usage = Used Disk Space/Disk Storage Capacity.
|
0-100 |
% |
N/A |
2.4.1 |
1 minute |
OS Metric: Disk I/O
OS Metric: File System
Metric |
Name |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
disk_fs_rwstate |
(Agent) File System Read/Write Status |
Read and write status of the mounted file system of the monitored object Possible statuses are 0 (read and write) and 1 (read only).
|
|
None |
N/A |
2.4.5 |
1 minute |
disk_inodesTotal |
(Agent) Disk inode Total |
Total number of index nodes on the disk
|
≥ 0 |
None |
N/A |
2.4.5 |
1 minute |
disk_inodesUsed |
(Agent) Total inode Used |
Number of used index nodes on the disk
|
≥ 0 |
None |
N/A |
2.4.5 |
1 minute |
disk_inodesUsedPercent |
(Agent) Percentage of Total inode Used |
Number of used index nodes on the disk
|
0-100 |
% |
N/A |
2.4.1 |
1 minute |
OS Metric: TCP
Metric |
Metric |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
net_tcp_total |
(Agent) Total Number of TCP Connections |
Total number of TCP connections
|
≥ 0 |
count |
N/A |
2.4.1 |
1 minute |
net_tcp_established |
(Agent) Number of connections in the ESTABLISHED state |
Number of TCP connections in the ESTABLISHED state
|
≥ 0 |
count |
N/A |
2.4.1 |
1 minute |
net_tcp_sys_sent |
(Agent) Number of connections in the TCP SYS_SENT state. |
Number of TCP connections that are being requested by the client
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_sys_recv |
(Agent) Number of connections in the TCP SYS_RECV state. |
Number of pending TCP connections received by the server
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_fin_wait1 |
(Agent) Number of TCP connections in the FIN_WAIT1 state. |
Number of TCP connections waiting for ACK packets when the connections are being actively closed by the client
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_fin_wait2 |
(Agent) Number of TCP connections in the FIN_WAIT2 state. |
Number of TCP connections in the FIN_WAIT2 state
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_time_wait |
(Agent) Number of TCP connections in the TIME_WAIT state. |
Number of TCP connections in the TIME_WAIT state
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_close |
(Agent) Number of TCP connections in the CLOSE state. |
Number of closed TCP connections
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_close_wait |
(Agent) Number of TCP connections in the CLOSE_WAIT state. |
Number of TCP connections in the CLOSE_WAIT state
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_last_ack |
(Agent) Number of TCP connections in the LAST_ACK state. |
Number of TCP connections waiting for ACK packets when the connections are being passively closed by the client
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_listen |
(Agent) Number of TCP connections in the LISTEN state. |
Number of TCP connections in the LISTEN state
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_closing |
(Agent) Number of TCP connections in the CLOSING state. |
Number of TCP connections to be automatically closed by the server and the client at the same time
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
net_tcp_retrans |
(Agent) TCP Retransmission Rate |
Percentage of packets that are resent
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
OS Metric: NIC
Metric |
Name |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
net_bitRecv |
(Agent) Outbound Bandwidth |
Number of bits sent by this NIC per second
|
≥ 0 |
bit/s |
1024(IEC) |
2.4.1 |
1 minute |
net_bitSent |
(Agent) Inbound Bandwidth |
Number of bits received by this NIC per second
|
≥ 0 |
bit/s |
1024(IEC) |
2.4.1 |
1 minute |
net_packetRecv |
(Agent) NIC Packet Receive Rate |
Number of packets received by this NIC per second
|
≥ 0 |
Count/s |
N/A |
2.4.1 |
1 minute |
net_packetSent |
(Agent) NIC Packet Send Rate |
Number of packets sent by this NIC per second
|
≥ 0 |
Count/s |
N/A |
2.4.1 |
1 minute |
net_errin |
(Agent) Receive Error Rate |
Percentage of receive errors detected by this NIC per second
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
net_errout |
(Agent) Transmit Error Rate |
Percentage of transmit errors detected by this NIC per second
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
net_dropin |
(Agent) Received Packet Drop Rate |
Percentage of packets received by this NIC which were dropped per second
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
net_dropout |
(Agent) Transmitted Packet Drop Rate |
Percentage of packets transmitted by this NIC which were dropped per second
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
Process Monitoring Metrics
Metric |
Name |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
proc_pHashId_cpu |
(Agent) CPU Usage |
CPU consumed by a process. pHashId (process name and process ID) is the value of md5.
|
0–1 x Number of vCPUs |
% |
N/A |
2.4.1 |
1 minute |
proc_pHashId_mem |
(Agent) Memory Usage |
Memory consumed by a process. pHashId (process name and process ID) is the value of md5.
|
0-100 |
% |
N/A |
2.4.1 |
1 minute |
proc_pHashId_file |
(Agent) Number of opened files |
Number of files opened by a process. pHashId (process name and process ID) is the value of md5.
|
≥0 |
Count |
N/A |
2.4.1 |
1 minute |
proc_running_count |
(Agent) Number of running processes |
Number of processes that are running
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
proc_idle_count |
(Agent) Idle Processes |
Number of processes that are idle
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
proc_zombie_count |
(Agent) Zombie Processes |
Number of zombie processes
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
proc_blocked_count |
(Agent) Blocked Processes |
Number of processes that are blocked
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
proc_sleeping_count |
(Agent) Sleeping Processes |
Number of processes that are sleeping
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
proc_total_count |
(Agent) Total Processes |
Total number of processes on the monitored object
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
proc_specified_count |
(Agent) Specified Processes |
Number of specified processes
|
≥0 |
None |
N/A |
2.4.1 |
1 minute |
GPU Specifications

If a GPU server has eight GPU cards and the PM mode is disabled, data may fail to be collected. You can enable the PM mode and restart the monitoring process.
Category |
Metric Name |
Description |
Value Range |
Unit |
Conversion Rule |
Supported Version |
Collection Interval |
---|---|---|---|---|---|---|---|
GPU Specifications |
gpu_status |
GPU health status of the VM. This metric is a composite metric.
|
|
None |
N/A |
2.4.5 |
1 minute |
gpu_performance_state |
Performance status of the GPU
|
|
None |
N/A |
2.4.1 |
1 minute |
|
gpu_power_draw |
Power of the GPU.
|
≥ 0 |
W |
N/A |
2.4.5 |
1 minute |
|
gpu_temperature |
Temperature of the GPU.
|
≥ 0 |
°C |
N/A |
2.4.5 |
1 minute |
|
gpu_usage_gpu |
GPU computing power usage.
|
0-100 |
% |
N/A |
2.4.1 |
1 minute |
|
gpu_usage_mem |
GPU memory usage.
|
0-100 |
% |
N/A |
2.4.1 |
1 minute |
|
gpu_used_mem |
GPU memory usage.
|
≥ 0 |
MB |
N/A |
2.4.5 |
1 minute |
|
gpu_free_mem |
Remaining GPU memory.
|
≥ 0 |
MB |
N/A |
2.4.5 |
1 minute |
|
gpu_usage_encoder |
GPU encoder usage.
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
|
gpu_usage_decoder |
GPU decoder usage.
|
0-100 |
% |
N/A |
2.4.5 |
1 minute |
|
gpu_graphics_clocks |
GPU graphics (shader) clock frequency.
|
≥ 0 |
MHz |
N/A |
2.4.5 |
1 minute |
|
gpu_sm_clocks |
Streaming processor clock frequency of the GPU.
|
≥ 0 |
MHz |
N/A |
2.4.5 |
1 minute |
|
gpu_mem_clock |
Memory clock frequency of the GPU.
|
≥ 0 |
MHz |
N/A |
2.4.5 |
1 minute |
|
gpu_video_clocks |
Video (including codec) clock frequency of the GPU.
|
≥ 0 |
MHz |
N/A |
2.4.5 |
1 minute |
|
gpu_tx_throughput_pci |
Outbound bandwidth of the GPU.
|
≥ 0 |
MByte/s |
N/A |
2.4.5 |
1 minute |
|
gpu_rx_throughput_pci |
Inbound bandwidth of the GPU.
|
≥ 0 |
MByte/s |
N/A |
2.4.5 |
1 minute |
|
gpu_volatile_correctable |
Number of correctable ECC errors since the GPU is reset. The value is reset to 0 each time the GPU is reset.
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
|
gpu_volatile_uncorrectable |
Number of uncorrectable ECC errors since the GPU is reset. The value is reset to 0 each time the GPU is reset.
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
|
gpu_aggregate_correctable |
Number of correctable ECC errors on the GPU.
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
|
gpu_aggregate_uncorrectable |
Number of uncorrectable ECC Errors on the GPU.
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
|
gpu_retired_page_single_bit |
Number of retired page single bit errors, which indicates the number of single-bit pages isolated by the GPU.
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
|
gpu_retired_page_double_bit |
Number of retired page double bit errors, which indicates the number of double-bit pages isolated by the GPU.
|
≥ 0 |
count |
N/A |
2.4.5 |
1 minute |
|
gpu_lnkcap_speed |
Maximum speed supported by the PCIe link of the GPU.
|
≥ 0 |
GT/s |
N/A |
2.6.7 |
1 minute |
|
gpu_lnkcap_width |
Link width of the PCIe link.
|
≥ 0 |
count |
N/A |
2.6.7 |
1 minute |
|
gpu_lnksta_speed |
PCIe connection speed of the GPU.
|
≥ 0 |
GT/s |
N/A |
2.6.7 |
1 minute |
|
gpu_lnksta_width |
PCIe link width of the GPU.
|
≥ 0 |
count |
N/A |
2.6.7 |
1 minute |
|
gpu_nvlink_number |
Number of NVLink links of the GPU.
|
≥ 0 |
count |
N/A |
2.6.7 |
1 minute |
|
gpu_nvlink_bandwidth |
NVLink link width of the GPU.
|
≥ 0 |
GB/s |
N/A |
2.6.7 |
1 minute |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.