What Metrics Are Supported by the Agent?
OS metric: CPU
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
cpu_usage | (Agent) CPU Usage | Used to monitor CPU usage
| 0-100 | % | N/A | 2.4.1 | 1 minute |
cpu_usage_idle | (Agent) Idle CPU Usage | Percentage of the time that CPU is idle Unit: Percent
| 0-100 | % | N/A | 2.4.5 | 1 minute |
cpu_usage_other | (Agent) Other Process CPU Usage | Other CPU usage of the monitored object
| 0-100 | % | N/A | 2.4.5 | 1 minute |
cpu_usage_system | (Agent) Kernel Space CPU Usage | Percentage of time that the CPU is used by kernel space
| 0-100 | % | N/A | 2.4.5 | 1 minute |
cpu_usage_user | (Agent) User Space CPU Usage | Percentage of time that the CPU is used by user space
| 0-100 | % | N/A | 2.4.5 | 1 minute |
cpu_usage_nice | (Agent) Nice Process CPU Usage | Percentage of the time that the CPU is in user mode with low-priority processes which can easily be interrupted by higher-priority processes
| 0-100 | % | N/A | 2.4.5 | 1 minute |
cpu_usage_iowait | (Agent) iowait Process CPU Usage | Percentage of time that the CPU is waiting for I/O operations to complete
| 0-100 | % | N/A | 2.4.5 | 1 minute |
cpu_usage_irq | (Agent) CPU Interrupt Time | Percentage of time that the CPU is servicing interrupts
| 0-100 | % | N/A | 2.4.5 | 1 minute |
cpu_usage_softirq | (Agent) CPU Software Interrupt Time | Percentage of time that the CPU is servicing software interrupts
| 0-100 | % | N/A | 2.4.5 | 1 minute |
OS Metric: CPU Load
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
load_average1 | (Agent) 1-Minute Load Average | CPU load averaged from the last 1 minute
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
load_average5 | (Agent) 5-Minute Load Average | CPU load averaged from the last 5 minutes
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
load_average15 | (Agent) 15-Minute Load Average | CPU load averaged from the last 15 minutes
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
OS Metric: Memory
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
mem_available | (Agent) Available Memory | Amount of memory that is available and can be given instantly to processes
| ≥ 0 | GB | N/A | 2.4.5 | 1 minute |
mem_usedPercent | (Agent) Memory Usage | Memory usage of the instance
| 0-100 | % | N/A | 2.4.1 | 1 minute |
mem_free | (Agent) Idle Memory | Amount of memory that is not being used
| ≥ 0 | GB | N/A | 2.4.5 | 1 minute |
mem_buffers | (Agent) Buffer | Amount of memory that is being used for buffers
| ≥ 0 | GB | N/A | 2.4.5 | 1 minute |
mem_cached | (Agent) Cache | Amount of memory that is being used for file caches
| ≥ 0 | GB | N/A | 2.4.5 | 1 minute |
total_open_files | (Agent) Total File Handles | Total handles used by all processes
| ≥ 0 | Count | N/A | 2.4.5 | 1 minute |
OS Metric: Disk

Currently, CES Agent can collect only physical disk metrics and does not support disks mounted using the network file system protocol.
By default, CES Agent will not monitor Docker-related mount points. The prefix of the mount point is as follows:
/var/lib/docker;/mnt/paas/kubernetes;/var/lib/mesos
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
disk_free | (Agent) Available Disk Space | Free space on the disks
| ≥ 0 | GB | N/A | 2.4.1 | 1 minute |
disk_total | (Agent) Disk Storage Capacity | Total disk capacity
| ≥ 0 | GB | N/A | 2.4.5 | 1 minute |
disk_used | (Agent) Used Disk Space | Disk's used space
| ≥ 0 | GB | N/A | 2.4.5 | 1 minute |
disk_usedPercent | (Agent) Disk Usage | Percentage of used disk space. It is calculated as follows: Disk Usage =Used Disk Space/Disk Storage Capacity.
| 0-100 | % | N/A | 2.4.1 | 1 minute |
disk_rwstate | (Agent) Disk Read/Write Status | Read and write status of the disk attached to the monitored object. The status can be 0 (read and write) or 1 (read-only).
|
| None | N/A | 2.5.6 | 1 minute |
OS Metric: Disk I/O
OS Metric: File System
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
disk_fs_rwstate | (Agent) File System Read/Write Status | Read and write status of the mounted file system of the monitored object The status can be 0 (read and write) or 1 (read-only).
|
| None | N/A | 2.4.5 | 1 minute |
disk_inodesTotal | (Agent) Disk inode Total | Total number of index nodes on the disk
| ≥ 0 | None | N/A | 2.4.5 | 1 minute |
disk_inodesUsed | (Agent) Total inode Used | Number of used index nodes on the disk
| ≥ 0 | None | N/A | 2.4.5 | 1 minute |
disk_inodesUsedPercent | (Agent) Percentage of Total inode Used | Number of used index nodes on the disk
| 0-100 | % | N/A | 2.4.1 | 1 minute |
OS Metric: NTP
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
ntp_offset | (Agent) NTP Offset | NTP offset of the monitored object
| ≥ 0 | ms | N/A | 2.7.1 | 1 minute |
OS Metric: TCP Connections
By default, two basic metrics related to TCP connections are collected: (Agent) TCP TOTAL and (Agent) TCP ESTABLISHED.
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
net_tcp_total | (Agent) Total Number of TCP Connections | Total number of TCP connections
| ≥ 0 | count | N/A | 2.4.1 | 1 minute |
net_tcp_established | (Agent) Number of connections in the ESTABLISHED state | Number of TCP connections in the ESTABLISHED state
| ≥ 0 | count | N/A | 2.4.1 | 1 minute |
net_tcp_sys_sent | (Agent) Number of connections in the TCP SYS_SENT state. | Number of TCP connections that are being requested by the client
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_sys_recv | (Agent) Number of connections in the TCP SYS_RECV state. | Number of pending TCP connections received by the server
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_fin_wait1 | (Agent) Number of TCP connections in the FIN_WAIT1 state. | Number of TCP connections waiting for ACK packets when the connections are being actively closed by the client
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_fin_wait2 | (Agent) Number of TCP connections in the FIN_WAIT2 state. | Number of TCP connections in the FIN_WAIT2 state
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_time_wait | (Agent) TCP TIME_WAIT Connections | Number of TCP connections in the TIME_WAIT state
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_close | (Agent) Number of TCP connections in the CLOSE state. | Number of closed TCP connections
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_close_wait | (Agent) TCP CLOSE_WAIT Connections | Number of TCP connections in the CLOSE_WAIT state
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_last_ack | (Agent) Number of TCP connections in the LAST_ACK state. | Number of TCP connections waiting for ACK packets when the connections are being passively closed by the client
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_listen | (Agent) Number of TCP connections in the LISTEN state. | Number of TCP connections in the LISTEN state
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_closing | (Agent) Number of TCP connections in the CLOSING state. | Number of TCP connections to be automatically closed by the server and the client at the same time
| ≥ 0 | count | N/A | 2.4.5 | 1 minute |
net_tcp_retrans | (Agent) TCP Retransmission Rate | Percentage of packets that are resent
| 0-100 | % | N/A | 2.4.5 | 1 minute |
OS Metric: NIC
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
net_bitRecv | (Agent) Outbound Bandwidth | Number of bits sent by this NIC per second
| ≥ 0 | bit/s | 1024(IEC) | 2.4.1 | 1 minute |
net_bitSent | (Agent) Inbound Bandwidth | Number of bits received by this NIC per second
| ≥ 0 | bit/s | 1024(IEC) | 2.4.1 | 1 minute |
net_packetRecv | (Agent) NIC Packet Receive Rate | Number of packets received by this NIC per second
| ≥ 0 | Count/s | N/A | 2.4.1 | 1 minute |
net_packetSent | (Agent) NIC Packet Send Rate | Number of packets sent by this NIC per second
| ≥ 0 | Count/s | N/A | 2.4.1 | 1 minute |
net_errin | (Agent) Receive Error Rate | Percentage of receive errors detected by this NIC per second
| 0-100 | % | N/A | 2.4.5 | 1 minute |
net_errout | (Agent) Transmit Error Rate | Percentage of transmit errors detected by this NIC per second
| 0-100 | % | N/A | 2.4.5 | 1 minute |
net_dropin | (Agent) Received Packet Drop Rate | Percentage of packets received by this NIC which were dropped per second
| 0-100 | % | N/A | 2.4.5 | 1 minute |
net_dropout | (Agent) Transmitted Packet Drop Rate | Percentage of packets transmitted by this NIC which were dropped per second
| 0-100 | % | N/A | 2.4.5 | 1 minute |
Process Monitoring Metrics
Metric | Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Monitoring Period (Raw Data) |
|---|---|---|---|---|---|---|---|
proc_pHashId_cpu | (Agent) CPU Usage | CPU consumed by a process. pHashId (process name and process ID) is the value of md5.
| 0–1 x Number of vCPUs | % | N/A | 2.4.1 | 1 minute |
proc_pHashId_mem | (Agent) Memory Usage | Memory consumed by a process. pHashId (process name and process ID) is the value of md5.
| 0-100 | % | N/A | 2.4.1 | 1 minute |
proc_pHashId_file | (Agent) Number of opened files | Number of files opened by a process. pHashId (process name and process ID) is the value of md5.
| ≥ 0 | Count | N/A | 2.4.1 | 1 minute |
proc_running_count | (Agent) Running processes | Number of processes that are running
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
proc_idle_count | (Agent) Idle Processes | Number of processes that are idle
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
proc_zombie_count | (Agent) Zombie Processes | Number of zombie processes
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
proc_blocked_count | (Agent) Blocked Processes | Number of processes that are blocked
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
proc_sleeping_count | (Agent) Sleeping Processes | Number of processes that are sleeping
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
proc_total_count | (Agent) Total Processes | Total number of processes on the monitored object
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
proc_specified_count | (Agent) Specified Processes | Number of specified processes
| ≥ 0 | None | N/A | 2.4.1 | 1 minute |
GPU Specifications

If a GPU server has eight GPU cards and the PM mode is disabled, data may fail to be collected. You can enable the PM mode and restart the monitoring process.
Category | Metric Name | Description | Value Range | Unit | Conversion Rule | Earliest Agent Version Required | Collection Interval |
|---|---|---|---|---|---|---|---|
GPU Specifications | gpu_status | GPU health status of the VM. This metric is a composite metric.
|
| None | N/A | 2.4.5 | 1 minute |
gpu_performance_state | Performance status of the GPU
|
| None | N/A | 2.4.1 | 1 minute | |
gpu_power_draw | Power of the GPU.
| ≥ 0 | W | N/A | 2.4.5 | 1 minute | |
gpu_temperature | Temperature of the GPU.
| ≥ 0 | °C | N/A | 2.4.5 | 1 minute | |
gpu_usage_gpu | GPU computing power usage.
| 0-100 | % | N/A | 2.4.1 | 1 minute | |
gpu_usage_mem | GPU memory usage.
| 0-100 | % | N/A | 2.4.1 | 1 minute | |
gpu_used_mem | Used GPU memory.
| ≥ 0 | MB | N/A | 2.4.5 | 1 minute | |
gpu_free_mem | Remaining GPU memory.
| ≥ 0 | MB | N/A | 2.4.5 | 1 minute | |
gpu_usage_encoder | GPU encoder usage.
| 0-100 | % | N/A | 2.4.5 | 1 minute | |
gpu_usage_decoder | GPU decoder usage.
| 0-100 | % | N/A | 2.4.5 | 1 minute | |
gpu_graphics_clocks | GPU graphics (shader) clock frequency.
| ≥ 0 | MHz | N/A | 2.4.5 | 1 minute | |
gpu_sm_clocks | Streaming processor clock frequency of the GPU.
| ≥ 0 | MHz | N/A | 2.4.5 | 1 minute | |
gpu_mem_clocks | Memory clock frequency of the GPU.
| ≥ 0 | MHz | N/A | 2.4.5 | 1 minute | |
gpu_video_clocks | Video (including codec) clock frequency of the GPU.
| ≥ 0 | MHz | N/A | 2.4.5 | 1 minute | |
gpu_tx_throughput_pci | Outbound bandwidth of the GPU.
| ≥ 0 | MByte/s | N/A | 2.4.5 | 1 minute | |
gpu_rx_throughput_pci | Inbound bandwidth of the GPU.
| ≥ 0 | MByte/s | N/A | 2.4.5 | 1 minute | |
gpu_volatile_correctable | Number of correctable ECC errors since the GPU is reset. The value is reset to 0 each time the GPU is reset.
| ≥ 0 | count | N/A | 2.4.5 | 1 minute | |
gpu_volatile_uncorrectable | Number of uncorrectable ECC errors since the GPU is reset. The value is reset to 0 each time the GPU is reset.
| ≥ 0 | count | N/A | 2.4.5 | 1 minute | |
gpu_aggregate_correctable | Number of correctable ECC errors on the GPU.
| ≥ 0 | count | N/A | 2.4.5 | 1 minute | |
gpu_aggregate_uncorrectable | Number of uncorrectable ECC errors on the GPU.
| ≥ 0 | count | N/A | 2.4.5 | 1 minute | |
gpu_retired_page_single_bit | Number of retired page single bit errors, which indicates the number of single-bit pages isolated by the GPU.
| ≥ 0 | count | N/A | 2.4.5 | 1 minute | |
gpu_retired_page_double_bit | Number of retired page double bit errors, which indicates the number of double-bit pages isolated by the GPU.
| ≥ 0 | count | N/A | 2.4.5 | 1 minute | |
gpu_lnkcap_speed | Maximum speed supported by the PCIe link of the GPU.
| ≥ 0 | GT/s | N/A | 2.6.7 | 1 minute | |
gpu_lnkcap_width | Link width of the PCIe link.
| ≥ 0 | count | N/A | 2.6.7 | 1 minute | |
gpu_lnksta_speed | PCIe connection speed of the GPU.
| ≥ 0 | GT/s | N/A | 2.6.7 | 1 minute | |
gpu_lnksta_width | PCIe link width of the GPU.
| ≥ 0 | count | N/A | 2.6.7 | 1 minute | |
gpu_nvlink_number | Number of NVLink links of the GPU.
| ≥ 0 | count | N/A | 2.6.7 | 1 minute | |
gpu_nvlink_bandwidth | NVLink link width of the GPU.
| ≥ 0 | GB/s | N/A | 2.6.7 | 1 minute |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.

