OS Monitoring Metrics Supported by ECSs with the Agent Installed
Description
OS monitoring provides system-level, proactive, and fine-grained monitoring. It requires the Agent to be installed on the ECSs to be monitored. This section describes OS monitoring metrics reported to Cloud Eye. Monitoring data is collected every one minute.
OS monitoring supports metrics about the CPU, CPU load, memory, disk, disk I/O, file system, GPU, network interface, NTP, and TCP connections.
After the Agent is installed, you can view monitoring metrics of ECSs running different OSs.
Namespace
AGT.ECS
OS Metrics: CPU
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
cpu_usage |
(Agent) CPU Usage |
CPU usage of the monitored object
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_idle |
(Agent) Idle CPU Usage |
Percentage of time that CPU is idle
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_user |
(Agent) User Space CPU Usage |
Percentage of time that the CPU is used by user space
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_system |
(Agent) Kernel Space CPU Usage |
Percentage of time that the CPU is used by kernel space
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_other |
(Agent) Other Process CPU Usage |
Percentage of time that the CPU is used by other processes
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_nice |
(Agent) Nice Process CPU Usage |
Percentage of time that the CPU is in user mode with low-priority processes which can easily be interrupted by higher-priority processes
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_iowait |
(Agent) iowait Process CPU Usage |
Percentage of time that the CPU is waiting for I/O operations to complete
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_irq |
(Agent) CPU Interrupt Time |
Percentage of time that the CPU is servicing interrupts
|
0-100 |
% |
N/A |
ECS |
1 minute |
cpu_usage_softirq |
(Agent) CPU Software Interrupt Time |
Percentage of time that the CPU is servicing software interrupts
|
0-100 |
% |
N/A |
ECS |
1 minute |
OS Metric: CPU Load
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
load_average1 |
(Agent) 1-Minute Load Average |
CPU load averaged from the last 1 minute Linux: Obtain the metric value from the number of logic CPUs in load1/ in file /proc/loadavg. Run the top command to check the load1 value. |
≥ 0 |
N/A |
N/A |
ECS |
1 minute |
load_average5 |
(Agent) 5-Minute Load Average |
CPU load averaged from the last 5 minutes Linux: Obtain the metric value from the number of logic CPUs in load5/ in file /proc/loadavg. Run the top command to check the load5 value. |
≥ 0 |
N/A |
N/A |
ECS |
1 minute |
load_average15 |
(Agent) 15-Minute Load Average |
CPU load averaged from the last 15 minutes Linux: Obtain the metric value from the number of logic CPUs in load15/ in file /proc/loadavg. Run the top command to check the load15 value. |
≥ 0 |
N/A |
N/A |
ECS |
1 minute |
OS Metric: Memory
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
mem_available |
(Agent) Available Memory |
Amount of memory that is available and can be given instantly to processes
|
≥0 |
GB |
N/A |
ECS |
1 minute |
mem_usedPercent |
(Agent) Memory Usage |
Memory usage of the monitored object
|
0-100 |
% |
N/A |
ECS |
1 minute |
mem_free |
(Agent) Idle Memory |
Amount of memory that is not being used
|
≥0 |
GB |
N/A |
ECS |
1 minute |
mem_buffers |
(Agent) Buffer |
Amount of memory that is being used for buffers
|
≥0 |
GB |
N/A |
ECS |
1 minute |
mem_cached |
(Agent) Cache |
Amount of memory that is being used for file caches
|
≥0 |
GB |
N/A |
ECS |
1 minute |
total_open_files |
(Agent) Total File Handles |
Total handles used by all processes
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
OS Metric: Disk

- Currently, only physical disks are monitored. The NFS-attached disks cannot be monitored.
- By default, Docker-related mount points are shielded. The prefix of the mount point is as follows:
/var/lib/docker;/mnt/paas/kubernetes;/var/lib/mesos
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
disk_free |
(Agent) Available Disk Space |
Free space on the disks
|
≥0 |
GB |
N/A |
ECS - Mount point |
1 minute |
disk_total |
(Agent) Disk Storage Capacity |
Total space on the disks, including used and free
|
≥0 |
GB |
N/A |
ECS - Mount point |
1 minute |
disk_used |
(Agent) Used Disk Space |
Used space on the disks
|
≥0 |
GB |
N/A |
ECS - Mount point |
1 minute |
disk_usedPercent |
(Agent) Disk Usage |
Percentage of total disk space that is used, which is calculated as follows: Disk Usage = Used Disk Space/Disk Storage Capacity
|
0-100 |
% |
N/A |
ECS - Mount point |
1 minute |
OS Metric: Disk I/O
OS Metric: File System
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
disk_fs_rwstate |
(Agent) File System Read/Write Status |
Read and write status of the mounted file system of the monitored object. Value: 0 (read and write) or 1 (read only) Linux: Check file system information in the fourth column in file /proc/mounts. |
|
N/A |
N/A |
ECS - Mount point |
1 |
disk_inodesTotal |
(Agent) Disk inode Total |
Total number of index nodes on the disk Linux: Run the df -i command to check the value in the Inodes column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), periods (.), and swung dashes (~). |
≥ 0 |
Count |
N/A |
ECS - Mount point |
1 minute |
disk_inodesUsed |
(Agent) Total inode Used |
Number of used index nodes on the disk Linux: Run the df -i command to check the value in the IUsed column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), periods (.), and swung dashes (~). |
≥ 0 |
Count |
N/A |
ECS - Mount point |
1 minute |
disk_inodesUsedPercent |
(Agent) Percentage of Total inode Used |
Number of used index nodes on the disk Linux: Run the df -i command to check the value in the IUse% column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), periods (.), and swung dashes (~). |
0-100 |
% |
N/A |
ECS - Mount point |
1 minute |

The Windows OS does not support the file system metrics.
OS Metric: NIC
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
net_bitRecv |
(Agent) Outbound Bandwidth |
Number of bits sent by this NIC per second
|
≥ 0 |
bit/s |
1024(IEC) |
ECS |
1 minute |
net_bitSent |
(Agent) Inbound Bandwidth |
Number of bits received by this NIC per second
|
≥ 0 |
bit/s |
1024(IEC) |
ECS |
1 minute |
net_packetRecv |
(Agent) NIC Packet Receive Rate |
Number of packets received by this NIC per second
|
≥ 0 |
Counts/s |
N/A |
ECS |
1 minute |
net_packetSent |
(Agent) NIC Packet Send Rate |
Number of packets sent by this NIC per second
|
≥ 0 |
Counts/s |
N/A |
ECS |
1 minute |
net_errin |
(Agent) Receive Error Rate |
Percentage of error packets detected by this NIC to the total number of packets received by the NIC per second
|
0-100 |
% |
N/A |
ECS |
1 minute |
net_errout |
(Agent) Transmit Error Rate |
Percentage of transmit errors detected by this NIC per second
|
0-100 |
% |
N/A |
ECS |
1 minute |
net_dropin |
(Agent) Received Packet Drop Rate |
Percentage of packets received by this NIC which were dropped per second
|
0-100 |
% |
N/A |
ECS |
1 minute |
net_dropout |
(Agent) Transmitted Packet Drop Rate |
Percentage of packets transmitted by this NIC which were dropped per second
|
0-100 |
% |
N/A |
ECS |
1 minute |
OS Metric: NTP
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
ntp_offset |
(Agent) NTP Offset |
NTP offset of the monitored object
|
≥ 0 |
ms |
N/A |
ECS |
1 minute |
OS Metric: TCP
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
net_tcp_total |
(Agent) Total TCP Connections |
Total number of TCP connections in all states
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_established |
(Agent) TCP ESTABLISHED Connection |
Number of TCP connections in ESTABLISHED state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_sys_sent |
(Agent) TCP SYS_SENT Connections |
Number of TCP connections that are being requested by the client
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_sys_recv |
(Agent) TCP SYS_RECV Connections |
Number of pending TCP connections received by the server
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_fin_wait1 |
(Agent) TCP FIN_WAIT1 Connections |
Number of TCP connections waiting for ACK packets when the connections are being actively closed by the client
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_fin_wait2 |
(Agent) TCP FIN_WAIT2 Connections |
Number of TCP connections in the FIN_WAIT2 state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_time_wait |
(Agent) TCP TIME_WAIT Connections |
Number of TCP connections in TIME_WAIT state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_close |
(Agent) TCP CLOSE Connections |
Number of closed TCP connections
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_close_wait |
(Agent) TCP CLOSE_WAIT Connections |
Number of TCP connections in CLOSE_WAIT TCP state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_last_ack |
(Agent) TCP LAST_ACK Connections |
Number of TCP connections waiting for ACK packets when the connections are being passively closed by the client
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_listen |
(Agent) TCP LISTEN Connections |
Number of TCP connections in the LISTEN state
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_closing |
(Agent) TCP CLOSING Connections |
Number of TCP connections to be automatically closed by the server and the client at the same time
|
≥ 0 |
Count |
N/A |
ECS |
1 minute |
net_tcp_retrans |
(Agent) TCP Retransmission Rate |
Percentage of packets that are resent
|
0-100 |
% |
N/A |
ECS |
1 minute |
OS Metric: GPU
Metric |
Parameter |
Description |
Value Range |
Unit |
Conversion Rule |
Monitored Object & Dimension |
Monitoring Period (Raw Data) |
---|---|---|---|---|---|---|---|
gpu_status |
(Agent) GPU Health Status |
Overall measurement of the GPU health
|
|
N/A |
N/A |
|
1 minute |
gpu_usage_encoder |
(Agent) Encoding Usage |
Encoding capability usage of the GPU
|
0-100 |
% |
N/A |
|
1 minute |
gpu_usage_decoder |
(Agent) Decoding Usage |
Decoding capability usage of the GPU
|
0-100 |
% |
N/A |
|
1 minute |
gpu_volatile_correctable |
(Agent) Volatile Correctable ECC Errors |
Number of correctable ECC errors since the GPU is reset. The value is reset to 0 each time the GPU is reset.
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_volatile_uncorrectable |
(Agent) Volatile Uncorrectable ECC Errors |
Number of uncorrectable ECC errors since the GPU is reset. The value is reset to 0 each time the GPU is reset.
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_aggregate_correctable |
(Agent) Aggregate Correctable ECC Errors |
Aggregate correctable ECC errors on the GPU
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_aggregate_uncorrectable |
(Agent) Aggregate Uncorrectable ECC Errors |
Aggregate uncorrectable ECC Errors on the GPU
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_retired_page_single_bit |
(Agent) Retired Page Single Bit Errors |
Number of retired page single bit errors, which indicates the number of single-bit pages blocked by the graphics card
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_retired_page_double_bit |
(Agent) Retired Page Double Bit Errors |
Number of retired page double bit errors, which indicates the number of double-bit pages blocked by the graphics card
|
≥ 0 |
Count |
N/A |
|
1 minute |
gpu_performance_state |
(Agent) Performance Status |
GPU performance status
|
P0-P15, P32
|
N/A |
N/A |
|
1 minute |
gpu_usage_mem |
(Agent) GPU Memory Usage |
GPU memory usage
|
0-100 |
% |
N/A |
|
1 minute |
gpu_usage_gpu |
(Agent) GPU Usage |
GPU compute usage
|
0-100 |
% |
N/A |
|
1 minute |
Dimensions
Dimension |
Key |
Value |
---|---|---|
ECS |
instance_id |
Specifies the ECS ID. |
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.