Help Center> Elastic Cloud Server> User Guide> Monitoring> OS Monitoring Metrics Supported by ECSs with the Agent Installed

OS Monitoring Metrics Supported by ECSs with the Agent Installed

After installing the agent on an ECS, you can view its OS monitoring metrics. Monitoring data is collected every 1 minute.

CPU, CPU load, memory, disk, disk I/O, file system, GPU, and NIC metrics can be monitored.

After the agent is installed, you can view the metrics listed in the following table by default, regardless of ECS types and OSs.

Table 1 CPU metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

cpu_usage_idle

(Agent) Idle CPU Usage

Percentage of time that CPU is idle

Unit: percent

  • Linux: Check metric value changes in file /proc/stat in a collection period.
  • Windows: Run the top command to check the %Cpu(s) id value.

0-100

ECS

1 minute

cpu_usage_other

(Agent) Other Process CPU Usage

Percentage of time that the CPU is used by other processes

Unit: percent

  • Linux: Other Process CPU Usage = 1- Idle CPU Usage - Kernel Space CPU Usage - User Space CPU Usage
  • Windows: Other Process CPU Usage = 1- Idle CPU Usage - Kernel Space CPU Usage - User Space CPU Usage

0-100

ECS

1 minute

cpu_usage_system

(Agent) Kernel Space CPU Usage

Percentage of time that the CPU is used by kernel space

Unit: percent

  • Linux: Check metric value changes in file /proc/stat in a collection period. Run the top command to check the %Cpu(s) sy value.
  • Windows: Obtain the metric value using the Windows API GetSystemTimes.

0-100

ECS

1 minute

cpu_usage_user

(Agent) User Space CPU Usage

Percentage of time that the CPU is used by user space

Unit: percent

  • Linux: Check metric value changes in file /proc/stat in a collection period. Run the top command to check the %Cpu(s) us value.
  • Windows: Obtain the metric value using the Windows API GetSystemTimes.

0-100

ECS

1 minute

cpu_usage

(Agent) CPU Usage

CPU usage of the monitored object

Unit: percent

  • Linux: Check metric value changes in file /proc/stat in a collection period. Run the top command to check the %Cpu(s) value.
  • Windows: Obtain the metric value using the Windows API GetSystemTimes.

0-100

ECS

1 minute

cpu_usage_nice

(Agent) Nice Process CPU Usage

Percentage of time that the CPU is in user mode with low-priority processes which can easily be interrupted by higher-priority processes

Unit: percent

  • Linux: Check metric value changes in file /proc/stat in a collection period. Run the top command to check the %Cpu(s) ni value.
  • Windows is not supported currently.

0-100

ECS

1 minute

cpu_usage_iowait

(Agent) iowait Process CPU Usage

Percentage of time that the CPU is waiting for I/O operations to complete

Unit: percent

  • Linux: Check metric value changes in file /proc/stat in a collection period. Run the top command to check the %Cpu(s) wa value.
  • Windows is not supported currently.

0-100

ECS

1 minute

cpu_usage_irq

(Agent) CPU Interrupt Time

Percentage of time that the CPU is servicing interrupts

Unit: percent

  • Linux: Check metric value changes in file /proc/stat in a collection period. Run the top command to check the %Cpu(s) hi value.
  • Windows is not supported currently.

0-100

ECS

1 minute

cpu_usage_softirq

(Agent) CPU Software Interrupt Time

Percentage of time that the CPU is servicing software interrupts

Unit: percent

  • Linux: Check metric value changes in file /proc/stat in a collection period. Run the top command to check the %Cpu(s) si value.
  • Windows is not supported currently.

0-100

ECS

1 minute

Table 2 CPU load metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

load_average1

(Agent) 1-Minute Load Average

CPU load averaged from the last 1 minute

Linux: Obtain the metric value from the number of logic CPUs in load1/ in file /proc/loadavg. Run the top command to check the load1 value.

≥ 0

ECS

1 minute

load_average5

(Agent) 5-Minute Load Average

CPU load averaged from the last 5 minutes

Linux: Obtain the metric value from the number of logic CPUs in load5/ in file /proc/loadavg. Run the top command to check the load5 value.

≥ 0

ECS

1 minute

load_average15

(Agent) 15-Minute Load Average

CPU load averaged from the last 15 minutes

Linux: Obtain the metric value from the number of logic CPUs in load15/ in file /proc/loadavg. Run the top command to check the load15 value.

≥ 0

ECS

1 minute

The Windows OS does not support the CPU load metrics.

Table 3 Memory metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

mem_available

(Agent) Available Memory

Amount of memory that is available and can be given instantly to processes

Unit: GB

  • Linux: Obtain the metric value from MemAvailable in file /proc/meminfo. If MemAvailable is not available in /proc/meminfo, it equals to MemFree+Buffers+Cached.
  • Windows: It is calculated by available memory minuses used memory. The value is obtained by calling the Windows API GlobalMemoryStatusEx.

≥ 0

ECS

1 minute

mem_usedPercent

(Agent) Memory Usage

Memory usage of the monitored object

Unit: percent

  • Linux: Obtain the metric value from the /proc/meminfo file (MemTotal-MemAvailable)/MemTotal.
  • Windows: The calculation formula is as follows: Used memory size/Total memory size*100%.

0-100

ECS

1 minute

mem_free

(Agent) Idle Memory

Amount of memory that is not being used

Unit: GB

  • Linux: Obtain the metric value from /proc/meminfo.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

mem_buffers

(Agent) Buffer

Amount of memory that is being used for buffers

Unit: GB

  • Linux: Obtain the metric value from /proc/meminfo. Run the top command to check the KiB Mem:buffers value.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

mem_cached

(Agent) Cache

Amount of memory that is being used for file caches

Unit: GB

  • Linux: Obtain the metric value from /proc/meminfo. Run the top command to check the KiB Swap:cached Mem value.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

Table 4 Disk metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

mountPointPrefix_disk_free

(Agent) Available Disk Space

Free space on the disks

Unit: GB

  • Linux: Run the df -h command to check the value in the Avail column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).
  • Windows: Use the WMI interface to call GetDiskFreeSpaceExW API to obtain disk space data. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

≥ 0

ECS

1 minute

mountPointPrefix_disk_total

(Agent) Disk Storage Capacity

Total space on the disks, including used and free

Unit: GB

  • Linux: Run the df -h command to check the value in the Size column.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows: Use the WMI interface to call GetDiskFreeSpaceExW API to obtain disk space data. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

≥ 0

ECS

1 minute

mountPointPrefix_disk_used

(Agent) Used Disk Space

Used space on the disks

Unit: GB

  • Linux: Run the df -h command to check the value in the Used column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).
  • Windows: Use the WMI interface to call GetDiskFreeSpaceExW API to obtain disk space data. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

≥ 0

ECS

1 minute

mountPointPrefix_disk_usedPercent

(Agent) Disk Usage

Percentage of total disk space that is used, which is calculated as follows: Disk Usage = Used Disk Space/Disk Storage Capacity

Unit: percent

  • Linux: It is calculated as follows: Used/Size. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).
  • Windows: Use the WMI interface to call GetDiskFreeSpaceExW API to obtain disk space data. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

0-100

ECS

1 minute

Table 5 Disk I/O metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

mountPointPrefix_disk_agt_read_bytes_rate

(Agent) Disks Read Rate

Number of bytes read from the monitored disk per second

Unit: byte/s

  • Linux:

    The disk read rate is calculated based on the data changes in the sixth column of the corresponding device in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows:
    • Use Win32_PerfFormattedData_PerfDisk_LogicalDisk object in the WMI to obtain disk I/O data.
    • The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).
    • When the CPU usage is high, monitoring data obtaining timeout may occur and result in the failure of obtaining monitoring data.

≥ 0

ECS

1 minute

mountPointPrefix_disk_agt_read_requests_rate

(Agent) Disks Read Requests

Number of read requests sent to the monitored disk per second

Unit: request/s

  • Linux:

    The disk read requests are calculated based on the data changes in the fourth column of the corresponding device in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows:
    • Use Win32_PerfFormattedData_PerfDisk_LogicalDisk object in the WMI to obtain disk I/O data.
    • The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).
    • When the CPU usage is high, monitoring data obtaining timeout may occur and result in the failure of obtaining monitoring data.

≥ 0

ECS

1 minute

mountPointPrefix_disk_agt_write_bytes_rate

(Agent) Disks Write Rate

Number of bytes written to the monitored disk per second

Unit: byte/s

  • Linux:

    The disk write rate is calculated based on the data changes in the tenth column of the corresponding device in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows:
    • Use Win32_PerfFormattedData_PerfDisk_LogicalDisk object in the WMI to obtain disk I/O data.
    • The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).
    • When the CPU usage is high, monitoring data obtaining timeout may occur and result in the failure of obtaining monitoring data.

≥ 0

ECS

1 minute

mountPointPrefix_disk_agt_write_requests_rate

(Agent) Disks Write Requests

Number of write requests sent to the monitored disk per second

Unit: request/s

  • Linux:

    The disk write requests are calculated based on the data changes in the eighth column of the corresponding device in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows:
    • Use Win32_PerfFormattedData_PerfDisk_LogicalDisk object in the WMI to obtain disk I/O data.
    • The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).
    • When the CPU usage is high, monitoring data obtaining timeout may occur and result in the failure of obtaining monitoring data.

≥ 0

ECS

1 minute

disk_readTime

(Agent) Average Read Request Time

Average amount of time that read requests have waited on the disks

Unit: ms/count

  • Linux:

    The average read request time is calculated based on the data changes in the seventh column of the corresponding device in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows is not supported currently.

≥ 0

ECS

1 minute

disk_writeTime

(Agent) Average Write Request Time

Average amount of time that write requests have waited on the disks

Unit: ms/count

  • Linux:

    The average write request time is calculated based on the data changes in the eleventh column of the corresponding device in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows is not supported currently.

≥ 0

ECS

1 minute

disk_ioUtils

(Agent) Disk I/O Usage

Percentage of the time that the disk has had I/O requests queued to the total disk operation time

Unit: percent

  • Linux:

    The disk I/O usage is calculated based on the data changes in the thirteenth column of the corresponding device in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows is not supported currently.

0-100

ECS

1 minute

disk_queue_length

(Agent) Disk Queue Length

Average number of read or write requests queued up for completion for the monitored disk in the monitoring period

Unit: count

  • Linux:

    The average disk queue length is calculated based on the data changes in the fourteenth column of the corresponding device in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows is not supported currently.

≥ 0

ECS

1 minute

disk_write_bytes_per_operation

(Agent) Average Disk Write Size

Average number of bytes in an I/O write for the monitored disk in the monitoring period

Unit: byte/op

  • Linux:

    The average disk write size is calculated based on the data changes in the tenth column of the corresponding device to divide that of the eighth column in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows is not supported currently.

≥ 0

ECS

1 minute

disk_read_bytes_per_operation

(Agent) Average Disk Read Size

Average number of bytes in an I/O read for the monitored disk in the monitoring period

Unit: byte/op

  • Linux:

    The average disk read size is calculated based on the data changes in the sixth column of the corresponding device to divide that of the fourth column in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows is not supported currently.

≥ 0

ECS

1 minute

disk_io_svctm

(Agent) Disk I/O Service Time

Average time in an I/O read or write for the monitored disk in the monitoring period

Unit: ms/op

  • Linux:

    The average disk I/O service time is calculated based on the data changes in the thirteenth column of the corresponding device to divide the sum of data changes in the fourth and eighth columns in file /proc/diskstats in a collection period.

    The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

  • Windows is not supported currently.

≥ 0

ECS

1 minute

Table 6 File system metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

disk_fs_rwstate

(Agent) File System Read/Write Status

Read and write status of the mounted file system of the monitored object Possible values are 0 (read and write) and 1 (read only).

Linux: Check file system information in the fourth column in file /proc/mounts.

0 or 1

ECS

1

disk_inodesTotal

(Agent) Disk inode Total

Total number of index nodes on the disk

Linux: Run the df -i command to check the value in the Inodes column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

≥ 0

ECS

1 minute

disk_inodesUsed

(Agent) Total inode Used

Number of used index nodes on the disk

Linux: Run the df -i command to check the value in the IUsed column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

≥ 0

ECS

1 minute

disk_inodesUsedPercent

(Agent) Percentage of Total inode Used

Number of used index nodes on the disk

Unit: percent

Linux: Run the df -i command to check the value in the IUse% column. The path of the mount point prefix cannot exceed 64 characters. It must start with a letter, and contain only digits, letters, hyphens (-), dots (.), and swung dashes (~).

0-100

ECS

1 minute

The Windows OS does not support the file system metrics.

Table 7 NIC metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

net_bitRecv

(Agent) Outbound Bandwidth

Number of bits sent by this NIC per second

Unit: bit/s

  • Linux: Check metric value changes in file /proc/net/dev in a collection period.
  • Windows: Use the MibIfRow object in the WMI to obtain network metric data.

≥ 0

ECS

1 minute

net_bitSent

(Agent) Inbound Bandwidth

Number of bits received by this NIC per second

Unit: bit/s

  • Linux: Check metric value changes in file /proc/net/dev in a collection period.
  • Windows: Use the MibIfRow object in the WMI to obtain network metric data.

≥ 0

ECS

1 minute

net_packetRecv

(Agent) NIC Packet Receive Rate

Number of packets received by this NIC per second

Unit: count/s

  • Linux: Check metric value changes in file /proc/net/dev in a collection period.
  • Windows: Use the MibIfRow object in the WMI to obtain network metric data.

≥ 0

ECS

1 minute

net_packetSent

(Agent) NIC Packet Send Rate

Number of packets sent by this NIC per second

Unit: count/s

  • Linux: Check metric value changes in file /proc/net/dev in a collection period.
  • Windows: Use the MibIfRow object in the WMI to obtain network metric data.

≥ 0

ECS

1 minute

net_errin

(Agent) Receive Error Rate

Percentage of receive errors detected by this NIC per second

Unit: percent

  • Linux: Check metric value changes in file /proc/net/dev in a collection period.
  • Windows is not supported currently.

0-100

ECS

1 minute

net_errout

(Agent) Transmit Error Rate

Percentage of transmit errors detected by this NIC per second

Unit: percent

  • Linux: Check metric value changes in file /proc/net/dev in a collection period.
  • Windows is not supported currently.

0-100

ECS

1 minute

net_dropin

(Agent) Received Packet Drop Rate

Percentage of packets received by this NIC which were dropped per second

Unit: percent

  • Linux: Check metric value changes in file /proc/net/dev in a collection period.
  • Windows is not supported currently.

0-100

ECS

1 minute

net_dropout

(Agent) Transmitted Packet Drop Rate

Percentage of packets transmitted by this NIC which were dropped per second

Unit: percent

  • Linux: Check metric value changes in file /proc/net/dev in a collection period.
  • Windows is not supported currently.

0-100

ECS

1 minute

Table 8 Process metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

proc_pHashId_cpu

CPU Usage

CPU consumed by a process. pHashId (process name and process ID) is the value of md5.

Unit: percent

  • Linux: Check metric value changes in file /proc/pid/stat.
  • Windows: Call the Windows API GetProcessTimes to obtain the CPU usage of the process.

0-100

ECS

1 minute

proc_pHashId_mem

Memory Usage

Memory consumed by a process. pHashId (process name and process ID) is the value of md5.

Unit: percent

  • Linux:

    RSS*PAGESIZE/MemTotal

    Obtain the RSS value by checking the second column of file /proc/pid/statm.

    Obtain the PAGESIZE value by running the getconf PAGESIZE command.

    Obtain the MemTotal value by checking file /proc/meminfo.

  • Windows: Call the Windows API procGlobalMemoryStatusEx to obtain the total memory size. Call GetProcessMemoryInfo to obtain the used memory size. Use the used memory size to divide the total memory size to get the memory usage.

0-100

ECS

1 minute

proc_pHashId_file

Opened Files

Number of files opened by a process. pHashId (process name and process ID) is the value of md5.

  • Linux: Run the ls -l /proc/pid/fd command to view the number of opened files.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

proc_running_count

Running Processes

Number of processes that are running

  • Linux: You can obtain the state of each process by checking the Status value in the /proc/pid/status file, and then collect the total number of processes in each state.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

proc_idle_count

Idle Processes

Number of processes that are idle

  • Linux: You can obtain the state of each process by checking the Status value in the /proc/pid/status file, and then collect the total number of processes in each state.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

proc_zombie_count

Zombie Processes

Number of zombie processes

  • Linux: You can obtain the state of each process by checking the Status value in the /proc/pid/status file, and then collect the total number of processes in each state.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

proc_blocked_count

Blocked Processes

Number of processes that are blocked

  • Linux: You can obtain the state of each process by checking the Status value in the /proc/pid/status file, and then collect the total number of processes in each state.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

proc_sleeping_count

Sleeping Processes

Number of processes that are sleeping

  • Linux: You can obtain the state of each process by checking the Status value in the /proc/pid/status file, and then collect the total number of processes in each state.
  • Windows is not supported currently.

≥ 0

ECS

1 minute

proc_total_count

Total Processes

Total number of processes on the monitored object

  • Linux: You can obtain the state of each process by checking the Status value in the /proc/pid/status file, and then collect the total number of processes in each state.
  • Windows: Obtain the total number of processes by using the system process status support module psapi.dll.

≥ 0

ECS

1 minute

Table 9 GPU Metrics

Metric

Parameter

Description

Value Range

Monitored Object

Monitoring Period (Raw Data)

slot0_gpu_performance_state

Performance Status

GPU performance status of the monitored object

Unit: none

Linux: Run the nvidia-smi command to check the value in the Perf column.

P0-P15, P32

P0 indicates the maximum performance status. P15 indicates the minimum performance status. P32 indicates the unknown status.

GPU-accelerated ECS

1 minute

slot0_gpu_usage_mem

Memory Usage

Memory usage of the monitored object

Unit: percent

Linux: Run the nvidia-smi command to check the value in the Memory-Usage column.

0-100

GPU-accelerated ECS

1 minute

slot0_gpu_usage_gpu

GPU Usage

GPU usage of the monitored object

Unit: percent

Linux: Run the nvidia-smi command to check the value in the GPU-Util column.

0-100

GPU-accelerated ECS

1 minute

The Windows OS does not support the GPU metrics.