Updated on 2024-08-16 GMT+08:00

Cluster View

A cluster view is generated based on metrics and Prometheus Query Language (PromQL) statements. It displays information about nodes, CPUs, memory, networks, and disks, so you can better monitor a cluster. The following describes cluster resources from two parts: metric description and metric list. In the resource diagrams, larger values in bytes can be converted to ones in MB, KB, or GB.

Metric Description

Cluster metrics include basic metrics, network metrics, and disk metrics. You can view these metrics in the following tables.

Figure 1 Basic metrics
Table 1 Basic metrics

Metric

Unit

Description

Nodes

N/A

The number of nodes in a cluster.

Nodes with Unavailable Disks

N/A

The number of nodes with unavailable disks in a cluster.

Nodes Unavailable

N/A

The number of unready nodes in a cluster.

CPU Usage

%

Total CPUs used by all containers/Total CPU limits in a cluster

CPU Requests Commitment

%

Total CPU requests/Total CPU of the cluster

CPU Limits Commitment

%

Total CPU limits/Total CPU of the cluster

Memory Usage

%

Total memory used by all containers/Total memory limits in a cluster

Memory Requests Commitment

%

Total memory requests/Total memory of the cluster

Memory Limits Commitment

%

Total memory limits/Total memory of the cluster

Pods

N/A

The number of pods in different states (Failed, Pending, Running, Succeeded, and Unknown) in a cluster.

Containers

N/A

The number of containers in different states (Running, Waiting, and Terminated) in a cluster.

Used CPU

core

Total CPUs used by all containers in each namespace.

Used Memory

byte

Total memory used by all containers in each namespace.

Figure 2 Network metrics
Table 2 Network metrics

Metric

Unit

Description

Network Receive Rate

byte/s

Total number of bytes received by all containers in each namespace per second.

Network Transmit Rate

byte/s

Total number of bytes sent by all containers in each namespace per second.

Average Network Receive Rate

byte/s

Average number of bytes received by all containers in each namespace per second.

Average Network Transmit Rate

byte/s

Average number of bytes sent by all containers in each namespace per second.

Packet Receive Rate

packet/s

Total number of packets received by all containers in each namespace per second.

Packet Transmit Rate

packet/s

Total number of packets sent by all containers in each namespace per second.

Packet Loss Rate (Receive)

packet/s

Total number of packets not received by all containers in each namespace per second.

Packet Loss Rate (Transmit)

packet/s

Total number of packets sent from all containers but not received by the recipient in each namespace per second.

Figure 3 Disk metrics
Table 3 Disk metrics

Metric

Unit

Description

Disk I/O Rate (Read + Write)

N/A

Total number of read and write operations on the disk by all containers in each namespace per second.

Disk Throughput (Read + Write)

byte/s

Total number of bytes read from and written to the disk by all containers in each namespace per second.

Metric List

The following is the metric list of the cluster view.

Table 4 Metric description

Metric

Type

Description

kube_pod_container_resource_requests

Gauge

Resource requests of a container.

kube_pod_container_resource_limits

Gauge

Resource limits of a container.

kube_node_status_allocatable

Gauge

Resources allocatable on a node.

kube_pod_status_phase

Gauge

The current phase of a pod.

node_memory_MemAvailable_bytes

Gauge

The amount of memory available on a node.

node_memory_MemTotal_bytes

Gauge

Cumulative count of bytes that can be consumed by a node.

node_cpu_seconds_total

Counter

Seconds the CPUs spent in each mode.

kube_node_info

Gauge

Node information.

kube_node_status_condition

Gauge

Node status.

kube_pod_container_status_waiting

Gauge

Whether the container is in the waiting state.

kube_pod_container_status_terminated

Gauge

Whether the container is in the terminated state.

container_cpu_usage_seconds_total

Counter

Cumulative CPU time consumed by a container, in seconds.

container_memory_rss

Gauge

Resident set size (RSS), which is the amount of space of physical memory (RAM) held by a process.

container_network_receive_bytes_total

Counter

Cumulative count of bytes received by a container.

container_network_transmit_bytes_total

Counter

Cumulative count of bytes sent by a container.

container_network_receive_packets_total

Counter

Cumulative count of packets received by a container.

container_network_transmit_packets_total

Counter

Cumulative count of packets sent by a container.

container_network_receive_packets_dropped_total

Counter

Cumulative count of packets not received by a container.

container_network_transmit_packets_dropped_total

Counter

Cumulative count of packets sent from a container but not received by the recipient.

container_fs_reads_total

Counter

Cumulative count of reads completed by a container.

container_fs_reads_bytes_total

Counter

Cumulative count of bytes read by a container.