Updated on 2024-04-15 GMT+08:00

Monitoring Overview

The O&M page provides a full-link, multi-layer, and one-stop O&M page for resources, applications, and user experience. It displays the following cards: infrastructure monitoring, application monitoring, alarm statistics, component monitoring (CPU and memory), host monitoring (disk), host monitoring (CPU and memory), container instance monitoring (CPU and memory), and host monitoring (network).

Infrastructure Monitoring

This card mainly displays infrastructure metrics. You can select one cluster to view its information. When you select a cluster, the following information is displayed:

  • Host running status, CPU usage, and physical memory usage.
  • Trend graph of network traffic in the last 30 minutes. The values of each point in the graph respectively indicate the total downlink/uplink rates of selected clusters in one minute. The values displayed above the trend graph respectively indicate the total downlink/uplink rates of the cluster at the latest time point.
  • Trend graph of CPU and memory usage in the last 30 minutes. The values of each point in the graph respectively indicate the average CPU and memory usage of the cluster in one minute. The values displayed above the trend graph respectively indicate the average CPU and memory usage of the cluster at the latest time point.

Application Monitoring

This card mainly displays application metrics:

  1. Running status of applications and components.
  2. The following information is displayed when you select an application:
    • Trend graph of network traffic in the last 30 minutes. The values of each point in the graph respectively indicate the receive rate (BPS) and send rate (BPS) of the selected application in one minute. The values above the graph respectively indicate the receive rate (BPS) and send rate (BPS) of the selected application at the latest time point.
    • Trend graph of CPU and memory usage in the last 30 minutes. The values of each point in the graph respectively indicate the CPU and memory usage of the selected application in one minute. The values above the graph respectively indicate the CPU and memory usage of the selected application at the latest time point.

Alarm Statistics

This card mainly displays alarms, alarm rules, and trends of alarms and hosts.

Component Monitoring (CPU and Memory)

This card mainly displays:

  • The top 5 components with high CPU and memory usage in the last minute.
  • Trend graph of the CPU and memory usage of the selected component in the last hour. The values of each point in the graph respectively indicate the average CPU and memory usage of the component in one minute.
  • CPU and memory usage of the selected component at the latest time point, which is displayed above the trend graph.
  • Option Hide system components, which can be selected to hide system components.

Host Monitoring (Disk)

This card mainly displays:

  • The top 5 hosts with high disk read/write rate in the last minute.
  • Trend graph of the disk read/write rate of the selected host in the last hour. The values of each point in the graph respectively indicate the average disk read/write rate of the selected host in one minute.
  • Disk read/write rate of the selected host at the latest time point, which is displayed above the trend graph.

Host Monitoring (CPU and Memory)

This card mainly displays:

  • The top 5 hosts with high CPU and memory usage in the last minute.
  • Trend graph of the CPU and memory usage of the selected host in the last hour. The values of each point in the graph respectively indicate the average CPU and memory usage of the host in one minute.
  • CPU and memory usage of the selected host at the latest time point, which is displayed above the trend graph.

Container Instance Monitoring (CPU and Memory)

This card mainly displays:

  • The top 5 container instances with high CPU and memory usage in the last minute.
  • Trend graph of the CPU and memory usage of the selected container instance in the last hour. The values of each point in the graph respectively indicate the average CPU and memory usage of the container instance in one minute.
  • CPU and memory usage of the selected container instance at the latest time point, which is displayed above the trend graph.
  • Hide system instances option, which can be selected to hide system instances.

Host Monitoring (Network)

This card mainly displays:

  • The top 5 hosts with high uplink/downlink network rate in the last minute.
  • Trend graph of the uplink/downlink network rate of the selected host in the last hour. The values of each point in the graph respectively indicate the average uplink/downlink network rate of the selected host in one minute.
  • Uplink/downlink network rate of the selected host at the latest time point, which is displayed above the trend graph.

More Operations

You can also perform the operations listed in Table 1.

Table 1 Related operations

Operation

Description

Adding a card to favorites

To hide a card, click in the upper right corner of the card and choose Add to Favorites. After a card is added to favorites, it is hidden from the O&M page. To view the card later, obtain it from favorites.

Enlarging a graph

Click in the upper right corner of the metric graph.

Drilling down blue texts

Click the blue texts, such as Host, Application, or Component to drill down to the details page.