Updated on 2024-09-06 GMT+08:00

Workload Monitoring

Workload monitoring covers CCE and CCI workloads. It lets you monitor the resource usage, status, and alarms of workloads in a timely manner so that you can quickly handle alarms or events and keep workloads running smoothly. Workloads are classified into Deployments, StatefulSets, DaemonSets, Jobs, and Pods.

Function Introduction

  • The workload monitoring solution is ready to use. After AOM is enabled, the workload status, CPU usage, and physical memory usage of CCE and CCI workloads are displayed on the workload monitoring page by default.
    Figure 1 Workload monitoring
  • For customer-built Kubernetes containers, only Prometheus remote write is supported. After container metrics are written into AOM's metric library, you can query metric data by following the instructions in Metric Browsing.
  • Workload monitoring uses a layer-by-layer drill-down design. The hierarchy is as follows: workload > Pod instance > container > process. You can view their relationships on the UI. Metrics, logs, and alarms are monitored at each layer.
    Figure 2 Workload details
  • In the upper right corner of the workload monitoring page, click Associate Application and perform operations as prompted. CCE workloads can then be reported to AOM and displayed as components in the application tree on the Application Monitoring page.

    To associate applications, first enable Application Insights in Menu Settings. For details, see Menu Settings.
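
The drill-down hierarchy described above (workload > Pod instance > container > process) can be pictured as a nested data structure. The following is a minimal sketch for illustration only: the class names, fields, and the drill_down_paths helper are hypothetical and are not part of AOM or any AOM SDK.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical model of the four drill-down layers named in the text.
# None of these classes come from AOM; they only illustrate the nesting.


@dataclass
class Process:
    name: str


@dataclass
class Container:
    name: str
    processes: List[Process] = field(default_factory=list)


@dataclass
class Pod:
    name: str
    containers: List[Container] = field(default_factory=list)


@dataclass
class Workload:
    name: str
    kind: str  # for example "Deployment", "StatefulSet", "DaemonSet", or "Job"
    pods: List[Pod] = field(default_factory=list)


def drill_down_paths(workload: Workload) -> List[str]:
    """Return one 'workload > pod > container > process' path per process."""
    paths = []
    for pod in workload.pods:
        for container in pod.containers:
            for process in container.processes:
                paths.append(
                    " > ".join(
                        [workload.name, pod.name, container.name, process.name]
                    )
                )
    return paths


# Made-up sample data: one Deployment with one pod, one container, two processes.
example = Workload(
    name="web",
    kind="Deployment",
    pods=[
        Pod(
            "web-0",
            [Container("nginx", [Process("nginx: master"), Process("nginx: worker")])],
        )
    ],
)

for path in drill_down_paths(example):
    print(path)
```

Each printed path corresponds to one route you could follow on the UI from a workload down to a single process.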

Procedure

  1. Log in to the AOM 2.0 console.
  2. In the navigation pane, choose Infrastructure Monitoring > Workload Monitoring.
  3. In the upper right corner of the page, set filter criteria.

    1. Set a time range to view the workloads reported. There are two methods to set a time range:

      Method 1: Use a predefined time label, such as Last hour or Last 6 hours. You can select a time range as required.

      Method 2: Specify a start time and an end time to customize the time range. A custom range can span at most 30 days.

    2. Set the interval for refreshing information. Click the refresh interval icon and select a value from the drop-down list, such as Refresh manually or 1 minute auto refresh.

  4. Click any workload tab to view information, such as workload name, status, cluster, and namespace.

    • In the upper part of the workload list, filter workloads by cluster, pod, or namespace.

      To query namespaces, IAM users with the AOM FullAccess or AOM ReadOnlyAccess permission need to log in to the CCE console, choose Permissions in the navigation pane, and click Add Permission in the upper right corner of the page to add required permissions.

    • Click the refresh icon in the upper right corner to obtain the latest workload information within the time range specified in step 3.
    • Click the column settings icon in the upper right corner and select or deselect columns to display.
    • Click the name of a workload to view its details.
      • On the Pods tab page, view all pod conditions of the workload. Click a pod name to view the resource usage and health status of the pod's containers.
      • On the Monitoring Views tab page, view the resource usage of the workload.
      • On the Logs tab page, view the raw and real-time logs of the workload and analyze them as required. For details, see Log Streams.
      • On the Alarms tab page, view the alarm details of the workload. For details, see Checking Alarms.
      • On the Events tab page, view the event details of the workload. For details, see Viewing Events.
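
The two ways of setting a time range in step 3 (a predefined label, or a custom start and end time of at most 30 days) can be sketched as a small validation routine. This is an illustrative sketch only: the resolve_time_range function, the PRESETS table, and the error messages are assumptions made for this example and do not describe how the console is implemented. Only the two labels named in the procedure are included.

```python
from datetime import datetime, timedelta
from typing import Optional, Tuple

# From the procedure: a custom range can span 30 days at most.
MAX_CUSTOM_RANGE = timedelta(days=30)

# Hypothetical lookup table holding only the two labels named in the text.
PRESETS = {
    "Last hour": timedelta(hours=1),
    "Last 6 hours": timedelta(hours=6),
}


def resolve_time_range(
    now: datetime,
    preset: Optional[str] = None,
    start: Optional[datetime] = None,
    end: Optional[datetime] = None,
) -> Tuple[datetime, datetime]:
    """Return (start, end) for a predefined label or a custom range."""
    # Method 1: a predefined label counts back from the current time.
    if preset is not None:
        if preset not in PRESETS:
            raise ValueError(f"unknown time label: {preset!r}")
        return now - PRESETS[preset], now

    # Method 2: an explicit start and end time.
    if start is None or end is None:
        raise ValueError("a custom range needs both a start and an end time")
    if end <= start:
        raise ValueError("the end time must be later than the start time")
    if end - start > MAX_CUSTOM_RANGE:
        raise ValueError("a custom range can span at most 30 days")
    return start, end


now = datetime(2024, 9, 6, 12, 0, 0)
print(resolve_time_range(now, preset="Last 6 hours"))
print(resolve_time_range(now, start=now - timedelta(days=7), end=now))
```

A custom range longer than 30 days raises ValueError in this sketch, mirroring the limit stated in Method 2.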