Updated on 2024-08-16 GMT+08:00

Overview

By leveraging the experience of our container O&M experts, health diagnosis monitors cluster health to detect cluster faults and identify risks in a timely manner. It also provides rectification suggestions.

Health Diagnosis Coverage

The following figure shows the health diagnosis coverage.

Figure 1 Health diagnosis coverage

Health Diagnosis Capabilities

  • Out-of-the-box diagnosis (without Monitoring Center enabled)
  • Comprehensive check of cluster health (with Monitoring Center enabled)
  • Health scores based on diagnosis results
  • Scheduled inspection and visualized inspection results
  • Inspection history for analyzing fault causes
  • Risk levels and rectification suggestions

Application Scenarios

  • You can check the cluster health before a cluster change and perform health diagnosis at any time.
  • You can set a scheduled inspection to identify cluster risks on schedule.

The following table lists the inspection items.

Dimension

Inspection Item

O&M

  • Cluster O&M
  • Cluster security group configuration
  • Cluster resource planning
  • Cloud service quota

Resources and services

  • Storage add-on (everest) status
  • Logging add-on (log-agent) status
  • Domain name resolution add-on (coredns) status
  • Worker node load status
  • Worker node status
  • Pod configuration
  • Pod workload
  • Pod status

For more information, see Diagnosis Items and Rectification Solutions.