Updated on 2025-05-22 GMT+08:00

OPS06-03 Developing and Implementing Observability Indicators

  • Risk level

    High

  • Key strategies

    Metrics measure the data within a period. Observability metrics focus on the discovery rate, grading accuracy, demarcation duration, coverage, validity, and consistency. Observability design specifications, design requirements, and O&M management requirements are released centrally.

  • Design suggestions

    The overall technical solution will be standardized and released. Architects of each service system should adhere to this standard during design to ensure capabilities can be utilized in design, running, and high availability scenarios.

    Observability metrics can be implemented using monitoring tools, and alarms can be sent when an exception occurs. Many monitoring tools are available, such as Prometheus, Grafana, Zabbix, and Huawei Cloud Cloud Eye. These tools can periodically collect metrics, provide visualized metric reports, and send alarms to help organizations detect problems in a timely manner.

    For details, see Best Practices of Cloud Eye.