Updated on 2024-02-01 GMT+08:00

What Is AOM?

Application Operations Management (AOM) is a one-stop, multi-dimensional O&M management platform for cloud applications. It integrates observable data sources, such as Cloud Eye, Log Tank Service (LTS), Application Performance Management (APM), real user experience, and backend link data. It also provides unified application resource management, automated O&M, and one-stop observability analysis solutions. With AOM, you can detect faults in a timely manner, monitor applications, resources, and services in real time, and improve automated O&M capability and efficiency.

Figure 1 AOM architecture
  • Hosting & Running

    AOM seamlessly interconnects with multiple upper-layer O&M services. It can quickly collect metric data from services such as ServiceStage, FunctionGraph, and Cloud Service Engine (CSE), and display them in real time.

  • Observability Analysis

    Provides observable analysis capabilities such as exception detection, historical data analysis, performance analysis, correlation analysis, and scenario-based analysis through transaction/container/Prometheus monitoring based on the four-layer (infrastructure/middleware/application/business) metric system.

  • Automation

    Provides functions such as batch disk cleanup, job orchestration, and script execution, and standardizes and automates routine O&M operations.

  • CMDB

    Provides functions such as application management and resource search, centrally manages resources and applications, and provides accurate and consistent resource configuration data for upper-layer O&M services in a timely manner.

  • Collection Management

    Manages plug-ins centrally and issue instructions for operation such as script delivery and execution.

  • Openness

    Supports reporting of native Prometheus Query Language (PromQL) data, data reporting through APIs, data viewing through Grafana, and data dumping through Kafka.