What Is AOM?
Background and Challenges
Previously, most enterprises purchased infrastructure resources and built clusters by themselves. They implemented O&M by focusing on hosts, and built their own application and database monitoring systems. With the popularization of container technologies, more and more enterprises develop applications through the microservice framework. As the number of cloud services increases, enterprises gradually turn to cloud O&M. Cloud O&M poses the following challenges:
- Lots of O&M tools, resulting in high usage and maintenance costs
Cloud O&M has high requirements on personnel skills, O&M tools are hard to configure, and multiple systems need to be maintained at the same time. In addition, the distributed tracing system features high learning and usage costs, but poor stability.
- Difficult problem analysis for distributed applications on the cloud
Lots of problems need to be solved: how to visualize the dependency between microservices, how to improve user experience, how to associate scattered logs for analysis, and how to quickly trace problems.
Application Operations Management (AOM) is a one-stop and multi-dimensional O&M management platform for cloud applications. It monitors applications and related cloud resources in real time, collects and associates resource metrics, logs, and events to analyze application health status, and provides flexible alarm reporting and data visualization. With AOM, you can detect faults in a timely manner and monitor running statuses of applications, resources, and services in real time.
AOM can monitor and manage cloud hosts, storage devices, networks, web containers, and applications hosted in Docker and Kubernetes in a centralized, unified, and visualized manner. This effectively prevents problems and helps O&M personnel locate faults, reducing O&M costs. In addition, AOM provides unified APIs for connecting self-developed monitoring or reporting systems. Unlike traditional monitoring systems, AOM monitors services from the perspective of applications. It meets enterprises' requirements for high efficiency and fast iteration, effectively supports their services through IT, and protects and optimizes their IT assets, so that enterprises can achieve strategic goals.
- Multi-dimensional O&M
Provides one-stop and multi-dimensional O&M platform for mobile apps, application services, middleware, and cloud resources, improving O&M efficiency.
- Health check
Monitors service health status in real time, and detects exceptions or performance bottlenecks in minutes. When a fault occurs, AOM helps you determine the resource, application, or service code that causes the fault.
- Intelligent analysis
Analyzes root causes using Artificial Intelligence (AI)-powered threshold detection and machine learning based on historical data.
- Ease of use
Connects to applications without having to modify code, and collects data in a non-intrusive way. Proactively discovers and monitors applications based on the application running environment, visualizes application data in real time, and facilitates O&M.
- Massive log management
Supports high-performance search and service analysis, automatically associates logs, and quickly filters logs by application, host, file name, or instance.
- Association analysis
Automatically associates applications and resources, and displays data in a panorama view. Metric and alarm data about applications, services, instances, hosts, and transactions is associated for analysis, so that you can easily locate faults.
- Open ecosystem
Opens O&M data query APIs and collection standards, and supports independent development.
Next Article: Architecture
Thank you for your score！Your feedback would help us improve the website.