- What's New
- Function Overview
- Service Overview (2.0)
- Getting Started (2.0)
-
User Guide (2.0)
- Introduction
- Access Center
- Dashboard
- Alarm Management
- Metric Browsing
- Log Analysis
-
Prometheus Monitoring
- Prometheus Monitoring
- Creating Prometheus Instances
- Managing Prometheus Instances
- Configuring a Recording Rule
- Metric Management
- Dashboard Monitoring
-
Access Guide
- Connecting Node Exporter
-
Exporter Access in the VM Scenario
- Access Overview
- MySQL Component Access
- Redis Component Access
- Kafka Component Access
- Nginx Component Access
- MongoDB Component Access
- Consul Component Access
- HAProxy Component Access
- PostgreSQL Component Access
- Elasticsearch Component Access
- RabbitMQ Component Access
- Access of Other Components
- Custom Plug-in Access
- Other Operations
- Obtaining the Service Address of a Prometheus Instance
- Viewing Prometheus Instance Data Through Grafana
- Reading Prometheus Instance Data Through Remote Read
- Reporting Self-Built Prometheus Instance Data to AOM
- Resource Usage Statistics
- Business Monitoring (Beta)
- Infrastructure Monitoring
- Settings
- Remarks
- Permissions Management
- Auditing
- Subscribing to AOM 2.0
- Upgrading to AOM 2.0
- Best Practices (2.0)
-
FAQs (2.0)
- Overview
- Dashboard
- Alarm Management
- Log Analysis
- Prometheus Monitoring
- Infrastructure Monitoring
-
Collection Management
- Are ICAgent and UniAgent the Same?
- What Can I Do If an ICAgent Is Offline?
- Why Is an Installed ICAgent Displayed as "Abnormal" on the Agent Management Page?
- Why Can't I View the ICAgent Status After It Is Installed?
- Why Can't AOM Monitor CPU and Memory Usage After ICAgent Is Installed?
- How Do I Obtain an AK/SK?
- FAQs About ICAgent Installation
- How Do I Enable the Nginx stub_status Module?
- Other FAQs
-
API Reference
- Before You Start
- API Overview
- Calling APIs
-
APIs
-
Alarm
- Querying the Event Alarm Rule List
- Adding an Event Alarm Rule
- Modifying an Event Alarm Rule
- Deleting an Event Alarm Rule
- Obtaining the Alarm Sending Result
- Deleting a Silence Rule
- Adding a Silence Rule
- Modifying a Silence Rule
- Obtaining the Silence Rule List
- Querying an Alarm Action Rule Based on Rule Name
- Adding an Alarm Action Rule
- Deleting an Alarm Action Rule
- Modifying an Alarm Action Rule
- Querying the Alarm Action Rule List
- Querying Metric or Event Alarm Rules
- Adding or Modifying Metric or Event Alarm Rules
- Deleting Metric or Event Alarm Rules
- Querying Events and Alarms
- Counting Events and Alarms
- Reporting Events and Alarms
-
Monitoring
- Querying Time Series Objects
- Querying Time Series Data
- Querying Metrics
- Querying Monitoring Data
- Adding Monitoring Data
- Adding or Modifying One or More Service Discovery Rules
- Deleting a Service Discovery Rule
- Querying Existing Service Discovery Rules
- Adding a Threshold Rule
- Querying the Threshold Rule List
- Modifying a Threshold Rule
- Deleting a Threshold Rule
- Querying a Threshold Rule
- Deleting Threshold Rules in Batches
-
Prometheus Monitoring
- Querying Expression Calculation Results in a Specified Period Using the GET Method
- (Recommended) Querying Expression Calculation Results in a Specified Period Using the POST Method
- Querying the Expression Calculation Result at a Specified Time Point Using the GET Method
- (Recommended) Querying Expression Calculation Results at a Specified Time Point Using the POST Method
- Querying Tag Values
- Obtaining the Tag Name List Using the GET Method
- (Recommended) Obtaining the Tag Name List Using the POST Method
- Querying Metadata
- Log
- Prometheus Instance
- Configuration Management
-
Alarm
- Historical APIs
- Examples
- Permissions Policies and Supported Actions
- Appendix
- SDK Reference
-
Service Overview (1.0)
- What Is AOM?
- Product Architecture
- Functions
- Application Scenarios
- Edition Differences
-
Metric Overview
- Introduction
- Network Metrics and Dimensions
- Disk Metrics and Dimensions
- Disk Partition Metrics
- File System Metrics and Dimensions
- Host Metrics and Dimensions
- Cluster Metrics and Dimensions
- Container Metrics and Dimensions
- VM Metrics and Dimensions
- Instance Metrics and Dimensions
- Service Metrics and Dimensions
- Restrictions
- Privacy and Sensitive Information Protection Statement
- Relationships Between AOM and Other Services
- Basic Concepts
- Permissions
- Billing
- Getting Started (1.0)
-
User Guide (1.0)
- Overview
- Subscribing to AOM
- Permissions Management
- Connecting Resources to AOM
- Monitoring Overview
- Alarm Management
- Resource Monitoring
- Log Management
- Configuration Management
- Auditing
- Upgrading to AOM 2.0
- Best Practices (1.0)
-
FAQs (1.0)
- User FAQs
-
Consultation FAQs
- What Are the Usage Restrictions of AOM?
- What Are the Differences Between AOM and APM?
- How Do I Distinguish Alarms from Events?
- What Is the Relationship Between the Time Range and Statistical Cycle?
- Does AOM Display Logs in Real Time?
- How Can I Do If I Cannot Receive Any Email Notification After Configuring a Threshold Rule?
- Why Are Connection Channels Required?
-
Usage FAQs
- What Can I Do If I Do Not Have the Permission to Access SMN?
- What Can I Do If Resources Are Not Running Properly?
- How Do I Set the Full-Screen Online Duration?
- What Can I Do If the Log Usage Reaches 90% or Is Full?
- How Do I Obtain an AK/SK?
- How Can I Check Whether a Service Is Available?
- Why Is the Status of an Alarm Rule Displayed as "Insufficient"?
- Why the Status of a Workload that Runs Normally Is Displayed as "Abnormal" on the AOM Page?
- How Do I Create the apm_admin_trust Agency?
- What Is the Billing Policy of Logs?
- Why Can't I See Any Logs on the Console?
- What Can I Do If an ICAgent Is Offline?
- Why Can't the Host Be Monitored After ICAgent Is Installed?
- Why Is "no crontab for root" Displayed During ICAgent Installation?
- Why Can't I Select an OBS Bucket When Configuring Log Dumping on AOM?
- Why Can't Grafana Display Content?
Show all
Host Monitoring
Hosts include the Elastic Cloud Server (ECS) and Bare Metal Server (BMS). AOM can monitor the hosts purchased during CCE and ServiceStage cluster creation as well as those purchased in non-CCE and -ServiceStage environments. (The purchased hosts must meet the OS and version requirements, and ICAgents must be installed on them. Otherwise, AOM cannot monitor them.) In addition, hosts support IPv4 addresses.
Host monitoring displays resource usage, trends, and alarms, so that you can quickly respond to malfunctioning hosts and handle errors to ensure smooth host running.
Precautions
- A maximum of five tags can be added to a host, and each tag must be unique.
- The same tag can be added to different hosts.
Procedure
- Log in to the AOM 2.0 console.
- In the navigation pane, choose Infrastructure Monitoring > Host Monitoring.
- Set filter criteria (such as the running status, host type, host name, and IP address) above the host list.
- You can enable or disable Hide master host. By default, this option is enabled.
- Click
next to Hide master host to synchronize host information.
- In the upper right corner of the page, set filter criteria.
- Set a time range to view the hosts reported. There are two methods to set a time range:
- Method 1: Use a predefined time label, such as Last 30 minutes, Last hour, Last 6 hours, Last day, or Last week. Select one as required.
- Method 2: Specify the start time and end time (max. 30 days).
- Set the interval for refreshing information. Click
and select a value from the drop-down list as required, such as Refresh manually, 30 seconds auto refresh, 1 minute auto refresh, or 5 minutes auto refresh.
- Click
in the upper right corner and select or deselect Tags.
- Perform the following operations if needed:
- Adding an alias
If a host name is too complex to identify, you can add an alias, which makes it easy to identify a host as required.
In the host list, click
in the Operation column of the target host, enter an alias, and click OK. The added alias can be modified but cannot be deleted.
- Adding a tag
Tags are identifiers of hosts. You can manage hosts using tags. After a tag is added, you can quickly identify and select a host.
In the host list, click
in the Operation column of the target host. In the displayed dialog box, enter a tag key and value, and click
and OK.
- Synchronizing host data
In the host list, locate the target host and click
in the Operation column to synchronize host information.
- Adding an alias
- Set filter criteria to search for the desired host.
NOTE:
Hosts cannot be searched by alias.
- Click a host name. On the displayed host details page, you can view the running status and ID of the host.
- Click any tab. In the list, you can monitor the instance resource usage and health status, and information about common resources such as GPUs and NICs.
- On the Process List tab page of the ECS host, you can view the process status and IP address of the host.
- In the search box in the upper right corner of the process list, you can set search criteria such as the process name to filter processes.
- Click
in the upper right corner to obtain the latest process information within the specified time range.
- On the Pods tab page of the CCE host, you can view the pod status and node IP address.
- Click a pod name to view details about the container and process of the pod.
- In the search box in the upper right corner of the pod list, you can set search criteria such as pod names to filter pods.
- Click
in the upper right corner to obtain the latest pod information within the specified time range.
- On the Monitoring Views tab page, view key metric graphs of the host.
- On the File Systems tab page, view the basic information about the file system of the host. Click a disk file partition to monitor its metrics on the Monitoring Views page.
- On the Disks tab page, view the basic information about the disks of the host. Click a disk to monitor its metrics on the Monitoring Views page.
- On the Disk Partitions tab page, view the disk partition information about the host. Click a disk partition to monitor its metrics on the Monitoring Views page.
- Click the NICs tab to view the basic information about the NICs of the host. Click a NIC to monitor its metrics on the Monitoring Views page.
- Click the GPUs tab to view the basic information about the GPUs of the host. Click a GPU to monitor its metrics on the Monitoring Views page.
- On the Events tab page, view the event details of the host. For details, see Viewing Events.
- On the Alarms tab page, view the alarm details of the host. For details, see Checking Alarms.
- On the File Systems, Disks, Disk Partitions, NICs, or GPUs tab page, click
in the upper right corner of the resource list and select or deselect items to display.
NOTE:
Disk partitions are supported by CentOS 7.x and EulerOS 2.5.
- On the Process List tab page of the ECS host, you can view the process status and IP address of the host.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.