Viewing Alarms of an MRS Cluster
Alarms and events are important mechanisms for ensuring the stability, reliability, and performance of MRS clusters.
An alarm is a system-generated notification that signals an abnormal condition or fault requiring attention. It is triggered by analyzing system events and requires manual intervention from a user or automatic handling by the system. You can view alarms reported by components on the management console or MRS Manager. You can also define alarm thresholds for threshold-related monitoring metrics in the cluster.
For alarms that can be automatically cleared, the system clears them as soon as the predefined conditions are met. If faults have been rectified and the alarms cannot be automatically cleared, you can manually clear the alarms.
You can view up to 100,000 latest alarms (including uncleared, manually cleared, and automatically cleared alarms) on MRS Manager. If the number of cleared alarms exceeds 100,000 and is about to reach 110,000, the system automatically dumps the earliest 10,000 cleared alarms to the dump path.
The alarm dump directory is as follows. The system automatically generates the directory when alarms are dumped for the first time.
- Clusters of versions earlier than MRS 3.x: ${BIGDATA_HOME}/OMSV100R001C00x8664/workspace/data directory on the active management node
- For MRS 3.x clusters: ${BIGDATA_HOME}/om-server/OMS/workspace/data directory on the active management node
Video Tutorial
This tutorial introduces how to view cluster alarms and events and configure an alarm threshold.
The UI may vary depending on the version. This tutorial is for reference only.
- Log in to the MRS console.
- On the Active Clusters page, select a running cluster and click its name to switch to the cluster details page.
- Click Alarms and view the alarm information in the alarm list.
- The alarm list page displays the latest 10 alarms by default.
- You can filter all alarms of the same severity. The results include cleared and uncleared alarms.
- Click Export All. In the displayed Export dialog box, set Save As and click OK.
- Click Advanced Search. In the displayed alarm search area, set search criteria and click Search to view information about specified alarms. Click Reset to clear the search criteria.
The start time and end time are specified in Time Range. You can search for alarms generated within the time range.
Handle the alarm by referring to the Alarm Reference. If the alarms in some scenarios are generated due to other cloud services that MRS depends on, you need to contact maintenance personnel of the corresponding cloud services.
- Click Clear Alarm if you need to. In the displayed dialog box, click OK.
If multiple alarms have been handled, you can select one or more alarms to be cleared and click Clear Alarm to clear the alarms in batches. A maximum of 300 alarms can be cleared in each batch.
- Log in to FusionInsight Manager of the MRS cluster.
For details about how to log in to FusionInsight Manager, see Accessing MRS Manager.
- Choose O&M > Alarm > Alarms.
- View the alarm information reported by each cluster on FusionInsight Manager, including the alarm name, ID, severity, and generation time. By default, the latest 10 alarms are displayed on each page.
- You can click
on the left of an alarm to view detailed alarm parameters. Table 2 describes the parameters.
Table 2 Alarm parameters Parameter
Description
Alarm ID
Alarm ID.
Alarm Name
Alarm name.
Alarm Severity
Alarm severity. The options are Critical, Major, Minor, and Suggestion.
Generated
Time when the alarm is generated.
Cleared
Time when an alarm is cleared. If the alarm is not cleared, -- is displayed.
Source
Cluster name.
Object
Service, process, or module that triggers the alarm.
Auto Clear
Whether the alarm can be automatically cleared after the fault is rectified.
Alarm Status
Current status of the alarm. The options are Auto, Manual, and Uncleared.
Alarm Cause
Possible cause of an alarm.
Serial Number
Number of alarms generated by the system.
Additional Information
Error information.
MRS 3.3.0 or later: You can view the monitoring metric values in Additional Information if thresholds are set for the metrics to generate alarms.
Location
Detailed information for locating the alarm, which includes the following:
- Source: cluster for which the alarm is generated.
- ServiceName: service for which the alarm is generated.
- RoleName: role for which the alarm is generated.
- HostName: host for which the alarm is generated.
- Manage alarms.
- Click Export All to export all alarm details.
- After handling multiple alarms, you can select and clear one or more of them in batches by clicking Clear Alarm. Each batch can only clear a maximum of 300 alarms.
- You can filter alarms by object or severity.
- You can click Advanced Search to search for alarms by alarm ID, name, type, start time, or end time. Click Search to filter alarms that meet the search criteria. Click Advanced Search again to view the number of search criteria that you have configured.
- You can click Clear, Mask, or View Help to perform corresponding operations on an alarm.
- If there are a large number of alarms, you can click View by Category to sort uncleared alarms by alarm ID. After alarms are classified, click the number of uncleared alarms to view alarm details.
Helpful Links
- You can customize the alarm thresholds for MRS clusters. If the preset alarm conditions are met, the system reports an alarm. For details, see Configuring Alarm Thresholds for an MRS Cluster.
- If you do not want to see an alarm, you can manually mask it. For details, see Configuring Alarm Masking for an MRS Cluster.
- If you do not want to see alarms during cluster upgrade or maintenance, you can temporarily disable alarm reporting. For details, see Enabling the MRS Cluster Maintenance Mode to Disable Alarm Reporting.
- You can configure alarm notifications to receive notification messages through different subscription endpoints (such as SMS messages and emails). For details, see Configuring Notifications for MRS Cluster Alarms and Events.
- For details about how to cancel the message notification for alarms, see How Do I Cancel Message Notification for Cluster Alarms?
- For details about component alarms and handling procedure, see MRS Cluster Alarm Handling Reference.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot