ALM-50228 Memory Usage of a Doris Tenant Exceeds the Threshold
Alarm Description
The system checks the memory usage of BE nodes every 30 seconds. This alarm is generated when the memory usage of a tenant exceeds the threshold.
This alarm is cleared when the system detects that the memory usage of the tenant on the BE nodes has dropped below the threshold.
This alarm applies only to MRS 3.3.1 or later.
Alarm Attributes
Alarm ID | Alarm Severity | Auto Cleared |
---|---|---|
50228 | Critical (default threshold: 90%); Major (default threshold: 85%) | Yes |
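The two default thresholds can be read as a simple classification rule. The Python sketch below is illustrative only and is not the Manager implementation: the function name `classify_severity` is hypothetical, and it assumes that "exceeds" means strictly greater than the threshold and that the higher severity takes precedence when both thresholds are exceeded.

```python
from typing import Optional

# Default thresholds taken from the Alarm Attributes table (percent).
MAJOR_THRESHOLD = 85.0
CRITICAL_THRESHOLD = 90.0


def classify_severity(usage_percent: float,
                      major: float = MAJOR_THRESHOLD,
                      critical: float = CRITICAL_THRESHOLD) -> Optional[str]:
    """Return the severity for one memory usage sample, or None if no alarm applies.

    Assumption: "exceeds" is a strict comparison, and Critical wins over Major.
    """
    if usage_percent > critical:
        return "Critical"
    if usage_percent > major:
        return "Major"
    return None
```

Under these assumptions, a tenant at 87.5% memory usage maps to Major, and a tenant at 92% maps to Critical.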
Alarm Parameters
Type | Parameter | Description |
---|---|---|
Location Information | Source | Specifies the cluster or system for which the alarm was generated. |
 | ServiceName | Specifies the service for which the alarm was generated. |
 | RoleName | Specifies the role for which the alarm was generated. |
 | HostName | Specifies the host for which the alarm was generated. |
Additional Information | Detail | Specifies the alarm triggering condition. |
Impact on the System
Processes respond slowly or do not work.
Possible Causes
The volume of data queried by the tenant is too large, and Memory Soft Limit is not enabled for the tenant.
Handling Procedure
Check the memory used by the BE nodes of the tenant.
1. Log in to FusionInsight Manager and choose O&M > Alarm > Alarms. In the alarm list, locate the alarm whose ID is 50228, view the role name, and obtain the IP address of the instance from Location.
2. Click Thresholds and choose Name of the desired cluster > Doris > Tenant Resources > Memory Usage Exceeds Threshold to view and record the threshold.
3. Choose Cluster > Services > Doris > Instances, select the BE instance for which the alarm is generated, and click Chart. Select Tenant Resource from the Chart Category pane, check whether the actual memory usage in the Memory Used by Tenants chart is greater than the threshold obtained in 2, and record the name of the tenant whose memory usage exceeds the threshold.
4. Check whether a large amount of table data was being queried during the alarm period.
5. Choose Tenant Resources > Tenant Resources Management. In the tenant list, click the name of the tenant recorded in 3, and then click the Resource tab. Click the edit button on the right of Resource Details, and check whether Memory Soft Limit is enabled.
6. If Memory Soft Limit is disabled, enable it and click OK. Check whether the alarm is cleared in the alarm list.
   - If yes, no further action is required.
   - If no, go to 7.
7. Choose O&M > Alarm > Thresholds > Name of the desired cluster > Doris > Tenant Resources. Increase the threshold value and the trigger count based on service requirements. Check whether the alarm is cleared in the alarm list.
   - If yes, no further action is required.
   - If no, go to 8.
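To see how the threshold value and the trigger count interact, consider the following Python sketch. It is a hedged illustration and not the Manager implementation: the function name `alarm_triggered` is hypothetical, and it assumes that the trigger count is the number of consecutive periodic checks (every 30 seconds, according to the alarm description) in which memory usage must exceed the threshold before the alarm is raised.

```python
from typing import Iterable


def alarm_triggered(samples: Iterable[float], threshold: float, trigger_count: int) -> bool:
    """Return True once `trigger_count` consecutive samples exceed `threshold`.

    Assumption: each sample is one periodic memory usage check (percent),
    and a sample at or below the threshold resets the consecutive counter.
    """
    if trigger_count < 1:
        raise ValueError("trigger_count must be at least 1")
    consecutive = 0
    for usage in samples:
        consecutive = consecutive + 1 if usage > threshold else 0
        if consecutive >= trigger_count:
            return True
    return False
```

Under this assumption, raising either value makes the alarm less sensitive: a higher threshold ignores moderate usage, and a higher trigger count ignores short spikes.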
Collect fault information.
8. On FusionInsight Manager, choose O&M. In the navigation pane on the left, choose Log > Download.
9. Expand the Service drop-down list, and select Doris for the target cluster.
10. In the Host area, select the host to which the role belongs and click OK.
11. Click the edit icon in the upper right corner, and set Start Date and End Date for log collection to 1 hour before and 1 hour after the alarm generation time, respectively. Then, click Download.
12. Contact O&M engineers and provide the collected logs.
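The log collection window in the procedure above is simple date arithmetic. The following Python sketch shows one way to compute the Start Date and End Date values before entering them in the download dialog. The function name `log_collection_window` and the sample alarm time are hypothetical and serve only as an illustration.

```python
from datetime import datetime, timedelta
from typing import Tuple


def log_collection_window(alarm_time: datetime,
                          margin: timedelta = timedelta(hours=1)) -> Tuple[datetime, datetime]:
    """Return (start, end) covering `margin` before and after the alarm generation time."""
    return alarm_time - margin, alarm_time + margin


# Example with a made-up alarm generation time.
start, end = log_collection_window(datetime(2024, 5, 1, 14, 30))
print("Start Date:", start)
print("End Date:  ", end)
```

The one-hour margin is a default parameter so that a wider window can be computed if O&M engineers request more context.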
Alarm Clearance
This alarm is automatically cleared after the fault is rectified.
Related Information
None.