Updated on 2024-11-13 GMT+08:00

ALM-50228 Memory Usage of a Doris Tenant Exceeds the Threshold

Alarm Description

The system checks the memory usage of BE nodes every 30 seconds. This alarm is generated when the memory usage of a tenant exceeds the threshold.

This alarm is cleared when the system detects that the memory usage of the tenant's BE nodes is lower than the threshold.

This alarm applies only to MRS 3.3.1 or later.

Alarm Attributes

    • Alarm ID: 50228
    • Alarm Severity: Critical (default threshold: 90%), Major (default threshold: 85%)
    • Auto Cleared: Yes
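The default thresholds above can be read as a simple mapping from a tenant's memory usage to an alarm severity. The following is a minimal sketch, not the product's implementation: the function name `alarm_severity` is hypothetical, "exceeds" is assumed to mean strictly greater than, and the higher severity is assumed to win when both thresholds are exceeded.

```python
from typing import Optional

# Default thresholds from the alarm attributes, in percent.
CRITICAL_THRESHOLD = 90.0
MAJOR_THRESHOLD = 85.0


def alarm_severity(usage_percent: float,
                   major: float = MAJOR_THRESHOLD,
                   critical: float = CRITICAL_THRESHOLD) -> Optional[str]:
    """Return "Critical", "Major", or None for a memory usage percentage.

    Assumption: a threshold is exceeded only when usage is strictly greater
    than it, and the higher severity takes precedence.
    """
    if usage_percent > critical:
        return "Critical"
    if usage_percent > major:
        return "Major"
    return None  # usage does not exceed any threshold, so no alarm


if __name__ == "__main__":
    for usage in (80.0, 87.0, 92.5):
        print(usage, alarm_severity(usage))
```

If the thresholds are changed on the Thresholds page, the same logic applies with the new values passed as `major` and `critical`.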

Alarm Parameters

Type: Location Information

    • Source: Specifies the cluster or system for which the alarm was generated.
    • ServiceName: Specifies the service for which the alarm was generated.
    • RoleName: Specifies the role for which the alarm was generated.
    • HostName: Specifies the host for which the alarm was generated.

Type: Additional Information

    • Detail: Specifies the alarm triggering condition.

Impact on the System

Processes respond slowly or do not work.

Possible Causes

The tenant queries too much data, and Memory Soft Limit is not enabled for the tenant.

Handling Procedure

Check the memory used by the BE nodes of the tenant.

  1. Log in to FusionInsight Manager and choose O&M > Alarm > Alarms. In the alarm list, find the alarm whose ID is 50228, view the role name, and obtain the IP address of the instance from its Location information.
  2. Click Thresholds and choose Name of the desired cluster > Doris > Tenant Resources > Memory Usage Exceeds Threshold to view and record the threshold.
  3. Choose Cluster > Services > Doris > Instances, select the BE instance for which the alarm is generated, and click Chart. Select Tenant Resource from the Chart Category pane, check whether the actual memory usage in the Memory Used by Tenants chart is greater than the threshold obtained in 2, and record the name of the tenant whose memory usage exceeds the threshold.

    • If yes, go to 4.
    • If no, go to 8.

  4. Check whether a large amount of table data was being queried during the alarm period.

    • If yes, go to 5.
    • If no, go to 8.

  5. Choose Tenant Resources > Tenant Resources Management. In the tenant list, click the name of the tenant recorded in 3, and then click the Resource tab. Click the edit button on the right of Resource Details, and check whether Memory Soft Limit is enabled.

    • If yes, go to 7.
    • If no, go to 6.

  6. Enable Memory Soft Limit and click OK. Check whether the alarm is cleared in the alarm list.

    • If yes, no further action is required.
    • If no, go to 7.

  7. Choose O&M > Alarm > Thresholds > Name of the desired cluster > Doris > Tenant Resources. Increase the threshold and the trigger count based on service requirements. Check whether the alarm is cleared in the alarm list.

    • If yes, no further action is required.
    • If no, go to 8.
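The branching in the steps above can be summarized as a small decision function. This is only an illustrative sketch of the documented flow; the function name `next_action` and its boolean parameters are hypothetical and do not correspond to any FusionInsight Manager API.

```python
def next_action(usage_above_threshold: bool,
                large_query_during_alarm: bool,
                soft_limit_enabled: bool) -> str:
    """Return the first remedial action suggested by the handling procedure.

    Each argument is the answer to one manual check in the procedure.
    """
    if not usage_above_threshold:
        # Step 3, "no" branch: the chart does not confirm the alarm.
        return "collect fault information"
    if not large_query_during_alarm:
        # Step 4, "no" branch: no large query explains the usage.
        return "collect fault information"
    if not soft_limit_enabled:
        # Step 5, "no" branch: go to step 6.
        return "enable Memory Soft Limit"
    # Step 5, "yes" branch: go to step 7.
    return "increase threshold and trigger count"


if __name__ == "__main__":
    print(next_action(True, True, False))
```

If the alarm persists after an action, the procedure continues with the next step: from enabling Memory Soft Limit to adjusting the threshold, and from there to collecting fault information.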

Collect fault information.

  8. On FusionInsight Manager, choose O&M. In the navigation pane on the left, choose Log > Download.
  9. Expand the Service drop-down list, and select Doris for the target cluster.
  10. In the Host area, select the host to which the role belongs and click OK.
  11. Click the edit icon in the upper right corner, and set Start Date and End Date for log collection to 1 hour before and 1 hour after the alarm generation time, respectively. Then, click Download.
  12. Contact O&M engineers and provide the collected logs.
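The log collection window described above is one hour on either side of the alarm generation time. As a quick way to work out the Start Date and End Date values, the arithmetic can be sketched as follows; the function name `log_collection_window` and the sample alarm time are hypothetical.

```python
from datetime import datetime, timedelta
from typing import Tuple


def log_collection_window(alarm_time: datetime,
                          margin: timedelta = timedelta(hours=1)
                          ) -> Tuple[datetime, datetime]:
    """Return (start, end) covering `margin` before and after the alarm time."""
    return alarm_time - margin, alarm_time + margin


if __name__ == "__main__":
    # Hypothetical alarm generation time, for illustration only.
    start, end = log_collection_window(datetime(2024, 6, 1, 10, 30))
    print(start.isoformat(sep=" "), "->", end.isoformat(sep=" "))
    # prints: 2024-06-01 09:30:00 -> 2024-06-01 11:30:00
```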

Alarm Clearance

This alarm is automatically cleared after the fault is rectified.

Related Information

None.