Updated on 2024-11-29 GMT+08:00

ALM-43618 GraphServer Heap Memory Usage Exceeds the Threshold

Alarm Description

The system checks the heap memory usage of the GraphServer service every 30 seconds. This alarm is generated when the heap memory usage of a GraphServer instance exceeds the threshold (90% of the maximum heap memory) for five consecutive checks (that is, the smoothing count is 5).

The alarm is cleared when the heap memory usage is less than the threshold.
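
The trigger rule above can be sketched as follows. This is an illustrative model only, not the FusionInsight Manager implementation: the function name `alarm_raised` and the sample values are hypothetical, and the sketch assumes one usage sample (as a fraction of the maximum heap) per 30-second check.

```python
def alarm_raised(samples, threshold=0.90, periods=5):
    """Return True once usage exceeds `threshold` for `periods` consecutive samples."""
    consecutive = 0
    for usage in samples:
        if usage > threshold:
            consecutive += 1
            if consecutive >= periods:
                return True
        else:
            # A single sample at or below the threshold resets the count.
            consecutive = 0
    return False


# Hypothetical heap usage readings, one per 30-second check.
print(alarm_raised([0.91, 0.92, 0.95, 0.93, 0.94]))  # True
print(alarm_raised([0.91, 0.92, 0.85, 0.93, 0.94]))  # False
```

In this model, a brief spike that lasts fewer than five checks (under about 2.5 minutes) does not raise the alarm.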

Alarm Attributes

| Alarm ID | Alarm Severity | Alarm Type | Service Type | Auto Cleared |
| --- | --- | --- | --- | --- |
| 43618 | Major | Quality of service | GraphBase | Yes |

Alarm Parameters

| Type | Parameter | Description |
| --- | --- | --- |
| Location Information | Source | Specifies the cluster for which the alarm is generated. |
| Location Information | ServiceName | Specifies the service for which the alarm is generated. |
| Location Information | RoleName | Specifies the role for which the alarm is generated. |
| Location Information | HostName | Specifies the host for which the alarm is generated. |

Impact on the System

If the heap memory usage of GraphServer is too high, submitting and running GraphServer jobs may slow down, or the GraphServer service may become unavailable due to memory overflow.

Possible Causes

The GraphServer instance uses too much heap memory, or its heap memory is improperly allocated. As a result, the usage exceeds the threshold.

Handling Procedure

Check the heap memory usage.

  1. On FusionInsight Manager, choose O&M > Alarm > Alarm > ALM-43618 GraphServer Heap Memory Usage Exceeds the Threshold > Location Information. View the IP address of the instance for which the alarm is generated.
  2. On the FusionInsight Manager homepage, choose Cluster > Name of the desired cluster > Service > GraphBase > Instance > GraphServer (corresponding to the IP address of the instance for which this alarm is generated) > GraphServer Heap Memory Usage Statistics. Check the heap memory usage.
  3. Check whether the heap memory used by GraphServer reaches 90% of the maximum heap memory (default threshold) configured for GraphServer.

    • If yes, go to 4.
    • If no, go to 6.

  4. On the FusionInsight Manager homepage, choose Cluster > Name of the desired cluster > Service > GraphBase > Configuration > All Configurations > GraphServer > System. Change the value of GC_OPTS as described in the following note.

    Note: The default heap settings in GC_OPTS are -Xmx30720M -Xms30720M. When you increase them, set -Xmx and -Xms to the same value. For example, to increase both by 2048 MB, use -Xmx32768M -Xms32768M. Keep the value at or below 50% of the host memory of the node, and do not exceed 32 GB.
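
    The sizing guidance in this step can be checked with a short calculation. The following is a minimal sketch, not a tool shipped with the product: the helper names `suggest_heap_mb` and `heap_flags` and the 128 GB host are hypothetical, and the sketch only builds the two heap flags, not the full GC_OPTS value.

```python
def suggest_heap_mb(current_mb, host_memory_mb, step_mb=2048, cap_mb=32 * 1024):
    """Suggest a heap size in MB to use for both -Xmx and -Xms.

    The result stays within 50% of host memory and within 32 GB,
    and is never smaller than the current value.
    """
    limit_mb = min(host_memory_mb // 2, cap_mb)
    return max(current_mb, min(current_mb + step_mb, limit_mb))


def heap_flags(heap_mb):
    """Format the heap flags with -Xmx and -Xms set to the same value."""
    return f"-Xmx{heap_mb}M -Xms{heap_mb}M"


# Default heap of 30720 MB on a hypothetical host with 128 GB of memory.
new_heap = suggest_heap_mb(current_mb=30720, host_memory_mb=128 * 1024)
print(heap_flags(new_heap))  # -Xmx32768M -Xms32768M
```

    If the suggested value equals the current one, the heap is already at the recommended limit for that host, and increasing it further goes beyond the guidance above.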

  5. Check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 6.

Collect the fault information.

  6. On FusionInsight Manager, choose O&M. In the navigation pane on the left, choose Log > Download.
  7. Expand the Service drop-down list, and select GraphBase for the target cluster.
  8. Click the icon in the upper right corner, and set Start Date and End Date for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click Download.
  9. Contact O&M engineers and provide the collected logs.
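
The log collection window can be worked out from the alarm generation time as follows. This is a small sketch for convenience: the helper name `log_window` and the alarm time shown are hypothetical.

```python
from datetime import datetime, timedelta


def log_window(alarm_time, margin_minutes=10):
    """Return (start, end) covering `margin_minutes` before and after the alarm."""
    margin = timedelta(minutes=margin_minutes)
    return alarm_time - margin, alarm_time + margin


# Hypothetical alarm generation time.
start, end = log_window(datetime(2024, 11, 29, 14, 5))
print(start, end)  # 2024-11-29 13:55:00 2024-11-29 14:15:00
```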

Alarm Clearance

After the fault is rectified, the system automatically clears this alarm.

Related Information

None.