ALM-43007 Non-Heap Memory Usage of the JobHistory Process Exceeds the Threshold (For MRS 2.x or Earlier)
Description
The system checks the JobHistory process status every 30 seconds. The alarm is generated when the non-heap memory usage of the JobHistory process exceeds the threshold (90% of the maximum memory).
Attribute
Alarm ID |
Alarm Severity |
Automatically Cleared |
---|---|---|
43007 |
Major |
Yes |
Parameters
Parameter |
Description |
---|---|
ServiceName |
Specifies the service for which the alarm is generated. |
RoleName |
Specifies the role for which the alarm is generated. |
HostName |
Specifies the host for which the alarm is generated. |
Impact on the System
If the available JobHistory process non-heap memory is insufficient, a memory overflow occurs and the service breaks down.
Possible Causes
The non-heap memory of the JobHistory process is overused or the non-heap memory is inappropriately allocated.
Procedure
- Check non-heap memory usage.
- Go to the cluster details page and choose Alarms.
- Select the alarm whose Alarm ID is 43007 and view the IP address and role name of the instance in Location.
- Choose Components > Spark > Instance > JobHistory (IP address of the instance for which the alarm is generated) > Customize > Non-Heap Memory Statistics of the JobHistory Process. Click OK to view the non-heap memory usage.
- Check whether the non-heap memory usage of JobHistory has reached the threshold (90% of the maximum memory).
- Choose Components > Spark > Service Configuration. Set Type to All and choose JobHistory > Default. Increase the value of -XX:MaxMetaspaceSize in SPARK_DAEMON_JAVA_OPTS as required.
- Check whether the alarm is cleared.
- If yes, no further action is required.
- If no, go to 2.
- Collect fault information.
- On MRS Manager, choose .
- Contact the O&M engineers and send the collected logs.
Reference
None
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.