ALM-44004 Presto Coordinator Resource Group Queuing Tasks Exceed the Threshold (For MRS 2.x or Earlier)

Description

This alarm is generated when the system detects that the number of queuing tasks in a resource group exceeds the threshold. The system obtains the number of queuing tasks in each resource group through the JMX interface. To configure a resource group, choose Components > Presto > Service Configuration (switch Basic to All) > Presto > resource-groups. To configure the alarm threshold of each resource group, choose Components > Presto > Service Configuration (switch Basic to All) > Coordinator > Customize > resourceGroupAlarm.
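The comparison behind the alarm can be sketched as follows. This is a minimal illustration only, not the monitoring code that MRS runs: the function name, the group names, and the dictionary layout are hypothetical, and the real format of the resourceGroupAlarm value is not shown here.

```python
from typing import Dict, List


def groups_over_threshold(queued: Dict[str, int], thresholds: Dict[str, int]) -> List[str]:
    """Return the resource groups whose queuing-task count exceeds its threshold.

    Groups without a configured threshold are ignored in this sketch.
    """
    return sorted(
        group
        for group, count in queued.items()
        if group in thresholds and count > thresholds[group]
    )


# Hypothetical sample values, not taken from a real cluster.
queued_now = {"global.adhoc": 120, "global.etl": 3}
alarm_thresholds = {"global.adhoc": 100, "global.etl": 50}
print(groups_over_threshold(queued_now, alarm_thresholds))
```

In this sketch a count equal to the threshold does not trigger the alarm, which follows the wording "exceeds the threshold".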

Attribute

Alarm ID	Alarm Severity	Auto Clear
44004	Major	Yes

Parameter

Parameter	Description
ServiceName	Service for which the alarm is generated.
RoleName	Role for which the alarm is generated.
HostName	Host for which the alarm is generated.

Impact on the System

If the number of queuing tasks in a resource group exceeds the threshold, a large number of tasks may be waiting in the queue, and Presto tasks take longer than expected to complete. When the number of queuing tasks in a resource group exceeds the maximum number of queuing tasks allowed for the resource group (maxQueued), new tasks cannot be executed.
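The effect of maxQueued can be illustrated with a small model. This is a simplified sketch under stated assumptions, not Presto's scheduler: the class name, the max_running limit, and the "REJECTED" label are hypothetical, and only maxQueued comes from the description above.

```python
from collections import deque


class ResourceGroupSketch:
    """Toy model of a resource group with a running limit and a queue limit."""

    def __init__(self, max_running: int, max_queued: int) -> None:
        self.max_running = max_running  # hypothetical concurrency limit
        self.max_queued = max_queued    # corresponds to maxQueued
        self.running = set()
        self.queue = deque()

    def submit(self, task_id: str) -> str:
        """Run the task if a slot is free, queue it if possible, else refuse it."""
        if len(self.running) < self.max_running:
            self.running.add(task_id)
            return "RUNNING"
        if len(self.queue) < self.max_queued:
            self.queue.append(task_id)
            return "QUEUED"
        return "REJECTED"  # queue is full, so the new task cannot be executed

    def finish(self, task_id: str) -> None:
        """Release a running slot and promote the oldest queuing task, if any."""
        self.running.discard(task_id)
        if self.queue and len(self.running) < self.max_running:
            self.running.add(self.queue.popleft())


group = ResourceGroupSketch(max_running=1, max_queued=1)
for task in ("q1", "q2", "q3"):
    print(task, group.submit(task))
```

With one running slot and one queue slot, the third submission is refused until an earlier task finishes, which is the situation the alarm threshold is meant to warn about in advance.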

Possible Causes

The resource group is improperly configured, or too many tasks are submitted to the resource group.

Procedure

Choose Components > Presto > Service Configuration (switch Basic to All) > Presto > resource-groups to adjust the resource group configuration.
Choose Components > Presto > Service Configuration (switch Basic to All) > Coordinator > Customize > resourceGroupAlarm to modify the threshold of each resource group.
Collect fault information.
1. Log in to the cluster node indicated by the host name in the fault information. On the Presto client, query the number of queuing tasks in the resource group indicated by Resource Group in the additional information.
2. On the same node, open the /var/log/Bigdata/nodeagent/monitorlog/monitor.log file and search for resource group entries to view the monitoring data collected for the resource group.
3. Contact O&M engineers and send them the collected logs.
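The log search in step 2 can be scripted. The following is a minimal sketch: the function name and the search keywords are assumptions, because the exact wording of resource group entries in monitor.log is not specified here. Adjust the keywords to match what the log actually contains.

```python
import os
from typing import Iterable, List


def find_resource_group_lines(
    log_path: str,
    keywords: Iterable[str] = ("resource group", "resourcegroup"),  # assumed keywords
) -> List[str]:
    """Return the log lines that contain any keyword, ignoring case."""
    needles = [keyword.lower() for keyword in keywords]
    matches = []
    with open(log_path, "r", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            lowered = line.lower()
            if any(needle in lowered for needle in needles):
                matches.append(line.rstrip("\n"))
    return matches


# Path taken from step 2; the search runs only if the file is readable on this node.
log_file = "/var/log/Bigdata/nodeagent/monitorlog/monitor.log"
if os.path.isfile(log_file) and os.access(log_file, os.R_OK):
    for hit in find_resource_group_lines(log_file):
        print(hit)
```

The matching lines can be saved and sent together with the other collected logs.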

Related Information

None
