DWS_2000000020 Long SQL Probe Execution Duration in a Cluster

Alarm Description

DWS collects the execution status of the SQL probes on each node in the cluster every 30 seconds. If the execution duration of any SQL probe on a node exceeds twice the threshold (or another user-defined multiple), a critical alarm is generated. When the execution duration of all SQL probes falls below the threshold, the critical alarm is cleared.

If the SQL probe execution duration remains higher than the alarm reporting threshold, the alarm is generated again after 24 hours (or another user-defined interval).
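
The trigger, clear, and repeat rules described above can be sketched as a small state machine. This is an illustrative model only, not the DWS implementation: the class name `ProbeAlarm`, its fields, and the `evaluate` method are hypothetical, and the sketch assumes that the 24-hour repeat applies while the duration stays above the raise level (twice the threshold by default).

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class ProbeAlarm:
    """Hypothetical model of the alarm rules; all names are assumptions."""
    threshold_s: float                 # configured probe duration threshold
    multiplier: float = 2.0            # alarm is raised above multiplier * threshold
    resend_after_s: float = 24 * 3600  # repeat interval while the condition persists
    active: bool = False
    last_reported_at: Optional[float] = None

    def evaluate(self, durations_s: Iterable[float], now_s: float) -> Optional[str]:
        """Process one sample (taken every 30 seconds in the description above).

        Returns "raise", "clear", or None when nothing is reported.
        """
        durations = list(durations_s)
        worst = max(durations, default=0.0)

        if worst > self.multiplier * self.threshold_s:
            if not self.active:
                # First time any probe exceeds the raise level.
                self.active = True
                self.last_reported_at = now_s
                return "raise"
            if now_s - self.last_reported_at >= self.resend_after_s:
                # Condition has persisted for the repeat interval.
                self.last_reported_at = now_s
                return "raise"
            return None

        if self.active and all(d < self.threshold_s for d in durations):
            # Every probe is back below the threshold.
            self.active = False
            self.last_reported_at = None
            return "clear"

        # Between the threshold and the raise level: state is unchanged.
        return None
```

In this sketch an active alarm is not cleared while any probe is still at or above the threshold, even if no probe exceeds the raise level, which mirrors the separate raise and clear conditions in the description.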

Attributes

Alarm ID	Alarm Category	Alarm Severity	Alarm Type	Service Type	Auto Cleared
DWS_2000000020	Tenant plane	Important	Operation alarm	DWS	Yes

Alarm Parameters

Category	Name	Description
Location information	Name	Long SQL Probe Execution Duration in a Cluster
	Type	Operation alarm
	Generation time	Time when the alarm is generated
Other information	Cluster ID	Cluster details such as resourceId and domain_id

Impact on the System

The cluster performance deteriorates or the cluster is faulty.

Possible Causes

The service load of the cluster is high or the cluster is faulty. As a result, the execution of the SQL probe becomes slow.

Handling Procedure

In the navigation pane of the monitoring panel, choose Utilities > SQL Probes. Check the execution status of the SQL probes.
In the navigation pane, choose Monitoring > Performance Monitoring. Check the monitoring metrics such as the CPU usage, disk usage, and memory usage to determine whether the workloads are high or any metric is abnormal.
In the navigation pane, choose Monitoring > Real-Time Queries. Check whether any queries or sessions have been running for a long time and are affecting cluster operation. You can terminate abnormal sessions or queries.
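
If the session list from the last step is exported for offline review, the long-running entries can be picked out with a short filter. The sketch below is only an illustration: the function `long_running`, the record fields `query_id` and `duration_s`, and the sample values are all hypothetical and do not reflect an actual DWS export format.

```python
from typing import Dict, List


def long_running(sessions: List[Dict], min_seconds: float) -> List[Dict]:
    """Return sessions running for at least min_seconds, longest first.

    Each record is assumed to carry a numeric "duration_s" field (hypothetical).
    """
    hits = [s for s in sessions if s["duration_s"] >= min_seconds]
    return sorted(hits, key=lambda s: s["duration_s"], reverse=True)


# Made-up sample records, for illustration only.
sample = [
    {"query_id": "q1", "duration_s": 12.0},
    {"query_id": "q2", "duration_s": 5400.0},
    {"query_id": "q3", "duration_s": 900.0},
]

candidates = long_running(sample, min_seconds=600)
print([s["query_id"] for s in candidates])
```

The cutoff passed as `min_seconds` is a judgment call that depends on the workload; the resulting entries are candidates to inspect before deciding whether to terminate them.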