Updated on 2024-03-01 GMT+08:00

ALM-45325 Presto Service Unavailable

This section applies only to MRS 3.1.5 or later.

Alarm Description

The system checks the Presto service status every 60 seconds. This alarm is generated when the Presto service is unavailable. This alarm is cleared when the Presto service recovers.

Alarm Attributes

Alarm ID

Alarm Severity

Auto Cleared

45325

Critical

Yes

Alarm Parameters

Parameter

Description

ServiceName

Specifies the service for which the alarm was generated.

RoleName

Specifies the role for which the alarm was generated.

HostName

Specifies the host for which the alarm was generated.

Impact on the System

Presto cannot execute SQL statements.

Possible Causes

  • The Presto coordinator or worker process is faulty.
  • The network communication between Presto coordinator and worker instances is interrupted.

Handling Procedure

Check the status of the coordinator and worker processes.

  1. Log in to FusionInsight Manager, choose Cluster, click the name of the desired cluster, and choose Services > Presto. On the page that is displayed, click the Instance tab. In the Presto instance list, check whether the running status of all coordinator or worker instances is Unknown.

    • If yes, go to 2.
    • If no, go to 4.

  1. Above the Presto instance list, click More and select Restart Service to restart the coordinator and worker processes.
  2. In the alarm list, check whether ALM-45325 Presto Service Unavailable is cleared.

    • If yes, no further action is required.
    • If no, go to 4.

Collect fault information.

  1. On FusionInsight Manager, choose O&M. In the navigation pane on the left, choose Log > Download.
  1. Expand the Service drop-down list, and select Presto for the target cluster.
  2. Click in the upper right corner, and set Start Date and End Date for log collection to 10 minutes ahead of and after the alarm generation time, respectively. Then, click Download.
  3. Contact the O&M engineers and send the collected logs.

Alarm Clearance

This alarm is automatically cleared after the fault is rectified.