Updated at: 2022-05-19 GMT+08:00

ALM-12027 Host PID Usage Exceeds the Threshold

Description

The system checks the PID usage every 30 seconds and compares the actual PID usage with the default threshold. This alarm is generated when the PID usage exceeds the threshold.

This alarm is cleared when the host PID usage is less than or equal to the threshold.

Attribute

Alarm ID

Alarm Severity

Auto Clear

12027

Major

Yes

Parameters

Parameter

Description

ServiceName

Specifies the service for which the alarm is generated.

RoleName

Specifies the role for which the alarm is generated.

HostName

Specifies the host for which the alarm is generated.

Trigger Condition

Generates an alarm when the actual indicator value exceeds the specified threshold.

Impact on the System

No PID is available for new processes and service processes are unavailable.

Possible Causes

Too many processes are running on the node. You need to increase the value of pid_max. The system is abnormal.

Procedure

  1. Increase the value of pid_max.

    1. On the MRS cluster details page, click the alarm from the real-time alarm list. In the Alarm Details area, obtain the IP address of the host for which the alarm is generated.
    2. Log in to the node for which the alarm is generated.
    3. Run the cat /proc/sys/kernel/pid_max command to check the value of pid_max.
    4. If the PID usage exceeds the threshold, run the following command to double the value of pid_max:

      echo New pid_max value > /proc/sys/kernel/pid_max

      Example:

      echo 65536 > /proc/sys/kernel/pid_max

    5. Wait 5 minutes and check whether the alarm is cleared.
      • If yes, no further action is required.
      • If no, go to 2.

  2. Check whether the system environment is abnormal.

    1. Contact the O&M personnel to check whether the operating system is abnormal.
      • If yes, rectify the operating system fault and go to 2.b.
      • If no, go to 3.
    2. Wait 5 minutes and check whether the alarm is cleared.
      • If yes, no further action is required.
      • If no, go to 3.

  3. Collect fault information.

    1. On MRS Manager, choose System > Export Log.
    2. Contact the O&M personnel and send the collected logs.

Reference

None

close