Updated on 2024-09-23 GMT+08:00

ALM-12051 Disk Inode Usage Exceeds the Threshold (For MRS 2.x or Earlier)

Description

The system checks the disk inode usage every 30 seconds. This alarm is generated when the disk inode usage exceeds the threshold (80% by default) a specified number of times (the hit number, 5 by default).

You can change the threshold by choosing System > Threshold Configuration > Device > Host > Disk > Disk Inode Usage > Disk Inode Usage.

If the hit number is 1, this alarm is cleared when the disk inode usage is less than or equal to the threshold. If the hit number is greater than 1, this alarm is cleared when the disk inode usage is less than or equal to 90% of the threshold.
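
The clearing rule above can be sketched as a small shell function. This is only an illustration under stated assumptions: the function name clear_level is hypothetical, percentages are treated as integers, and nothing here reflects how MRS Manager implements the check internally.

```shell
#!/bin/sh
# Hypothetical helper (not part of MRS): print the inode usage level, in
# percent, at or below which the alarm clears, following the rule above.
#   $1 = alarm threshold in percent (assumed default: 80)
#   $2 = hit number                 (assumed default: 5)
clear_level() {
    threshold=${1:-80}
    hits=${2:-5}
    if [ "$hits" -le 1 ]; then
        # Hit number 1: the alarm clears at the threshold itself.
        echo "$threshold"
    else
        # Hit number > 1: the alarm clears at 90% of the threshold.
        # Integer arithmetic, so the result is rounded down.
        echo $(( threshold * 90 / 100 ))
    fi
}

clear_level 80 5   # prints 72
clear_level 80 1   # prints 80
```

With the default settings (threshold 80%, hit number 5), the inode usage therefore has to drop to 72% or lower before the alarm clears.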

Attribute

Alarm ID | Alarm Severity | Auto Clear
12051 | Major | Yes

Parameters

Parameter | Description
ServiceName | Specifies the service for which the alarm is generated.
RoleName | Specifies the role for which the alarm is generated.
HostName | Specifies the host for which the alarm is generated.
PartitionName | Specifies the disk partition for which the alarm is generated.
Trigger Condition | Specifies the threshold for triggering the alarm.

Impact on the System

Once the inodes of a partition are used up, new files cannot be created in it, so data cannot be written to the file system.

Possible Causes

  • There are too many small files on the disk.
  • The operating system environment is abnormal.

Procedure

Check whether there are too many small files on the disk.

  1. Go to the MRS cluster details page and choose Alarms.
  2. In the real-time alarm list, click the alarm. In the Alarm Details area, obtain the IP address and disk partitions of the host for which the alarm is generated.
  3. Use PuTTY to log in to the host for which the alarm is generated as user root.
  4. Run the df -i <partition name> command to check the current inode usage of the partition (the IUse% column).
  5. If the inode usage exceeds the threshold, manually check whether the small files in the partition can be deleted.

    • If yes, delete the files and go to 6.
    • If no, adjust the capacity. Then go to 7.

  6. Wait 5 minutes and check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 7.
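
Steps 4 and 5 can be sketched as two shell helpers. This is a minimal sketch, assuming GNU df and the standard find, sed, sort, uniq, and awk tools on the host. The function names over_threshold and top_dirs are hypothetical and are not part of MRS.

```shell
#!/bin/sh
# Hypothetical helper: read `df -i` output on stdin and print the mount point
# and IUse% of every file system whose inode usage exceeds the limit.
#   $1 = limit in percent (assumed default: 80, the alarm's default threshold)
# File systems that report "-" for IUse% are treated as 0 and never listed.
over_threshold() {
    awk -v limit="${1:-80}" '
        NR > 1 {
            use = $5
            sub(/%/, "", use)
            if (use + 0 > limit) print $6, use "%"
        }'
}

# Hypothetical helper: list the 10 directories under $1 that hold the most
# regular files. -xdev keeps find on the same file system as $1.
top_dirs() {
    find "$1" -xdev -type f 2>/dev/null |
        sed 's|/[^/]*$||; s|^$|/|' |   # strip the file name, keep the directory
        sort | uniq -c | sort -rn | head -n 10
}

# Example invocations (run manually on the affected host):
#   df -Pi | over_threshold 80
#   top_dirs <mount point of the partition>
```

The -P option asks df for the POSIX output layout so that each file system stays on one line, which keeps the column positions stable for awk. The directories reported by top_dirs are only candidates for cleanup; whether their files can be deleted still has to be confirmed manually, as step 5 requires.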

Check whether the system environment is normal.

  7. Contact the operating system maintenance personnel to check whether the system environment is abnormal.

    • If yes, rectify the operating system fault and go to 8.
    • If no, go to 9.

  8. Wait 5 minutes and check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 9.

  9. Collect fault information.

    1. On MRS Manager, choose System > Export Log.
    2. Contact the O&M engineers and send the collected logs.

Reference

None