Updated on 2024-04-11 GMT+08:00

ALM-19011 RegionServer Region Number Exceeds the Threshold

Description

The system checks the number of regions on each RegionServer in each HBase service instance every 30 seconds. The region count is displayed on the HBase service monitoring page and the RegionServer role monitoring page. This alarm is generated when the number of regions on a RegionServer exceeds the threshold (default value: 2000) in 20 consecutive checks. To change the threshold, choose O&M > Alarm > Thresholds > Name of the desired cluster > HBase. This alarm is cleared when the number of regions is less than or equal to the threshold.
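
The generation and clearing rule described above can be sketched as a small shell function. This is an illustration only, not the FusionInsight Manager implementation. The function name alarm_state and its arguments are hypothetical. It replays a list of region-count samples (one per 30-second check) and prints the resulting alarm state.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the rule in this section, not the actual
# Manager logic.
# Usage: alarm_state THRESHOLD REQUIRED_CHECKS SAMPLE...
alarm_state() {
  local threshold=$1 required=$2
  shift 2
  local streak=0 state=cleared sample
  for sample in "$@"; do
    if [ "$sample" -gt "$threshold" ]; then
      # Count consecutive checks above the threshold.
      streak=$((streak + 1))
      if [ "$streak" -ge "$required" ]; then
        state=raised
      fi
    else
      # At or below the threshold: the streak resets and the alarm clears.
      streak=0
      state=cleared
    fi
  done
  echo "$state"
}
```

With the defaults from this section, the call would take the form alarm_state 2000 20 followed by the sampled region counts. A single sample at or below the threshold resets the count of consecutive checks.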

Attribute

Alarm ID    Alarm Severity    Auto Clear
19011       Major             Yes

Parameters

Name           Meaning
Source         Specifies the cluster for which the alarm is generated.
ServiceName    Specifies the service for which the alarm is generated.
RoleName       Specifies the role for which the alarm is generated.
HostName       Specifies the host for which the alarm is generated.

Impact on the System

The data read/write performance of HBase is affected when the number of regions on a RegionServer exceeds the threshold.

Possible Causes

  • The RegionServer region distribution is unbalanced.
  • The HBase cluster scale is too small.

Procedure

View alarm location information.

  1. On the FusionInsight Manager home page, choose O&M > Alarm > Alarms, select this alarm, and view the service instance and host name in Location.
  2. On the FusionInsight Manager home page, choose Cluster > Name of the desired cluster > Services, click the HBase service instance for which the alarm is generated, and click HMaster(Active). On the displayed WebUI of the HBase instance, check whether the region distribution on the RegionServer is balanced.

    By default, the admin user does not have the permissions to manage other components. If the page cannot be opened or the displayed content is incomplete when you access the native UI of a component due to insufficient permissions, you can manually create a user with the permissions to manage that component.

    • If yes, go to 9.
    • If no, go to 3.
    Figure 1 WebUI of HBase instance
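
When many RegionServers are listed, the balance check above can be approximated with a short script instead of comparing counts by eye. This is a sketch under assumptions: the input format (one "hostname region_count" pair per line, copied by hand from the WebUI) and the function name region_summary are hypothetical, and 2000 is the default alarm threshold given in the Description.

```shell
#!/usr/bin/env bash
# Hypothetical helper: reads "hostname region_count" lines on stdin and
# prints any RegionServer above the threshold, then min, max, and spread.
# Usage: region_summary [THRESHOLD] < counts.txt
region_summary() {
  local threshold=${1:-2000}
  awk -v t="$threshold" '
    NF >= 2 {
      c = $2 + 0                      # force numeric comparison
      if (n == 0 || c < min) min = c
      if (n == 0 || c > max) max = c
      if (c > t) printf "OVER %s %d\n", $1, c
      n++
    }
    END {
      if (n > 0) printf "min=%d max=%d spread=%d\n", min, max, max - min
    }
  '
}
```

A large spread with only a few OVER lines suggests unbalanced distribution. OVER lines on most RegionServers suggest that the cluster is too small.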

Enable load balancing.

  3. Log in to the node where the HBase client is located as user root. Go to the client installation directory, and set environment variables.

    cd client installation directory

    source bigdata_env

    If the cluster adopts the security mode, perform security authentication. Specifically, run the kinit hbase command and enter the password as prompted (obtain the password from the administrator).

  4. Run the following commands to go to the HBase shell command window and check whether the load balancing function is enabled.

    hbase shell

    balancer_enabled

    • If yes, go to 6.
    • If no, go to 5.

  5. On the HBase shell command window, run the following commands to enable the load balancing function and check whether the function is enabled.

    balance_switch true

    balancer_enabled

  6. On the HBase shell command window, run the balancer command to manually trigger the load balancing function.

    You are advised to enable and manually trigger the load balancing function during off-peak hours.

  7. On the FusionInsight Manager home page, choose Cluster > Name of the desired cluster > Services > HBase, and click HMaster(Active). On the displayed WebUI of the HBase instance, refresh the page and check whether the region distribution is balanced.

    • If yes, go to 8.
    • If no, go to 21.

  8. Check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 9.
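
The balancer check, enable, and trigger commands above can also be run without an interactive session. The sketch below assumes that the hbase command is available after source bigdata_env, that security authentication has already been performed where required, and that the non-interactive mode hbase shell -n is supported by your HBase version. The function names are hypothetical, and the output of balancer_enabled is assumed to contain a line that reads exactly true when the function is enabled. Verify this on your cluster before relying on it.

```shell
#!/usr/bin/env bash
# Hypothetical wrappers around the HBase shell commands used in this
# procedure. Assumes `hbase shell -n` reads commands from stdin.

# Succeeds if balancer_enabled prints a line that is exactly "true"
# (assumed output format).
balancer_is_enabled() {
  echo "balancer_enabled" | hbase shell -n 2>/dev/null | grep -x 'true' >/dev/null
}

# Enables the balancer only if it is off, then triggers one balancer run.
ensure_balanced() {
  if ! balancer_is_enabled; then
    echo "balance_switch true" | hbase shell -n
  fi
  echo "balancer" | hbase shell -n
}
```

As with the manual steps, run ensure_balanced during off-peak hours.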

Delete unwanted HBase tables.

Exercise caution when deleting data. Confirm that a table is no longer needed before you delete it.

  9. On the FusionInsight Manager home page, choose Cluster > Name of the desired cluster > Services > HBase, and click HMaster(Active). On the displayed WebUI of the HBase instance, view tables stored in the HBase service instance and record unwanted tables that can be deleted.
  10. On the HBase shell command window, run the disable and drop commands to delete each unwanted table and reduce the number of regions.

    disable 'name of the table to be deleted'

    drop 'name of the table to be deleted'

  11. On the HBase shell command window, run the following command to check whether the load balancing function is enabled.

    balancer_enabled

    • If yes, go to 13.
    • If no, go to 12.

  12. On the HBase shell command window, run the following commands to enable the load balancing function and confirm that the function is enabled.

    balance_switch true

    balancer_enabled

  13. On the HBase shell command window, run the balancer command to manually trigger the load balancing function.
  14. On the FusionInsight Manager home page, choose Cluster > Name of the desired cluster > Services > HBase, and click HMaster(Active). On the displayed WebUI of the HBase instance, refresh the page and check whether the region distribution is balanced.

    • If yes, go to 15.
    • If no, go to 21.

  15. Check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 16.
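
When several tables are to be removed, typing the disable and drop pair for each one is error-prone. As a sketch, the hypothetical helper drop_commands below only prints the HBase shell commands for the table names passed to it, so that they can be reviewed before being run in the HBase shell. It does not contact the cluster.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prints a disable/drop pair for each table name.
# It only generates text; nothing is deleted until the reviewed output
# is run in the HBase shell.
drop_commands() {
  local table
  for table in "$@"; do
    printf "disable '%s'\n" "$table"
    printf "drop '%s'\n" "$table"
  done
}
```

For example, drop_commands old_table_a old_table_b (placeholder table names) prints four lines that can be checked against the list recorded from the WebUI.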

Adjust the threshold.

  16. On the FusionInsight Manager home page, choose O&M > Alarm > Thresholds > Name of the desired cluster > HBase > Regions(RegionServer), select the applied rule, and click Modify to check whether the threshold is proper.

    • If it is excessively small, increase the threshold as required and go to 17.
    • If it is proper, go to 18.
    Figure 2 Regions(RegionServer_1)

  17. Check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 18.

Perform system capacity expansion.

  18. Add nodes to the HBase cluster and add RegionServer instances to the nodes. Then enable and manually trigger the load balancing function.
  19. On the FusionInsight Manager home page, choose Cluster > Name of the desired cluster > Services, click the HBase service instance for which the alarm is generated, and click HMaster(Active). On the displayed WebUI of the HBase instance, refresh the page and check whether the region distribution is balanced.

    • If yes, go to 20.
    • If no, go to 21.

  20. Check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 21.
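
A rough way to size the expansion is to divide the total number of regions by the per-RegionServer threshold and round up. The result is only a lower bound, because it assumes perfectly even distribution and no growth. The function name min_regionservers is hypothetical, and 2000 is the default threshold from the Description.

```shell
#!/usr/bin/env bash
# Hypothetical sizing helper: minimum number of RegionServers needed so
# that an even distribution stays at or below the threshold.
# Usage: min_regionservers TOTAL_REGIONS [THRESHOLD]
min_regionservers() {
  local total=$1 threshold=${2:-2000}
  # Integer ceiling division.
  echo $(( (total + threshold - 1) / threshold ))
}
```

For example, min_regionservers 9000 prints 5, so a cluster that holds 9000 regions needs at least five RegionServers to stay within the default threshold.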

Collect fault information.

  21. On the FusionInsight Manager home page of the active and standby clusters, choose O&M > Log > Download.
  22. For Service, select HBase in the required cluster.
  23. Click the icon in the upper right corner, and set Start Date and End Date for log collection to 10 minutes before and 10 minutes after the alarm generation time, respectively. Then, click Download.
  24. Contact the O&M personnel and send the collected logs.
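
The log collection window can be computed rather than worked out by hand. The sketch below assumes GNU date (the -d option) and uses a hypothetical function name, log_window. The input is parsed and printed in UTC only to keep the arithmetic independent of the local time zone, so the output is in the same time zone as the alarm generation time that you enter.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prints the log collection window, 10 minutes on
# either side of the alarm generation time. Assumes GNU date.
# Usage: log_window "YYYY-MM-DD HH:MM:SS"
log_window() {
  local ts
  ts=$(date -u -d "$1" +%s) || return 1
  # 600 seconds = 10 minutes before and after the alarm time.
  echo "Start Date: $(date -u -d "@$((ts - 600))" '+%Y-%m-%d %H:%M:%S')"
  echo "End Date: $(date -u -d "@$((ts + 600))" '+%Y-%m-%d %H:%M:%S')"
}
```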

Alarm Clearing

After the fault is rectified, the system automatically clears this alarm.

Related Information

None