Updated on 2022-12-14 GMT+08:00

Failed to Reduce Task Nodes

Issue

A user fails to scale in an MRS 2.x cluster by reducing the number of task nodes to 0 on the MRS console.

Symptom

When the number of task nodes in an MRS cluster is reduced on the MRS console, the following information is displayed:

This operation is not allowed because the number of instances of NodeManager will be less than the minimum configuration after scale-in, which may cause data loss.

Cause Analysis

The NodeManager service of the core node is stopped. If the number of task nodes is changed to 0, there will be no NodeManager in the cluster and the Yarn service will be unavailable. Therefore, MRS allows the reduction of task nodes only when the number of NodeManagers is greater than or equal to 1.

Procedure

  1. Select the NodeManager instance of the core node, click More, and select Start Instance.

  2. Reduce the number of task nodes on the cluster details page.

    1. Click the cluster name, and select the Nodes tab.
    2. Locate the row that contains the task node group and click Scale In in the Operation column.
    3. Click OK. In the displayed dialog box, click Yes.

  3. After the scale-in is successful, stop NodeManager of the core node if you do not need it.

Summary and Suggestions

You are advised not to stop NodeManager of the core node.