Updated on 2022-06-09 GMT+08:00

Quick Creation of a Hadoop Analysis Cluster

This section describes how to quickly create a Hadoop analysis cluster for analyzing and querying vast amounts of data. In the open-source Hadoop ecosystem, Yarn manages cluster resources, Hive and Spark provide offline storage and computing for large-scale distributed data, Spark Streaming and Flink provide streaming data computing, Presto enables interactive queries, and Tez provides a distributed computing framework based on directed acyclic graphs (DAGs).

Quick Creation of a Hadoop Analysis Cluster

  1. Log in to the MRS management console.
  2. Click Create Cluster. The page for creating a cluster is displayed.
  3. Click the Quick Config tab.
  4. Configure basic cluster information. For details about the parameters, see Creating a Custom Cluster.

    • Region: Use the default value.
    • Cluster Name: You can use the default name. However, you are advised to include a project name abbreviation or a date so that the cluster is easy to remember and distinguish, for example, mrs_20180321.
    • Cluster Version: Select the latest version, which is the default value. (The components provided by a cluster vary according to the cluster version. Select a cluster version based on site requirements.)
    • Component: Select Hadoop analysis cluster.
    • AZ: Use the default value.
    • VPC: Use the default value. If there is no available VPC, click View VPC to access the VPC console and create a new VPC.
    • Subnet: Use the default value.
    • Enterprise Project: Use the default value.
    • Cluster Node: Select the number of cluster nodes and node specifications based on site requirements.
    • Cluster HA: Use the default value. This parameter is not available in MRS 3.x.
    • Kerberos Authentication: Select whether to enable Kerberos authentication for the cluster.
    • Username: The default value is root/admin. User root is used to remotely log in to ECSs, and user admin is used to access the cluster management page.
    • Password: Set a password for user root/admin.
    • Confirm Password: Enter the password of user root/admin again.

  5. Select Enable to enable secure communications. For details, see Communication Security Authorization.
  6. Click Apply Now.

    If Kerberos authentication is enabled for the cluster, confirm whether it is actually required. If it is, click Continue. If it is not, click Back, disable Kerberos authentication, and then create the cluster.

  7. Click Back to Cluster List to view the cluster status. Click Access Cluster to view cluster details.

    For details about cluster status during creation, see the description of the status parameters in Table 1.

    It takes some time to create a cluster. The initial status of the cluster is Starting. After the cluster has been created successfully, the cluster status becomes Running.

    On the MRS management console, a maximum of 10 clusters can be concurrently created, and a maximum of 100 clusters can be managed.
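
The naming advice, the status transition, and the console limits described above can be sketched in a few lines of Python. This is only an illustrative sketch, not an MRS API: the helper names (default_cluster_name, can_create, wait_until_running) and the get_status callback are hypothetical, and the code assumes that the 100-cluster limit also blocks new creations and that any status other than Starting means creation has finished.

```python
import time
from datetime import date
from typing import Callable, Optional

# Limits stated in this section for the MRS management console.
MAX_CONCURRENT_CREATIONS = 10
MAX_MANAGED_CLUSTERS = 100


def default_cluster_name(prefix: str = "mrs", day: Optional[date] = None) -> str:
    """Build a name such as mrs_20180321 from a prefix and a date."""
    day = day or date.today()
    return f"{prefix}_{day:%Y%m%d}"


def can_create(creating: int, managed: int) -> bool:
    """Return True if one more cluster fits within both console limits.

    Assumption: a cluster being created also counts toward managed clusters.
    """
    return creating < MAX_CONCURRENT_CREATIONS and managed < MAX_MANAGED_CLUSTERS


def wait_until_running(
    get_status: Callable[[], str],
    poll_seconds: float = 30.0,
    timeout_seconds: float = 3600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll a hypothetical status callback until the cluster leaves Starting.

    get_status is supplied by the caller (for example, a function that reads
    the status shown in the cluster list). The final status is returned so
    the caller can check whether it is Running.
    """
    deadline = time.monotonic() + timeout_seconds
    while True:
        status = get_status()
        if status != "Starting":
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError("cluster is still in the Starting state")
        sleep(poll_seconds)
```

For example, default_cluster_name("mrs", date(2018, 3, 21)) produces the name used in the Cluster Name example, and wait_until_running returns once the callback stops reporting Starting.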