Help Center > > User Guide> Configuring a Cluster> Quick Purchase of a Hadoop Analysis Cluster

Quick Purchase of a Hadoop Analysis Cluster

Updated at: Sep 02, 2021 GMT+08:00

This section describes how to quickly purchase a Hadoop analysis cluster for analyzing and querying vast amounts of data. In the open-source Hadoop ecosystem, Hadoop uses Yarn to manage cluster resources, Hive and Spark to provide offline storage and computing of large-scale distributed data, Spark Streaming and Flink to offer streaming data computing, and Presto to enable interactive queries, Tez to provide a distributed computing framework of directed acyclic graphs (DAGs).

The Hadoop analysis cluster consists of the following components:

  • MRS 1.9.2: Hadoop 2.8.3, Spark 2.2.2, Hive 2.3.3, Presto 0.216, Tez 0.9.1, Ranger 1.0.1, and Flink 1.7.0.
  • MRS 3.1.0: Hadoop 3.1.1, Hive 3.1.0, Spark2x 2.4.5, Flink 1.12.0, ZooKeeper 3.5.6, Ranger 2.0.0, Tez 0.9.2, and Presto 333.

Quick Purchase of a Hadoop Analysis Cluster

  1. Log in to the MRS management console.
  2. Click Buy Cluster. The page for buying a cluster is displayed.
  3. Click the Quick Config tab.
  4. Configure basic cluster information. For details about the parameters, see Custom Purchase of a Cluster.

    • Region: Use the default value.
    • Billing Mode: Select Pay-per-use.
    • Cluster Name: You can use the default name. However, you are advised to include a project name abbreviation or date for consolidated memory and easy distinguishing, for example, mrs_20180321.
    • Cluster Version: Select the latest version, which is the default value. (The components provided by a cluster vary according to the cluster version. Select a cluster version based on site requirements.)
    • Component: Select Hadoop analysis cluster.
    • AZ: Use the default value.
    • VPC: Use the default value. If there is no available VPC, click View VPC to access the VPC console and create a new VPC.
    • Subnet: Use the default value.
    • Enterprise Project: Use the default value.
    • CPU Architecture: Use the default value. This parameter is unavailable in MRS 3.x.
    • Cluster Node: Select the number of cluster nodes and node specifications based on site requirements.
    • Cluster HA: Use the default value. This parameter is not available in MRS 3.x.
    • Kerberos Authentication: Specifies whether to enable Kerberos authentication.
    • Username: The default value is root/admin. User root is used to remotely log in to ECSs, and user admin is used to access the cluster management page.
    • Password: Set a password for user root/admin.
    • Confirm Password: Enter the password of user root/admin again.
    Figure 1 Hadoop Analysis Cluster
    Figure 2 Cluster node configurations

  5. Select Enable to enable secure communications. For details, see Communication Security Authorization.
  6. Click Buy Now.

    If Kerberos authentication is enabled for a cluster, check whether Kerberos authentication is required. If yes, click Continue. If no, click Back to disable Kerberos authentication and then create a cluster.

    For any doubt about the pricing, click Pricing details in the lower left corner.

  7. Click Back to Cluster List to view the cluster status. Click Access Cluster to view cluster details.

    For details about cluster status during creation, see the description of the status parameters in Table 1.

    It takes some time to create a cluster. The initial status of the cluster is Starting. After the cluster has been created successfully, the cluster status becomes Running.

    On the MRS management console, a maximum of 10 clusters can be concurrently created, and a maximum of 100 clusters can be managed.

Did you find this page helpful?

Submit successfully!

Thank you for your feedback. Your feedback helps make our documentation better.

Failed to submit the feedback. Please try again later.

Which of the following issues have you encountered?

Please complete at least one feedback item.

Content most length 200 character

Content is empty.

OK Cancel