Quickly Creating a Hadoop Analysis Cluster
This section describes how to quickly create a Hadoop analysis cluster for analysis and query of vast amounts of data. In the open source Hadoop ecosystem, Hadoop uses YARN to manage cluster resources, Hive and Spark to provide offline storage and computing of large-scale distributed data, Spark Streaming and Flink to offer streaming data computing, and Presto to enable interactive queries, Tez to provide a distributed computing framework of directed acyclic graphs (DAGs).
Quickly Creating a Hadoop Analysis Cluster
- On the displayed page, click the Quick Config tab.
- Configure basic cluster information. For details about the parameters, see Creating a Custom Cluster.
- Region: Use the default value.
- Cluster Name: You can use the default name. However, you are advised to include a project name abbreviation or date for consolidated memory and easy distinguishing, for example, mrs_20180321.
- Cluster Type: Use the default value.
- Version Type: Normal is selected by default. (Components vary depending on the version type. Select a version type as needed.)
- Cluster Version: Select the latest version, which is the default value. (The components provided by a cluster vary according to the cluster version. Select a cluster version based on site requirements.)
- Component: Select Hadoop analysis cluster.
- AZ: Use the default value.
- Enterprise Project: Retain the default value.
- VPC: Use the default value. If there is no available VPC, click View VPC to access the VPC console and create a new VPC.
- Subnet: Use the default value.
- CPU Architecture: Use the default value.
- Cluster Node: Select the number of cluster nodes and node specifications based on site requirements.
- Username: The default value is root/admin. User root is used to remotely log in to ECSs, and user admin is used to access the cluster management page.
- Password: Set a password for user root/admin.
- Confirm Password: Enter the password of user root/admin again.
- Select the checkbox to enable secure communications. For details, see Communication Security Authorization.
- Click Create Now.
If Kerberos authentication is enabled, check whether this function is required. If it is, click Continue. If not, click Back to disable it and then proceed with the subsequent step. This option cannot be changed after you create a cluster.
- Click Back to Cluster List to view the cluster status. Click Access Cluster to view cluster details.
For details about cluster status during creation, see the description of the status parameters in Table 1.
It takes some time to create a cluster. The initial status of the cluster is Starting. After the cluster has been created successfully, the cluster status becomes Running.
On the MRS management console, a maximum of 10 clusters can be concurrently created, and a maximum of 100 clusters can be managed.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot