Updated on 2022-09-14 GMT+08:00

How to Use MRS

MapReducce Service is a Huawei Cloud service that is used to deploy and manage Hadoop clusters. MRS provides enterprise-class big data clusters on the cloud. Tenants can fully control these clusters and easily run big data components such as Hadoop, Spark, HBase, and Kafka in them.

MRS is easy to use. You can execute various tasks and process or store PB-level data using computers connected in a cluster.

The procedure of using MRS is as follows:

  1. On the MRS console, purchase clusters and specify these clusters for offline data analysis and stream processing, and specify the Elastic Cloud Server (ECS) instance specifications, quantity, data disk types (common I/O, high I/O, or ultra-high I/O), as well as components to be installed in the clusters.
  2. Develop a data processing program. For details about how to quickly develop such a program and execute it properly, see the sample code and tutorials provided in Method of Building an MRS Sample Project.
  3. Upload the prepared program and data files to Object Storage Service (OBS) or the HDFS in the cluster.
  4. After a cluster is created, you can directly add jobs and run your programs or SQL statements to process and analyze data.
  5. MRS provides you with MRS Manager, an enterprise-class unified management platform of big data clusters, helping you quickly know the health status of services and hosts. Through graphical metric monitoring and customization, you can obtain critical system information in a timely manner. In addition, you can modify service attribute configurations based on service performance requirements, and start or stop clusters, services, and role instances in one click.
  6. Terminate the cluster if it is no longer needed after job execution. The terminated cluster is no longer billed.