Instance
Scenario
The sample project illustrates how to write MapReduce jobs that access multiple service components (HDFS, HBase, and Hive), helping users understand key operations such as authentication and configuration loading.
The logic of the sample project is as follows:
The input data is a text file on HDFS. The input file is log1.txt, with the following content:
YuanJing,male,10
GuoYijun,male,5
Map:
- Obtain one row of the input data and extract the user name.
- Query one piece of data from HBase.
- Query one piece of data from Hive.
- Combine the data queried from HBase with the data queried from Hive and use the result as the Map output.
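The Map steps above can be sketched in plain Java as follows. This is only an illustration under stated assumptions, not the sample project's code: the class MapStepSketch, the methods extractName and mapRow, the two in-memory maps that stand in for the HBase and Hive queries, and the "hbase:...,hive:..." output layout are all hypothetical. A real job would query HBase and Hive through their client APIs after authentication.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Map;

class MapStepSketch {
    /** Extracts the user name, i.e. the first comma-separated field of a row. */
    static String extractName(String row) {
        int comma = row.indexOf(',');
        return (comma < 0 ? row : row.substring(0, comma)).trim();
    }

    /**
     * Produces the (name, combined value) pair for one input row.
     * hbaseStub and hiveStub are hypothetical in-memory stand-ins
     * for the HBase and Hive queries.
     */
    static SimpleEntry<String, String> mapRow(String row,
            Map<String, String> hbaseStub, Map<String, String> hiveStub) {
        String name = extractName(row);
        String fromHBase = hbaseStub.getOrDefault(name, "");
        String fromHive = hiveStub.getOrDefault(name, "");
        // Assumed output layout: "hbase:<value>,hive:<value>".
        return new SimpleEntry<>(name, "hbase:" + fromHBase + ",hive:" + fromHive);
    }

    public static void main(String[] args) {
        // Placeholder lookup results, invented for this illustration only.
        Map<String, String> hbaseStub = Map.of("YuanJing", "row-a");
        Map<String, String> hiveStub = Map.of("YuanJing", "row-b");
        System.out.println(mapRow("YuanJing,male,10", hbaseStub, hiveStub));
    }
}
```

Keeping the lookups behind simple parameters makes the combining logic easy to check without a running cluster.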
Reduce:
- Obtain the last piece of data from the Map output.
- Import the data into HBase.
- Save the data to HDFS.
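The Reduce steps above can be sketched in the same way. Again, this is a minimal illustration under assumptions rather than the sample project's code: the class ReduceStepSketch, the methods lastValue and reduce, the map standing in for an HBase table, the list standing in for the HDFS output file, and the tab-separated output line are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class ReduceStepSketch {
    /** Returns the last value emitted for a key, or null if there is none. */
    static String lastValue(Iterable<String> values) {
        String last = null;
        for (String value : values) {
            last = value;
        }
        return last;
    }

    /**
     * Keeps only the last value for the key, then writes it to both sinks.
     * hbaseSink and hdfsSink are hypothetical in-memory stand-ins for an
     * HBase table and an HDFS output file.
     */
    static void reduce(String key, Iterable<String> values,
            Map<String, String> hbaseSink, List<String> hdfsSink) {
        String last = lastValue(values);
        if (last == null) {
            return; // nothing was emitted for this key
        }
        hbaseSink.put(key, last);        // stands in for the import into HBase
        hdfsSink.add(key + "\t" + last); // assumed "key<TAB>value" output line
    }

    public static void main(String[] args) {
        Map<String, String> hbaseSink = new HashMap<>();
        List<String> hdfsSink = new ArrayList<>();
        reduce("YuanJing", Arrays.asList("v1", "v2"), hbaseSink, hdfsSink);
        System.out.println(hbaseSink);
        System.out.println(hdfsSink);
    }
}
```

The values are read in a single forward pass because a reducer's value iterator can only be traversed once.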