What Are the Application Scenarios of the Bulkload and put Data-loading Modes?

Question

Both the bulkload and put data-loading modes can be used to load data into HBase. Although the bulkload mode loads data faster than the put mode, it has its own disadvantages. What are the application scenarios of these two data-loading modes?

Answer

The bulkload mode starts MapReduce tasks to generate HFiles and then registers the HFiles with HBase. If the bulkload mode is used incorrectly, the MapReduce tasks it starts consume additional cluster memory and CPU resources. In addition, a large number of generated HFiles may frequently trigger compaction, which sharply reduces the query speed.
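
The effect of file count can be illustrated with a rough estimate. The following Python sketch is not part of HBase. The function name estimate_hfiles is hypothetical, and the sketch assumes that a bulkload job writes one HFile per Region per column family, that data is spread evenly across Regions, and that the HDFS block size is 128 MB. Actual file counts depend on the job and cluster configuration.

```python
# Assumed HDFS block size (128 MB); check dfs.blocksize on the actual cluster.
HDFS_BLOCK_BYTES = 128 * 1024 * 1024


def estimate_hfiles(total_bytes, regions, column_families=1):
    """Return (hfile_count, average_hfile_bytes) for one bulkload job.

    Simplifying assumptions (not taken from HBase itself): one HFile per
    Region per column family, and data spread evenly across Regions.
    """
    if total_bytes < 0 or regions <= 0 or column_families <= 0:
        raise ValueError("total_bytes must be >= 0; regions and column_families must be > 0")
    hfile_count = regions * column_families
    return hfile_count, total_bytes / hfile_count


if __name__ == "__main__":
    # Example: 2 GB loaded into a table with 100 Regions and one column family.
    count, avg_bytes = estimate_hfiles(2 * 1024**3, regions=100)
    ratio = avg_bytes / HDFS_BLOCK_BYTES
    print(f"{count} HFiles, average size is {ratio:.2f} of an HDFS block")
```

Under these assumptions, a small load spread over many Regions produces many HFiles that are each much smaller than an HDFS block, which is the situation in which repeated bulkload jobs are likely to trigger compaction.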

Incorrect use of the put mode may result in slow data loading. If the memory allocated to the RegionServer is insufficient, the RegionServer process may exit.

The application scenarios of the bulkload and put modes are as follows:

bulkload:
- A large amount of data is loaded into HBase at one time.
- The reliability requirements for the loaded data are low, and WAL files do not need to be generated.
- The loading and query speeds would be too low if the put mode were used.
- The size of each HFile generated by the load is close to the HDFS block size.
put:
- The size of the data loaded to a single Region at a time is smaller than half the HDFS block size.
- Data needs to be loaded into HBase in real time.
- The query speed must not drop sharply during data loading.
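
The scenarios above can be summarized as a simple decision helper. The following Python sketch is illustrative only. The names LoadProfile and choose_mode are hypothetical, the 128 MB default block size is an assumption, and treating any load of at least half a block per Region as a bulkload candidate is a simplification of the guidance above.

```python
from dataclasses import dataclass

MB = 1024 * 1024


@dataclass
class LoadProfile:
    """Hypothetical description of one data-loading job."""
    bytes_per_region_per_load: int  # data written to a single Region at a time
    real_time: bool                 # data must be visible in real time
    needs_wal: bool                 # reliability requires WAL files
    hdfs_block_bytes: int = 128 * MB  # assumed HDFS block size


def choose_mode(profile: LoadProfile) -> str:
    """Return "put" or "bulkload" based on the scenarios listed above."""
    # Real-time loading and WAL-backed reliability both point to put.
    if profile.real_time or profile.needs_wal:
        return "put"
    # Loads smaller than half an HDFS block per Region would produce
    # small HFiles, so put is the better fit.
    if profile.bytes_per_region_per_load < profile.hdfs_block_bytes / 2:
        return "put"
    # Large one-off loads with low reliability requirements suit bulkload.
    return "bulkload"


if __name__ == "__main__":
    nightly_import = LoadProfile(120 * MB, real_time=False, needs_wal=False)
    sensor_stream = LoadProfile(2 * MB, real_time=True, needs_wal=True)
    print(choose_mode(nightly_import))
    print(choose_mode(sensor_stream))
```

The helper checks the real-time and reliability requirements first, because those rule out bulkload regardless of data size.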
