Help Center/ MapReduce Service/ Component Operation Guide (Normal)/ Using HBase/ HBase Troubleshooting/ Data Is Successfully Imported Using HBase BulkLoad, but Different Results May Be Returned To the Same Query
Updated on 2024-10-08 GMT+08:00

Data Is Successfully Imported Using HBase BulkLoad, but Different Results May Be Returned To the Same Query

Question

When importing data using HBase bulkload, data with identical rowkey values can be successfully imported, but queries may yield different results.

Answer

Data with the same rowkey values is loaded to HBase in sequence. HBase takes the data with the latest timestamp as the latest data. In default queries, only the latest data is returned for data with the same rowkey value, unless a specific timestamp is specified.

When you use bulkload to load data, HFiles are generated quickly in memory. However, this can cause query results to be disordered if data has the same rowkey value and timestamp.

When creating tables and loading data, ensure that rowkey values are designed to prevent duplicate rowkey values within the same data file.