Data Is Successfully Imported Using HBase BulkLoad, but Different Results May Be Returned To the Same Query
Question
When importing data using HBase bulkload, data with identical rowkey values can be successfully imported, but queries may yield different results.
Answer
Data with the same rowkey values is loaded to HBase in sequence. HBase takes the data with the latest timestamp as the latest data. In default queries, only the latest data is returned for data with the same rowkey value, unless a specific timestamp is specified.
When you use bulkload to load data, HFiles are generated quickly in memory. However, this can cause query results to be disordered if data has the same rowkey value and timestamp.
When creating tables and loading data, ensure that rowkey values are designed to prevent duplicate rowkey values within the same data file.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot