RegionTooBusyException Is Reported When Data Is Imported to a CloudTable HBase Table Through a Datasource Table
Symptom
A datasource table was used to import data into a CloudTable HBase table. The HBase table has one column family and a rowkey, and the source data consists of 100 million simulated records totaling 9.76 GB. The job failed after 10 million records had been imported.
Cause Analysis
- View driver error logs.
- View executor error logs.
- View task error logs.
The rowkey was poorly designed, so most of the write traffic was directed to a single node or to only a few nodes.
Procedure
- Pre-partition the HBase table.
- Hash the rowkey.
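The two steps above can be sketched in Python as follows. This is a minimal illustration, not the CloudTable or DLI API: the region count of 16, the two-character MD5 hex prefix, the `_` separator, the made-up `user...` keys, and the helper names `split_keys`, `salted_rowkey`, and `region_of` are all assumptions made for the example.

```python
import hashlib
from bisect import bisect_right
from collections import Counter

NUM_REGIONS = 16  # assumed region count; size it to the actual cluster


def split_keys(num_regions: int = NUM_REGIONS) -> list[str]:
    """Return num_regions - 1 two-hex-digit split points dividing 00..ff evenly."""
    step = 256 // num_regions
    return [f"{i * step:02x}" for i in range(1, num_regions)]


def salted_rowkey(rowkey: str) -> str:
    """Prefix the original rowkey with the first two hex digits of its MD5 digest."""
    prefix = hashlib.md5(rowkey.encode("utf-8")).hexdigest()[:2]
    return f"{prefix}_{rowkey}"


def region_of(rowkey: str, splits: list[str]) -> int:
    """Index of the region whose key range contains rowkey (lexicographic order)."""
    return bisect_right(splits, rowkey)


if __name__ == "__main__":
    splits = split_keys()
    # Sequential, made-up rowkeys that share a common prefix.
    keys = [f"user{i:09d}" for i in range(100_000)]
    plain = Counter(region_of(k, splits) for k in keys)
    salted = Counter(region_of(salted_rowkey(k), splits) for k in keys)
    print("regions hit without hashing:", len(plain))
    print("regions hit with hashing:", len(salted))
```

In the HBase shell, a split list like the one produced by `split_keys` is typically supplied when the table is created, for example `create 'my_table', 'cf', SPLITS => ['10', '20', ...]`, where the table and column family names are placeholders. Note that a hashed prefix trades away ordered range scans over the original rowkey in exchange for evenly spread writes.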
Summary and Suggestions
Distribute the data across different RegionServers by adding distribute by rand() to the end of the insert statement.
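As a rough sketch of that suggestion, the Python helper below appends the clause to an insert statement before it is submitted. The helper name `with_random_distribution` and the table names are hypothetical, and the check for an existing clause is a simple substring match rather than real SQL parsing.

```python
def with_random_distribution(insert_sql: str) -> str:
    """Append DISTRIBUTE BY rand() to an INSERT ... SELECT statement."""
    # Drop surrounding whitespace and a trailing semicolon, if any.
    stmt = insert_sql.strip().rstrip(";").rstrip()
    if "distribute by" in stmt.lower():
        return stmt  # already has a DISTRIBUTE BY clause; leave it unchanged
    return f"{stmt} DISTRIBUTE BY rand()"


# Hypothetical table names, for illustration only.
sql = with_random_distribution(
    "INSERT INTO hbase_datasource_table SELECT * FROM source_table;"
)
print(sql)
```

The random distribution key shuffles rows evenly across tasks before the write, so no single task sends one contiguous rowkey range to the same RegionServer.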