Creating Indexes in Batches

If a large amount of data exists in a data table, you can create indexes for the data in batches based on MapReduce tasks.

Only indexes in the INACTIVE state can be created in batches. To re-create the data of an existing index, change the index state to INACTIVE first.
If the data table contains a large amount of data, index creation takes a long time. You are advised to run the command with nohup in the background so that the operation is not interrupted unexpectedly, for example, when the terminal session closes.

Run the following command on the HBase client to create indexes in batches:

hbase org.apache.hadoop.hbase.hindex.global.mapreduce.GlobalTableIndexer -Dtablename.to.index='table' -Dindexnames.to.build='idx1'
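As a sketch of the nohup advice above, the same command can be launched in the background with its output redirected to a log file. The log file name gsi_index_build.log and the PATH check are assumptions added for illustration; the table and index names are the placeholders used in the command above.

```shell
# Placeholder names from the command above; replace them with your own.
TABLE='table'
INDEXES='idx1'
# Hypothetical log file name, chosen only for this example.
LOG_FILE='gsi_index_build.log'

if command -v hbase >/dev/null 2>&1; then
  # nohup keeps the job running after the terminal session closes;
  # the trailing & sends it to the background.
  nohup hbase org.apache.hadoop.hbase.hindex.global.mapreduce.GlobalTableIndexer \
    "-Dtablename.to.index=${TABLE}" \
    "-Dindexnames.to.build=${INDEXES}" \
    > "${LOG_FILE}" 2>&1 &
  echo "Index build started in the background (PID $!), log: ${LOG_FILE}"
else
  echo "hbase not found on PATH; run this on a node with the HBase client installed" >&2
fi
```

You can follow the progress of the job with tail -f on the log file.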

The parameters are described as follows:

tablename.to.index: indicates the name of the data table for which the indexes are to be created.
indexnames.to.build: indicates the names of the indexes you want to create in batches. To specify multiple indexes, separate the names with number signs (#).
(Optional) hbase.gsi.cleandata.enabled: indicates whether to clear the index table before creating indexes. The default value is false.
(Optional) hbase.gsi.cleandata.timeout: indicates the timeout interval for clearing the index table before creating indexes. The unit is seconds. The default value is 1800.
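The following sketch combines the parameters described above: it builds two indexes in one run and clears the index table first with a longer timeout. The index name idx2 and the timeout value 3600 are hypothetical, and passing the optional parameters as -D options in the same way as the required ones is an assumption based on the command format shown earlier.

```shell
# Hypothetical names and values; replace them with your own.
TABLE='table'
INDEXES='idx1#idx2'     # two index names separated by a number sign (#)
CLEAN_TIMEOUT=3600      # seconds; the documented default is 1800

# Collect the -D options as positional parameters, one option per word.
# Assumption: the optional parameters are passed like the required ones.
set -- \
  "-Dtablename.to.index=${TABLE}" \
  "-Dindexnames.to.build=${INDEXES}" \
  "-Dhbase.gsi.cleandata.enabled=true" \
  "-Dhbase.gsi.cleandata.timeout=${CLEAN_TIMEOUT}"

if command -v hbase >/dev/null 2>&1; then
  hbase org.apache.hadoop.hbase.hindex.global.mapreduce.GlobalTableIndexer "$@"
else
  # Without an HBase client, only show the options that would be passed.
  echo "hbase not found on PATH; options would be: $*"
fi
```

Both indexes must be in the INACTIVE state before the command is run.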
