Updated on 2024-11-29 GMT+08:00

Relationship with Other Components

Elasticsearch Indexing HBase Data

When Elasticsearch indexes the HBase data, the HBase data is written to the HDFS and meanwhile Elasticsearch creates the corresponding HBase index data. The index ID is mapped to the rowkey of the HBase data, which ensures the unique mapping between each index data record and HBase data and implements full-text searching of the HBase data.

Batch indexing: For data already existing in HBase, an MR task is submitted to read all data in HBase, and then indexes are created in Elasticsearch. Figure 1 shows the indexing process.

Figure 1 Elasticsearch indexing HBase data