Before You Start
HetuEngine supports quick joint query of multiple data sources and GUI-based data source configuration and management. You can quickly add a data source on the HSConsole page.
Table 1 lists the data sources supported by HetuEngine of the current version.
HetuEngine Mode |
Data Source |
Data Source Mode |
Supported Data Source Version |
---|---|---|---|
Security mode |
Hive |
Security mode |
MRS 3.x and FusionInsight 6.5.1 |
HBase |
MRS 3.x |
||
Elasticsearch |
MRS 3.1.2 and later |
||
HetuEngine |
MRS 3.1.1 and later |
||
Hudi |
MRS 3.1.2 and later |
||
ClickHouse |
MRS 3.1.1 and later |
||
IoTDB |
MRS 3.2.0 and later |
||
GaussDB |
GaussDB 200 and GaussDB A 8.0.0 and later |
||
MySQL |
MySQL 5.7, MySQL 8.0, and later |
||
Normal mode |
Hive |
Normal mode |
MRS 3.x and FusionInsight 6.5.1 |
HBase |
MRS 3.x |
||
Elasticsearch |
MRS 3.1.2 and later |
||
Hudi |
MRS 3.1.2 and later |
||
ClickHouse |
MRS 3.1.1 and later |
||
IoTDB |
MRS 3.2.0 and later |
||
MySQL |
Security mode |
MySQL 5.7, MySQL 8.0 and later |
|
GaussDB |
GaussDB 200 and GaussDB A 8.0.0 or later |
Operations such as adding, configuring, and deleting a HetuEngine data source takes effect dynamically without restarting the cluster.
A configured data source takes effect dynamically and you cannot disable this function. By default, the interval for a data source to dynamically take effect is 60 seconds. You can change the interval to a desired one by changing the value of catalog.scanner-interval in coordinator.config.properties and worker.config.properties by referring to 3.e in Creating a HetuEngine Compute Instance. See the following example.
catalog.scanner-interval =120s
HetuEngine supports query pushdown. It can push down queries or partial queries to connected data sources. This means that special predicates, aggregate functions, or other operations can be passed to the underlying database or file system for processing. Query pushdown brings the following benefits:
- Improves the overall query performance.
- Reduces the network traffic between HetuEngine and data sources.
- Reduces the load of remote data sources.
Whether HetuEngine supports query pushdown depends on specific connectors and the underlying data sources or storage systems related to the connectors.
- The data source cluster and the HetuEngine cluster must use different domain names. Two data sources (Hive, HBase, and Hudi) with the same domain name cannot be connected to HetuEngine at the same time.
- Nodes in the data source cluster and the HetuEngine cluster can communicate with each other on the service plane.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot