Link to HBase

CDM supports the following HBase data sources:

MRS HBase

When connecting CDM to HBase of MRS, configure the parameters as described in Table 1.

Table 1 MRS HBase link parameters

Parameter

Description

Example Value

Name

Link name, which should be defined based on the data source type, so it is easier to remember what the link is for

mrs_hbase_link

Manager IP

Floating IP address of MRS Manager. Click Select next to the Manager IP text box to select an MRS cluster. CDM automatically fills in the authentication information.

127.0.0.1

Username

If Authentication Method is set to KERBEROS, you must provide the username and password used for logging in to MRS Manager.

cdm

Password

Password used for logging in to MRS Manager

-

Authentication Method

Authentication method used for accessing MRS
  • SIMPLE: Select this if MRS is in non-security mode.
  • KERBEROS: Select this if MRS is in security mode.

SIMPLE

Run Mode

Run mode of the HBase link. This parameter is used only when the HBase version is HBASE_2_X.

  • EMBEDDED: The link instance runs with CDM. This mode delivers better performance.
  • STANDALONE: The link instance runs in an independent process. If CDM needs to connect to multiple Hadoop data sources (MRS, Hadoop, or CloudTable) with both Kerberos and Simple authentication modes, select STANDALONE or configure different agents.

    Note: The STANDALONE mode is used to solve the version conflict problem. If the connector versions of the source and destination ends of the same link are different, a JAR file conflict occurs. In this case, you need to place the source or destination end in the STANDALONE process to prevent the migration failure caused by the conflict.

  • Agent: The link instance runs on an agent.

STANDALONE

FusionInsight HBase

When connecting CDM to HBase of FusionInsight HD, configure the parameters as described in Table 2.

Table 2 FusionInsight HBase link parameters

Parameter

Description

Example Value

Name

Link name, which should be defined based on the data source type, so it is easier to remember what the link is for

FI_hbase_link

Manager IP

IP address of FusionInsight Manager

127.0.0.1

Manager Port

Port number of FusionInsight Manager

28443

CAS Server Port

Port number of the CAS server used to connect to FusionInsight

20009

Username

Username used for logging in to FusionInsight Manager.

cdm

Password

Password used for logging in to FusionInsight Manager

-

Authentication Method

Authentication method used for accessing FusionInsight HD
  • SIMPLE: Select this if FusionInsight HD is in non-security mode.
  • KERBEROS: Select this if FusionInsight HD is in security mode.

Kerberos

Run Mode

Run mode of the HBase link. This parameter is used only when the HBase version is HBASE_2_X.

  • EMBEDDED: The link instance runs with CDM. This mode delivers better performance.
  • STANDALONE: The link instance runs in an independent process. If CDM needs to connect to multiple Hadoop data sources (MRS, Hadoop, or CloudTable) with both Kerberos and Simple authentication modes, select STANDALONE or configure different agents.

    Note: The STANDALONE mode is used to solve the version conflict problem. If the connector versions of the source and destination ends of the same link are different, a JAR file conflict occurs. In this case, you need to place the source or destination end in the STANDALONE process to prevent the migration failure caused by the conflict.

  • Agent: The link instance runs on an agent.

STANDALONE

Apache HBase

When connecting CDM to HBase of Apache Hadoop, configure the parameters as described in Table 3.

Table 3 Apache HBase link parameters

Parameter

Description

Example Value

Name

Link name, which should be defined based on the data source type, so it is easier to remember what the link is for

hadoop_hbase_link

ZK Link

ZooKeeper link of HBase

Format: <host1>:<port>,<host2>:<port>,<host3>:<port>

zk1.example.com:2181,zk2.example.com:2181,zk3.example.com:2181

Authentication Method

Authentication method used for accessing Hadoop
  • SIMPLE: Select this if Hadoop is in non-security mode.
  • KERBEROS: Select this if Hadoop is in security mode. Obtain the Principal account and Keytab File file of the client for authentication.

Kerberos

Principal

When Authentication Method is set to KERBEROS, the Principal account is used for authentication. You can contact the Hadoop administrator to obtain the account.

USER@YOUR-REALM.COM

Keytab File

When Authentication Method is set to KERBEROS, this file is used for authentication. You can contact the Hadoop administrator to obtain the file.

/opt/user.keytab

IP and Host Name Mapping

If the configuration file uses the host name, configure the mapping between the IP address and host name. Separate the IP addresses and host names by spaces and mappings by semicolons (;), carriage returns, or line feeds.

10.3.6.9 hostname01

10.4.7.9 hostname02

Run Mode

Run mode of the HBase link. This parameter is used only when the HBase version is HBASE_2_X.

  • EMBEDDED: The link instance runs with CDM. This mode delivers better performance.
  • STANDALONE: The link instance runs in an independent process. If CDM needs to connect to multiple Hadoop data sources (MRS, Hadoop, or CloudTable) with both Kerberos and Simple authentication modes, select STANDALONE or configure different agents.

    Note: The STANDALONE mode is used to solve the version conflict problem. If the connector versions of the source and destination ends of the same link are different, a JAR file conflict occurs. In this case, you need to place the source or destination end in the STANDALONE process to prevent the migration failure caused by the conflict.

  • Agent: The link instance runs on an agent.

STANDALONE