Connecting to an MRS Hive Data Source
Overview
ROMA Connect can use MRS Hive as a data source for data integration tasks. Before using the MRS Hive data source, you need to connect it to ROMA Connect.
- If two data integration tasks use MRS data sources of different versions (including MRS Hive, MRS HDFS, and MRS HBase) and Kerberos authentication is enabled for the MRS data sources, the two data integration tasks cannot be executed at the same time. Otherwise, the integration tasks fail.
- Only a maximum of one million data records can be integrated.
Prerequisites
- Each connected data source must belong to an integration application. Before connecting a data source, ensure that an integration application is available. Otherwise, create an integration application first.
- Kerberos authentication has been enabled for the MRS cluster where the MRS Hive data source is located. The execution permission has been configured for machine-machine interaction users. For details, see Setting Execution Permission.
Procedure
- Log in to the ROMA Connect console. On the Instances page, click View Console next to a specific instance.
- In the navigation pane on the left, choose Data Sources. In the upper right corner of the page, click Access Data Source.
- On the Default tab page, select MRS Hive and click Next.
- Configure the data source connection information.
Table 1 Data source connection information Parameter
Description
Name
Enter a data source name. It is recommended that you enter a name based on naming rules to facilitate search.
Integration Application
Select the integration application to which the data source belongs.
Description
Enter the descriptive information.
HDFS URL
- If the root directory is used, set this parameter to hdfs:///hacluster. This operation requires the administrator rights.
- If a planned directory is used, set this parameter to the planned directory.
- If a user database directory is used, for example, /user/hive/testdb, the user must have the permission on the directory.
Machine-machine Username
Enter the machine-machine username for connecting to MRS Hive.
Configuration File
Click Upload File to upload the MRS Hive configuration file. For details, see Obtaining the MRS Hive Configuration File.
Obtaining the MRS Hive Configuration File
- Obtain krb5.conf and user.keytab files.
Download the user authentication file from MRS Manager by following the procedure described in Downloading a User Authentication File, and decompress the file to obtain the krb5.conf and user.keytab files.
- Obtain the hiveclient.properties, core-site.xml, hdfs-site.xml, and hosts files.
Download the client configuration file from the MRS console by following the procedure described in Updating a Client Configuration File. After the file is decompressed:
- Obtain the hosts file from xxx_Services_ClientConfig_ConfigFiles.
- Obtain the hiveclient.properties file from xxx_Services_ClientConfig_ConfigFiles > Hive > config.
- Obtain the core-site.xml and hdfs-site.xml files from xxx_Services_ClientConfig_ConfigFiles > HDFS > config.
- Create a Version file.
Create a text file named Version without an extension, and add version=MRS 2.1.0 to the file.
- Generate the MRS Hive configuration file.
Save the obtained files to a new directory and compress them into a .zip package. All files are stored in the root directory of the .zip package.
- The file name contains a maximum of 255 characters, including only letters and digits.
- The file size cannot exceed 2 MB.
The connection configuration of the MRS Hive data source is similar to that of the FI HDFS data source. For details, see Connecting to an FI HDFS Data Source.
- Click Check Connectivity to check the connectivity between ROMA Connect and the data source.
- If the test result is Data source connected successfully, go to the next step.
- If the test result is Failed to connect to the data source, check the data source status and connection parameters, and click Recheck until the connection is successful.
- Click Create.
Last Article: Connecting to an MQS Data Source
Next Article: Connecting to an MRS HDFS Data Source
Did this article solve your problem?
Thank you for your score!Your feedback would help us improve the website.