MRS Hudi Connection Parameters (Internal Test)
Parameter |
Mandatory |
Description |
---|---|---|
Data Connection Type |
Yes |
The value is fixed at MRS Hudi. |
Name |
Yes |
Name of the data connection to create. Data connection names can contain a maximum of 100 characters. They can contain only letters, digits, underscores (_), and hyphens (-). |
Description |
No |
A description which can help identify the data connection more easily. It can contain a maximum of 100 characters. |
Tag |
No |
Attribute of the data connection to create. Tags make management easier.
NOTE:
The tag name can contain only letters, digits, and underscores (_) and cannot start with an underscore (_) or contain more than 100 characters. |
Applicable Modules |
Yes |
Select the modules for which this connection is available. The connection can be used in the selected modules.
NOTE:
|
Basic and Network Connectivity Configuration |
||
Manual |
No |
This parameter is mandatory when Connection Type is set to Proxy connection.
Select the connection mode. If you do not need to access MRS clusters in other projects or enterprise projects, select Cluster Name Mode.
|
MRS Cluster Name |
Yes |
The name of the MRS cluster. Select an MRS cluster that Hive belongs to. Only MRS clusters are supported. A Hadoop cluster can be selected only after it is managed by MRS. All the MRS clusters with the same project ID and enterprise project are displayed.
NOTE:
If the connection fails after you select a cluster, check whether the MRS cluster can communicate with the CDM instance which functions as the agent. They can communicate with each other in the following scenarios:
NOTE:
If an agent is connected to multiple MRS clusters and one of the MRS clusters is deleted or abnormal, connections to the other MRS clusters will be affected. Therefore, you are advised to connect an agent to only one MRS cluster. |
Manager IP |
No |
This parameter is mandatory when Connection String Mode is selected for Manual.
Set this parameter to the floating IP address of MRS Manager. Only MRS clusters are supported. A Hadoop cluster can be connected only after it is managed by MRS.
NOTE:
You can click Select next to the text box and select an MRS cluster in the same project and enterprise project. If you want to access an MRS cluster in another project or enterprise project, obtain and enter the floating IP address of MRS Manager and ensure that the connection's agent (CDM cluster) can communicate with the tenant-plane MRS cluster. To obtain the floating IP address of MRS Manager, log in to the active master node of the MRS cluster and run the ifconfig command. In the command output, the IP address of eth0:wsom is the floating IP address of MRS Manager. For details about how to log in to the master node of the MRS cluster, see Logging In to an ECS.
Enter multiple IP addresses based on the scenario in sequence and separate them with commas (,), for example, 127.0.0.1 or 127.0.0.1,127.0.0.2,127.0.0.3.
|
KMS Key |
Yes |
KMS key used to encrypt and decrypt data source authentication information. Select a default or custom key.
NOTE:
|
Agent |
Yes |
DataArts Studio cannot directly connect to non-fully managed services. An agent is required for DataArts Studio to communicate with non-fully managed services. A CDM cluster can function as an agent. If no CDM cluster is available, create one by referring to Creating a CDM Cluster.
NOTE:
If a CDM cluster functions as the agent for a data connection in Management Center, the cluster supports a maximum of 200 concurrent active threads. If multiple data connections share an agent, a maximum of 200 SQL, Shell, and Python scripts submitted through the connections can run concurrently. Excess tasks will be queued. You are advised to plan multiple agents based on the workload. |
Data Migration Configuration |
||
Properties |
No |
Configure other client attributes. The following are some examples: 1. socketTimeout: Hive JDBC connection timeout duration in seconds 2. fs.defaultFS: NameNode URI for the HDFS metadata 3. hadoop.user.name: username for accessing HDFS |
Data Source Authentication and Other Function Configuration |
||
Authentication Method |
Yes |
This parameter is mandatory when Manual is enabled.
It specifies the authentication method used for accessing the MRS cluster. The following options are available:
|
Username |
Yes |
The username of the MRS cluster. This parameter is mandatory when Connection Type is set to Proxy connection. If a new MRS user is used for connection, you need to log in to Manager and change the initial password.
To create a data connection for an MRS security cluster, do not use user admin. The admin user is the default management page user and cannot be used as the authentication user of the security cluster. You can create an MRS user by referring to Creating a Kerberos Authentication User for an MRS Security Cluster. When creating an MRS data connection, set Username and Password to the new MRS username and password.
NOTE:
|
Password |
Yes |
The password for accessing the MRS cluster. This parameter is mandatory when Connection Type is set to Proxy connection. |
OBS storage support |
Yes |
Whether to support OBS storage. If the Hudi table data is stored in OBS, you need to enable this function. |
Use Agency |
No |
This parameter is displayed when OBS storage support is enabled. If you enable the agency function, you can create a data connection without having a permanent AK/SK and execute CDM jobs using the scheduling identity configured in DataArts Factory. |
Public agency |
Yes |
This parameter is displayed when Use Agency is enabled. The agency is only used to check whether the connection agency function is normal. CDM jobs will be executed using the scheduling identity configured in DataArts Factory. |
AK |
Yes |
This parameter is available when CDM Enable is enabled, OBS storage support is enabled, and Use Agency is disabled. AK and SK are used to log in to the OBS server. You need to create an access key for the current account and obtain an AK/SK pair.
To obtain an access key, perform the following steps:
|
SK |
Yes |
|
OBS Test Path |
No |
This parameter is available when OBS storage support is set to Yes. Enter a complete file path. The permission to access the path will be verified through the metadata query API.
NOTE:
|
Creating a Kerberos Authentication User for an MRS Security Cluster
To create a data connection for an MRS security cluster, do not use user admin. The admin user is the default management page user and cannot be used as the authentication user of the security cluster. To create an MRS user, perform the following steps:
For clusters of MRS 3.x:
- Log in to MRS Manager as user admin.
- Choose System > Permission > Security Policy > Password Policy. Click Add Password Policy and add a policy under which the password never expires.
- Set Password Policy Name to neverexp.
- Set Password Validity Period (Days) to 0, indicating that the password never expires.
- Set Password Expiration Notification (Days) to 0.
- Retain the default values for other parameters.
- Choose Create to add a dedicated human-machine user as the Kerberos authentication user and set the password policy to neverexp. Select the user group superGroup for the user, and assign all roles to the user.
- For clusters of MRS 3.1.0 or later, the user must at least have permissions of the Manager_viewer role to create data connections in Management Center. To perform database, table, and data operations on components, the user must also have user group permissions of the components.
- For clusters earlier than MRS 3.1.0, the user must have permissions of the Manager_administrator or System_administrator role to create data connections in Management Center.
- A user with only the Manager_tenant or Manager_auditor permission cannot create connections.
. On the page displayed, click - Log in to Manager as the new user and change the initial password. Otherwise, the connection fails to be created.
- Synchronize IAM users.
- Log in to the MRS console.
- Choose Clusters > Active Clusters, select a running cluster, and click its name to go to its details page.
- In the Basic Information area of the Dashboard page, click Synchronize on the right side of IAM User Sync to synchronize IAM users.
- If the status is Synchronized, skip this step.
- When the policy of the user group to which the IAM user belongs changes from MRS ReadOnlyAccess to MRS CommonOperations, MRS FullAccess, or MRS Administrator, wait for 5 minutes until the new policy takes effect after the synchronization is complete because the SSSD (System Security Services Daemon) cache of cluster nodes needs time to be updated. Then, submit a job. Otherwise, the job may fail to be submitted.
- When the policy of the user group to which the IAM user belongs changes from MRS CommonOperations, MRS FullAccess, or MRS Administrator to MRS ReadOnlyAccess, wait for 5 minutes until the new policy takes effect after the synchronization is complete because the SSSD cache of cluster nodes needs time to be updated.
For clusters of MRS 2.x or earlier:
- Log in to the MRS Manager as user admin.
- On FusionInsight Manager, choose System Settings and click Configure Password Policy to modify the password policy.
- Set Password Validity Period (Days) to 0, indicating that the password never expires.
- Set Password Expiration Notification (Days) to 0.
- Retain the default values for other parameters.
- Choose
- For clusters of MRS 2.x or earlier, the user must have permissions of the Manager_administrator or System_administrator role to create data connections in Management Center.
- A user with only the Manager_tenant or Manager_auditor permission cannot create connections.
. On the page displayed, add a dedicated human-machine user as the Kerberos authentication user. Select the user group superGroup for the user, and assign all roles to the user.
- Log in to MRS Manager as the new user and change the initial password. Otherwise, the connection fails to be created.
- Synchronize IAM users.
- Log in to the MRS console.
- Choose Clusters > Active Clusters, select a running cluster, and click its name to go to its details page.
- In the Basic Information area of the Dashboard page, click Synchronize on the right side of IAM User Sync to synchronize IAM users.
- If the status is Synchronized, skip this step.
- When the policy of the user group to which the IAM user belongs changes from MRS ReadOnlyAccess to MRS CommonOperations, MRS FullAccess, or MRS Administrator, wait for 5 minutes until the new policy takes effect after the synchronization is complete because the SSSD (System Security Services Daemon) cache of cluster nodes needs time to be updated. Then, submit a job. Otherwise, the job may fail to be submitted.
- When the policy of the user group to which the IAM user belongs changes from MRS CommonOperations, MRS FullAccess, or MRS Administrator to MRS ReadOnlyAccess, wait for 5 minutes until the new policy takes effect after the synchronization is complete because the SSSD cache of cluster nodes needs time to be updated.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot