Updated on 2024-04-29 GMT+08:00

Configuring a Host Connection

Table 1 Host Connection parameters

Parameter

Mandatory

Description

Data Connection Type

Yes

Host Connection is selected by default and cannot be changed.

Name

Yes

Name of the data connection to create. Data connection names can contain a maximum of 100 characters. They can contain only letters, digits, underscores (_), and hyphens (-).

Tag

No

Attribute of the data connection to create. Tags make management easier.
NOTE:

The tag name can contain only letters, digits, and underscores (_) and cannot start with an underscore (_) or contain more than 100 characters.
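
The naming rules for Name and Tag can be checked before the form is submitted. The following is a minimal sketch, not part of the product: the function names are hypothetical, and it assumes that "letters" means ASCII letters and that neither field can be empty.

```python
import re

# Assumption: "letters" means ASCII letters (A-Z, a-z).
# Name: letters, digits, underscores, hyphens; at most 100 characters.
NAME_PATTERN = re.compile(r"[A-Za-z0-9_-]{1,100}")
# Tag: letters, digits, underscores; no leading underscore; at most 100 characters.
TAG_PATTERN = re.compile(r"[A-Za-z0-9][A-Za-z0-9_]{0,99}")


def is_valid_connection_name(name: str) -> bool:
    """Return True if name satisfies the documented Name rules."""
    return NAME_PATTERN.fullmatch(name) is not None


def is_valid_tag(tag: str) -> bool:
    """Return True if tag satisfies the documented Tag rules."""
    return TAG_PATTERN.fullmatch(tag) is not None
```

For example, is_valid_connection_name("host-conn_01") passes, while is_valid_tag("_prod") fails because the tag starts with an underscore.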

Applicable Modules

Yes

Select the modules for which this connection is available.

All modules are selected by default, so the connection is available to every module that supports the connected data source. For details about the data sources supported by each module, see Data Sources.

Basic and Network Connectivity Configuration

Host Address

Yes

IP address of the Linux host

For details, see Viewing Details About an ECS.

Agent

Yes

CDM cluster used as an agent.

NOTE:
  • If a CDM cluster functions as the agent for a data connection in Management Center, the cluster supports a maximum of 200 concurrent active threads. If multiple data connections share an agent, a maximum of 200 SQL, Shell, and Python scripts submitted through the connections can run concurrently. Excess tasks will be queued. You are advised to plan multiple agents based on the workload.

  • When scheduling shell or Python scripts, the agent accesses the ECS. If such scripts are scheduled frequently, the ECS may add the private IP address of the agent to its blocklist. To ensure normal job scheduling, you are advised to log in to the ECS as the root user and add the private IP address bound to the agent (CDM cluster) to the /etc/hosts.allow file.

    For details about how to obtain the private IP address of the CDM cluster, see Viewing Basic Cluster Information and Modifying Cluster Configurations.
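
The two notes above can be turned into a small planning aid. This is a sketch under stated assumptions: the function names are hypothetical, the agent count assumes that scripts are spread evenly across agents, and the "sshd:" daemon prefix follows the usual TCP Wrappers "daemon: client" line format, which this page does not spell out. The example IP address is a placeholder.

```python
import ipaddress
import math

# Limit stated in the note: 200 concurrent active threads per agent.
MAX_THREADS_PER_AGENT = 200


def agents_needed(peak_concurrent_scripts: int) -> int:
    """Estimate how many agents keep tasks from queuing at peak load.

    Assumption: scripts are distributed evenly across the agents.
    """
    if peak_concurrent_scripts < 0:
        raise ValueError("peak_concurrent_scripts must not be negative")
    return max(1, math.ceil(peak_concurrent_scripts / MAX_THREADS_PER_AGENT))


def hosts_allow_entry(agent_private_ip: str, daemon: str = "sshd") -> str:
    """Build a line for /etc/hosts.allow that allows the agent's IPv4 address.

    Assumption: the host uses the TCP Wrappers "daemon: client" format.
    Raises ValueError if the address is not a valid IPv4 address.
    """
    ip = ipaddress.IPv4Address(agent_private_ip)
    return f"{daemon}: {ip}"


if __name__ == "__main__":
    print(agents_needed(450))
    print(hosts_allow_entry("192.168.0.10"))  # placeholder address
```

The generated line still has to be appended to /etc/hosts.allow by the root user on the ECS; the sketch only formats and validates it.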

Port

Yes

SSH port number of the host.

By default, port 22 is used to log in to a Linux host. If the port number has been changed, you can obtain the new port number from the Port field in the /etc/ssh/sshd_config file.
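
The port lookup described above can be scripted. The following is a minimal sketch with a hypothetical function name: it takes the text of an sshd_config file and returns the configured ports, falling back to 22 when no Port line is present. It does not follow Include directives.

```python
from typing import List

DEFAULT_SSH_PORT = 22


def ssh_ports(sshd_config_text: str) -> List[int]:
    """Return the ports set by Port directives, or [22] if none is set."""
    ports = []
    for raw_line in sshd_config_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comments such as "#Port 22".
        if not line or line.startswith("#"):
            continue
        # Keywords are case-insensitive; "Port 2222" and "Port=2222" are both accepted.
        parts = line.replace("=", " ", 1).split()
        if len(parts) >= 2 and parts[0].lower() == "port":
            ports.append(int(parts[1]))
    return ports or [DEFAULT_SSH_PORT]


if __name__ == "__main__":
    with open("/etc/ssh/sshd_config") as config_file:
        print(ssh_ports(config_file.read()))
```

Reading /etc/ssh/sshd_config usually requires root privileges on the host, so the script is meant to be run there rather than from the console.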

KMS Key

Yes

KMS key used to encrypt and decrypt the authentication information for the data source

Data Source Authentication and Other Function Configuration

Username

Yes

Username for logging in to the host

Login Mode

Yes

Mode for logging in to the host

  • Key Pair
  • Password

Key Pair

Yes

This parameter is available only when Login Mode is set to Key Pair.

If Key Pair is the login mode of the host, you need to obtain the private key file, upload it to OBS, and select an OBS path.

NOTE:

The uploaded private key must match the public key configured on the host. For details, see Application Scenarios for Using Key Pairs.

Key Pair Password

Yes

If no password is set for the key pair, you do not need to set this parameter.

Password

Yes

This parameter is available only when Login Mode is set to Password.

If the host uses password login, enter the login password.

Host Connection Description

No

Descriptive information about the host connection

  • The maximum number of shell or Python scripts that can run concurrently on the ECS is determined by the value of MaxSessions in the /etc/ssh/sshd_config file on the ECS. Set MaxSessions based on the scheduling frequency of shell or Python scripts.
  • Ensure that you have the permission to create and execute files in the /tmp directory on the host.
  • Shell and Python scripts are executed in the /tmp directory on an ECS. Ensure that the disk space of the /tmp directory is not used up.
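
The three points above can be checked on the ECS before scripts are scheduled. The following is a sketch with hypothetical function names and a hypothetical 100 MiB free-space threshold. It assumes the OpenSSH default of 10 for MaxSessions when the option is not set, ignores Match blocks, and must be run as the login user configured for the connection. It does not detect a /tmp that is mounted with the noexec option.

```python
import os
import shutil

# Assumption: OpenSSH default when MaxSessions is not set in sshd_config.
DEFAULT_MAX_SESSIONS = 10


def max_sessions(sshd_config_text: str) -> int:
    """Return the first MaxSessions value in the text, or the default."""
    for raw_line in sshd_config_text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        parts = line.replace("=", " ", 1).split()
        if len(parts) >= 2 and parts[0].lower() == "maxsessions":
            return int(parts[1])
    return DEFAULT_MAX_SESSIONS


def tmp_dir_ready(path: str = "/tmp", min_free_bytes: int = 100 * 1024 * 1024) -> bool:
    """Check that the current user can create entries in path and that
    at least min_free_bytes of disk space is free there."""
    can_write_and_enter = os.access(path, os.W_OK | os.X_OK)
    free_bytes = shutil.disk_usage(path).free
    return can_write_and_enter and free_bytes >= min_free_bytes


if __name__ == "__main__":
    print("/tmp ready:", tmp_dir_ready())
```

If max_sessions returns a value lower than the number of scripts expected to run at the same time, raise MaxSessions in /etc/ssh/sshd_config accordingly.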