Updated on 2025-05-20 GMT+08:00

Connecting to a GaussDB(DWS) Data Source

This section describes how to connect DataArts Insight to a GaussDB(DWS) data source.

DataArts Insight can connect to GaussDB(DWS) through the following methods:

  • Public network connection: To connect to cloud service resources that do not belong to your current account, use this method. The data source must be bound to an EIP.
  • VPC network connection: To connect to cloud service resources under your current account, use this method.
  • VPCEP service: To connect to data sources that are not associated with your current account or for which you lack VPCEP permissions, as well as ECS self-built data sources, use this method.
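
The choice between the three methods can be sketched as a small decision helper. This is only an illustration of the rules listed above, not part of DataArts Insight: the DataSource class, its field names, and the order in which the conditions are checked are assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class DataSource:
    """Hypothetical description of a data source; field names are illustrative."""
    in_current_account: bool         # resource belongs to your current account
    has_eip: bool                    # an EIP is bound to the data source
    self_built_on_ecs: bool = False  # database installed by the user on an ECS


def choose_access_method(src: DataSource) -> str:
    """Suggest a connection method using the rules in the list above.

    The ordering of the checks is an assumption for this sketch.
    """
    if src.self_built_on_ecs:
        return "VPCEP service"
    if src.in_current_account:
        return "VPC network connection"
    if src.has_eip:
        return "Public network connection"
    # Not in the current account and no EIP: fall back to VPCEP.
    return "VPCEP service"


print(choose_access_method(DataSource(in_current_account=True, has_eip=False)))
```

A resource under another account that has an EIP could also be reached through VPCEP; the helper simply prefers the public network in that case.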

Preparations

Do the following to check if an EIP or VPCEP security group is added:
  1. Log in to the management console.
  2. In the service list, choose EI Enterprise Intelligence > Data Warehouse Service.
  3. Select a region in the upper left corner.
  4. In the navigation pane on the left, choose Dedicated Clusters > Clusters.
  5. Click the name of the cluster you want to connect to. The cluster details page is displayed.
  6. On the displayed page, find the Network area and click the security group next to Security Group. Click the Inbound Rules tab and check if an EIP or VPCEP security group is added.
    • If yes, return to the DataArts Insight data source editing page to connect to the data source.
    • If no, perform the following steps to add an inbound rule:
      1. Click Add Rule. In the displayed Add Inbound Rule dialog box, set the protocol, port, and source IP address.
      2. Confirm the parameter settings and click OK. Then return to the DataArts Insight page and connect to the data source.

      If the data source connection test fails after the security group is added, the possible cause is that SASL_SSL is not enabled for that data source.
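
The inbound-rule check in step 6 amounts to asking whether some rule covers the protocol, port, and source IP address of the incoming connection. The following sketch models that check offline with the Python standard library. The InboundRule class is a simplified, hypothetical model rather than the console's data format, and the port and addresses are placeholders.

```python
import ipaddress
from dataclasses import dataclass


@dataclass
class InboundRule:
    """Hypothetical, simplified model of a security group inbound rule."""
    protocol: str     # for example "TCP"
    port_min: int     # first port covered by the rule
    port_max: int     # last port covered by the rule
    source_cidr: str  # for example "203.0.113.0/24"


def is_allowed(rules, protocol, port, source_ip):
    """Return True if any rule permits the given protocol, port, and source IP."""
    ip = ipaddress.ip_address(source_ip)
    for rule in rules:
        if rule.protocol.upper() != protocol.upper():
            continue
        if not (rule.port_min <= port <= rule.port_max):
            continue
        if ip in ipaddress.ip_network(rule.source_cidr, strict=False):
            return True
    return False


# Placeholder values: port 8000 and the 203.0.113.0/24 documentation range.
rules = [InboundRule("TCP", 8000, 8000, "203.0.113.0/24")]
print(is_allowed(rules, "tcp", 8000, "203.0.113.7"))
```

If the function returns False for the address that DataArts Insight connects from, a rule like the one added in the steps above is still missing.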

Accessing a GaussDB(DWS) Data Source Through a Public Network

  1. Log in to the DataArts Insight console.
  2. In the upper left corner of the management console, select a region. Then select an enterprise project from Enterprise Project in the upper right corner of the Workspace page.
  3. On the top of the console, click Projects and click the name of the desired project.
  4. In the navigation pane on the left, choose Data Management > Data Sources. On the displayed page, click Create Data Source. In the slide-out panel, set Source Database Type to GaussDB(DWS) and Access Network Type to Public network.
  5. Set other parameters based on Table 1.
    Figure 1 Public network data source parameters

    Table 1 Parameters

    | Parameter | Mandatory | Description |
    | --- | --- | --- |
    | Source Database Type | Yes | Type of the data source to be connected. |
    | Access Network Type | Yes | Select Public network for GaussDB(DWS) instances with public IP addresses, not restricted by account, region, or other limitations. |
    | Name | Yes | Name the current data connection for easier subsequent use and configuration. |
    | Domain Name | Yes | Public IP address or domain name bound to the target data source, which can be obtained from the public IP address in the Connection Information area on the GaussDB(DWS) instance details page. |
    | Port | Yes | External port of the target data source on the corresponding node, which can be obtained from the port number in the Connection Information area on the GaussDB(DWS) instance details page. |
    | Database | Yes | Select the database in the target data source based on service requirements. |
    | SASL_SSL | N/A | It is used for trusted identity authentication and secure data transmission when DataArts Insight retrieves data from the data source. This function is enabled by default. |
    | Username | Yes | Username used for connection authentication. |
    | Password | Yes | Password used for connection authentication. |

  6. Click Test Connection to test the data source connectivity.
  7. Once the test is successful, click OK.
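
Before clicking Test Connection, it can help to confirm that every mandatory parameter in Table 1 has a value. The sketch below does this for a plain dictionary. The key names are illustrative only and do not correspond to any DataArts Insight API, and the sample values are placeholders.

```python
# Mandatory parameters from Table 1; the dictionary keys are hypothetical names.
REQUIRED_FIELDS = (
    "source_database_type",
    "access_network_type",
    "name",
    "domain_name",
    "port",
    "database",
    "username",
    "password",
)


def validate(config):
    """Return a list of problems found in a public network data source config."""
    problems = [
        f"missing: {name}"
        for name in REQUIRED_FIELDS
        if config.get(name) in (None, "")
    ]
    port = config.get("port")
    if port not in (None, ""):
        # A TCP port must be an integer in the range 1 to 65535.
        if not (isinstance(port, int) and 1 <= port <= 65535):
            problems.append("port must be an integer from 1 to 65535")
    return problems


example = {
    "source_database_type": "GaussDB(DWS)",
    "access_network_type": "Public network",
    "name": "dws-public-demo",     # placeholder connection name
    "domain_name": "203.0.113.7",  # placeholder public IP address
    "port": 8000,                  # placeholder port
    "database": "gaussdb",         # placeholder database name
    "username": "dbadmin",         # placeholder username
    "password": "",                # left empty on purpose
}
print(validate(example))
```

An empty list means all mandatory fields are present; it says nothing about whether the values are correct, which is what Test Connection checks.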

Accessing a GaussDB(DWS) Data Source Through a VPC Network

  1. Log in to the DataArts Insight console.
  2. In the upper left corner of the management console, select a region. Then select an enterprise project from Enterprise Project in the upper right corner of the Workspace page.
  3. On the top menu of the console, click Project. On the displayed My Projects page, click the name of the desired project.
  4. In the navigation pane on the left, choose Data Management > Data Sources. On the displayed page, click Create Data Source. In the slide-out panel, set Source Database Type to GaussDB(DWS) and Access Network Type to GaussDB(DWS).
  5. Set other parameters based on Table 2.
    Figure 2 VPC data source parameters

    Table 2 Parameters

    | Parameter | Mandatory | Description |
    | --- | --- | --- |
    | Source Database Type | Yes | Type of the accessed data source. In this example, GaussDB(DWS) is selected. |
    | Access Network Type | Yes | Select GaussDB(DWS) for GaussDB(DWS) instances purchased under the current account. After this type is selected, the service directly calls the query API of GaussDB(DWS) to obtain the instance list and fill it into the Instance dropdown list. |
    | Region | Yes | Region where target instances are deployed. Currently, DataArts Insight only supports direct connection to GaussDB(DWS) instances within the same region. For cross-region scenarios, select another access network type. |
    | Name | Yes | Name the current data connection for easier subsequent use and configuration. |
    | Description | No | Supplementary information about the current data connection for future reference. |
    | Instance | Yes | List of GaussDB(DWS) service instances under the region corresponding to the current account, allowing direct selection for connection. |
    | Servers | Yes | Information about the node to be connected to the GaussDB(DWS) instance. You can obtain the information from the private IP address and port number in the Connection Information area on the GaussDB(DWS) instance details page. NOTE: If the cluster is associated with an ELB, the ELB address will be automatically populated. However, if there is no ELB associated with the cluster, the IP address and port number of the first node will be used by default. |
    | Database | Yes | Select the database in the target data source based on service requirements. |
    | SASL_SSL | N/A | It is used for trusted identity authentication and secure data transmission when DataArts Insight retrieves data from the data source. This function is enabled by default. |
    | Username | Yes | Username used for connection authentication. |
    | Password | Yes | Password used for connection authentication. |

  6. Click Test Connection to test the data source connectivity.
  7. Once the test is successful, click OK.
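
The note on the Servers parameter describes a simple fallback: use the ELB address if the cluster has one, otherwise use the first node. A minimal sketch of that rule follows. The function name, the tuple representation of an address, and the sample addresses are assumptions for illustration and are not how DataArts Insight stores this information.

```python
from typing import List, Optional, Tuple

Address = Tuple[str, int]  # (IP address, port); a representation chosen for this sketch


def default_server(elb_address: Optional[Address], nodes: List[Address]) -> Address:
    """Pick the address that would be pre-filled in Servers, per the note in Table 2."""
    if elb_address is not None:
        # The cluster is associated with an ELB: its address is used.
        return elb_address
    if not nodes:
        raise ValueError("the cluster has no nodes to fall back to")
    # No ELB: the IP address and port number of the first node are used.
    return nodes[0]


# Placeholder private addresses and port.
nodes = [("192.168.0.11", 8000), ("192.168.0.12", 8000)]
print(default_server(None, nodes))
print(default_server(("192.168.0.100", 8000), nodes))
```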

Connecting to a GaussDB(DWS) Data Source Through a VPC Endpoint Service

VPCEP lets DataArts Insight connect to a data source by its VPCEP service name. This supports cross-account access and access to data sources self-built on ECSs. A main account and its IAM users can share a single connection channel, which streamlines connection management. This access method also requires no permission delegation, which further simplifies operations.

  • Prerequisites
  • Procedure
    1. Log in to the DataArts Insight console.
    2. In the upper left corner of the management console, select a region. Then select an enterprise project from Enterprise Project in the upper right corner of the Workspace page.
    3. On the top of the console, click Projects and click the name of the desired project.
    4. In the navigation pane on the left, choose Data Management > Data Sources. On the displayed page, click Create Data Source. In the slide-out panel, set Source Database Type to GaussDB(DWS) and Access Network Type to VPC Endpoint Service.
    5. Set other parameters based on Table 3.
      Figure 4 Connection through a VPC endpoint service
      Table 3 Parameters

      | Parameter | Mandatory | Description |
      | --- | --- | --- |
      | Source Database Type | Yes | Type of the accessed data source. In this example, GaussDB(DWS) is selected. |
      | Access Network Type | Yes | Select VPC Endpoint Service for GaussDB(DWS) instances under different accounts. If the instances do not have public IP addresses, you need to manually configure the VPCEP information to establish the network environment before setting up the connection. |
      | Region | Yes | Region where target instances are deployed. Currently, DataArts Insight only supports direct connection to GaussDB(DWS) instances within the same region. For cross-region scenarios, select another access network type. |
      | Name | Yes | Name the current data connection for easier subsequent use and configuration. |
      | Description | No | Supplementary information about the current data connection for future reference. |
      | VPC Endpoint Service | Yes | Name of the VPCEP service to be connected. The prerequisites for connecting to a VPCEP service are as follows: |
      | Verify | N/A | After entering the VPCEP service name, click Verify. If a green tick is displayed next to the VPC endpoint ID, the verification is successful. If a red exclamation mark is displayed next to the VPC endpoint ID, this is the first connection to the VPCEP service and you need to authorize the connection. To do so, log in to the VPCEP console. Under Network Console, choose VPC Endpoint > VPC Endpoint Services. On the displayed page, click the desired VPCEP service and click the Connection Management tab. On the tab page, authorize the connection. For details, see Managing Connections of a VPC Endpoint Service. Once the authorization is successful, click Verify again; the verification then succeeds. NOTE: All initial connections to the VPCEP service from different data sources require approval. Only an administrator account or an account with administrator privileges can approve the connection. If your current account does not have approval privileges, contact an administrator account for approval. |
      | VPC Endpoint ID | Yes | This parameter is automatically filled in after you enter the VPCEP service name and click Verify. |
      | Port | Yes | External port of the target data source on the corresponding node, which can be obtained from the port number in the Connection Information area on the GaussDB(DWS) instance details page. |
      | Database | Yes | Select the database in the target data source based on service requirements. |
      | Username | Yes | Username used for connection authentication. |
      | Password | Yes | Password used for connection authentication. |
      | SASL_SSL | N/A | It is used for trusted identity authentication and secure data transmission when DataArts Insight retrieves data from the data source. This function is enabled by default. |

    6. Click Test Connection to test the data source connectivity.
    7. Once the test is successful, click OK.
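
The Verify step in Table 3 follows a verify, approve, verify-again pattern. The sketch below models only that control flow. It does not call any VPCEP or DataArts Insight API: the set of approved services, the approve callback, and the service name are all hypothetical stand-ins for what happens in the two consoles.

```python
from enum import Enum


class VerifyResult(Enum):
    VERIFIED = "green tick"
    NEEDS_APPROVAL = "red exclamation mark"


def verify(service_name, approved_services):
    """Hypothetical model of clicking Verify for a VPCEP service name."""
    if service_name in approved_services:
        return VerifyResult.VERIFIED
    return VerifyResult.NEEDS_APPROVAL


def connect_flow(service_name, approved_services, approve):
    """Verify once; on a first-time connection, ask for approval and verify again.

    `approve` stands in for an administrator acting on the Connection
    Management tab and returns True if the connection was approved.
    """
    result = verify(service_name, approved_services)
    if result is VerifyResult.NEEDS_APPROVAL:
        if approve(service_name):
            approved_services.add(service_name)
        result = verify(service_name, approved_services)
    return result


approved = set()
# "example-vpcep-service" is a placeholder name.
print(connect_flow("example-vpcep-service", approved, lambda name: True))
```

If the approver declines, or the current account lacks approval privileges, the flow ends with the red exclamation mark state and the connection cannot be tested yet.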