Updated on 2025-05-20 GMT+08:00

Data Source Overview

Data sources are the foundation of data analysis. To analyze data, you must first connect DataArts Insight to a data source. DataArts Insight provides secure and reliable data connection methods, offering data analysts a variety of ways to access data sources. This effectively improves data analysis efficiency. DataArts Insight currently supports GaussDB(DWS), GaussDB, MySQL, PostgreSQL, Doris, ClickHouse, API, DLI, and Hive data sources. This section walks you through how to connect to a data source.

Supported Data Source Types

Table 1 Supported data source types

Data Source

Type

Connection Method

How to Connect

GaussDB(DWS)

Big data platform

Public network/VPC endpoint service/VPC

Connecting to a GaussDB(DWS) Data Source

GaussDB

Big data platform

Public network

Connecting to a GaussDB Data Source

MySQL

Relational database

Public network/VPC endpoint service/VPC

Connecting to a MySQL Data Source

PostgreSQL

Big data platform

Public network/VPC endpoint service/VPC

Connecting to a PostgreSQL Data Source

Doris

Big data platform

Public network/VPC endpoint service/VPC

Connecting to a Doris Data Source

ClickHouse

Big data platform

Public network/VPC endpoint service/VPC

Connecting to a ClickHouse Data Source

DLI

Big data platform

N/A

Connecting to a DLI Data Source

Hive

Big data platform

Public network/VPC endpoint service/VPC

Connecting to a Hive Data Source

Notes and Constraints

  • The data source and the DataArts Insight project must belong to the same enterprise project.
  • The data source and the DataArts Insight project must belong to the same region. This means only data sources in the same region as the DataArts Insight project are allowed to connect to DataArts Insight.
  • The network type of data sources must meet the following requirements:
    • If the data source is an on-premises database, a public network or a dedicated connection is required. Ensure that the host where the data source is located can access the public network and the port has been enabled in the firewall rule.
    • If the data source is a cloud service, such as GaussDB(DWS) and ClickHouse, the following conditions must be met for network interconnection:
      • If DataArts Insight and the cloud service are in different regions, a public network or a dedicated connection is required to enable the connection.
      • If DataArts Insight and the cloud service are in the same region:
        • By default, different instances in the same VPC, subnet, and security group can communicate with each other.
        • If the VPC is the same as the VPC but the subnets or security groups are different, you need to configure routing rules and security group rules. For details about how to configure routing rules, see How Do I Configure Routing Rules? For details about how to configure security group rules, see Security Group Configuration.
        • DataArts Insight automatically generates VPC endpoints for connecting to the VPC networks when data sources from various VPCs are connected to it.