Updated on 2026-05-20 GMT+08:00

DWS

DataArts Migration supports the main versions of Data Warehouse Service (DWS), meeting your data synchronization requirements in various deployment environments.

Preparation and Constraints

  • Network requirements

    The DWS data source can communicate with CDM. This ensures smooth data transmission.

  • Required permissions
    DWS read and write permissions:
    • Read permission:
      • To read data from DWS, you can grant the DWS ReadOnlyAccess system policy to the IAM user or user group of DataArts Migration. This policy allows users to view DWS resources.
      • For more refined permission control, you can create custom policies to grant users read permissions, such as the permissions to query databases and tables.
    • Write permission:
      • To write data to DWS, you can grant the DWS FullAccess or DWS Administrator system policy to the IAM user or user group of DataArts Migration. This policy has all permissions for DWS.
      • You can create custom policies to grant users write permissions, such as the permissions to insert data, create tables, and update data.

Supported Data Types

The following table lists the field types supported for reading and writing DWS data. The field types supported by DWS vary depending on the DWS version. DataArts Migration uses DWS 8.1.X as the baseline version and adapts to mainstream types and their variants.

Category

Field Type

Read Support

Write

Integer

TINYINT (also named INT1)

Yes

Yes

SMALLINT (also named INT2)

Yes

Yes

INTEGER (also named INT4)

Yes

Yes

BINARY_INTEGER

Yes

Yes

BIGINT (also named INT8)

Yes

Yes

INT

Yes

Yes

Floating point

NUMERIC

Yes

Yes

DECIMAL

Yes

Yes

NUMBER

Yes

Yes

REAL

Yes

Yes

FLOAT

Yes

Yes

FLOAT4

Yes

Yes

DOUBLE PRECISION

Yes

Yes

FLOAT8

Yes

Yes

BINARY_DOUBLE

Yes

Yes

DEC

Yes

Yes

Serial integer

SMALLSERIAL

Yes

Yes

SERIAL

Yes

Yes

BIGSERIAL

Yes

Yes

Currency

MONEY

Yes

Yes

Boolean

BOOL

Yes

Yes

BOOLEAN

Yes

Yes

Character

CHAR

Yes

Yes

BPCHAR

Yes

Yes

CHARACTER

Yes

Yes

NCHAR

Yes

Yes

VARCHAR (also named VARCHAR2)

Yes

Yes

CHARACTER VARYING

Yes

Yes

VARCHAR2

Yes

Yes

NVARCHAR2

Yes

Yes

CLOB

Yes

Yes

TEXT

Yes

Yes

Date and time

DATE

Yes

Yes

TIME

Yes

Yes

TIME WITHOUT TIME ZONE

Yes

Yes

TIME WITH TIME ZONE

Yes

Yes

TIMESTAMP

Yes

Yes

TIMESTAMP WITHOUT TIME ZONE

Yes

Yes

TIMESTAMP WITH TIME ZONE

Yes

Yes

SMALLDATETIME

Yes

Yes

INTERVAL

Yes

Yes

Bit

BIT

Yes

Yes

VARBIT

Yes

Yes

Binary

BLOB

×

×

RAW

×

×

BYTEA

×

×

JSON

JSON

Yes

Yes

Supported Migration Scenarios

DataArts Migration supports the following offline synchronization modes:

  • Single table synchronization

    DataArts Migration supports table/file synchronization in data ingestion into a data lake or data migration to the cloud.

  • Database and table shard synchronization

    DataArts Migration supports synchronization of data from multiple databases and tables in data ingestion into a data lake or data migration to the cloud.

  • Entire DB migration

    DataArts Migrations supports synchronization of data from an on-premises database in data ingestion into a data lake or data migration to the cloud.

Database and table shard synchronization and entire DB migration are not supported in all regions. The following table lists the supported DWS migration scenarios.

Supported Migration Scenario

DWS Single Table Read

DWS Single Table Write

DWS Database/Table Shard Read

DWS Database/Table Shard Write

DWS Entire DB Read

DWS Entire DB Write

Supported

Yes

Yes

No

Yes

Supported in some regions

No

Core Capabilities

  • Connection configuration

    Configuration Item

    Supported

    Description

    User/AK

    Yes

    User AK/SK authentication ensures connection security.

    SSL encryption

    Yes

    SSL encryption ensures secure data transmission. Currently, SSL authentication can be enabled only for RDS.

    SSL authentication

    One-way authentication

    Currently, SSL authentication can be enabled only for RDS. The standard Huawei Cloud CA certificate is used for authentication.

    Connection configuration optimization

    Yes

    Connection configuration such as connectTimeout can be optimized to improve connection performance.

    Custom driver

    No

    Custom drivers provide better flexibility. They are not supported for now.

  • Read capabilities

    Configuration Item

    Supported

    Description

    Shard concurrency

    Yes

    Horizontal sharding based on primary keys or common fields and multi-thread concurrent extraction significantly improve the throughput and efficiency.

    Dirty data processing

    Yes

    Abnormal data can be written to the dirty data bucket to prevent job failures caused by a small amount of abnormal data.

    Custom fields

    Yes

    You can add computed columns, constant columns, or masking functions for tasks to meet personalized service requirements.

    Incremental read

    Yes

    Where conditions and the SQL mode enable incremental data reading.

    Stream and batch reading

    Batch reading

    Batch reading improves efficiency when there is a small or medium amount of data.

    Optimization of the number of rows read

    Yes

    You can set Fetch Size in the connection to properly control the amount of data to be transmitted. This improves performance and prevents a transmission delay or the system from being overloaded when there is a large amount of data.

    View reading

    Yes

    Data can be read from views. This enables flexible data integration and processing.

  • Write capabilities

    Configuration Item

    Supported

    Description

    Conflict resolution

    Yes

    Multiple write modes, such as copy and upsert, can flexibly handle data conflicts.

    Pre- and post-import processing

    Yes

    Operations such as preSql and delete can clean and process data before and after data import.

    Concurrent write

    Yes

    Concurrent write improves efficiency.

    Optimization of the number of written rows

    Yes

    You can set the number of rows written by each request in the connection to properly control the amount of data to be transmitted. This improves performance and prevents a transmission delay or the system from being overloaded when there is a large amount of data.

    Dirty data processing

    Yes

    Abnormal data can be written to the dirty data bucket to prevent job failures caused by a small amount of abnormal data.

    Table creation in editing state

    Yes

    You can create a table when editing a job using the table structure and field types you have defined based on the data structure and requirements.

Creating a Data Source

Create a data source in Management Center. For details, see Configuring Data Connection Parameters.

Creating an Offline Data Migration Job

Create a DWS migration job in DataArts Factory. For details, see Creating an Offline Processing Migration Job.