Updated on 2025-07-11 GMT+08:00

Versions

Select a DataArts Studio version with caution based on the functions and specifications you need.
  • After you buy an instance of a specified version, you cannot directly downgrade the version. For example, if you have bought an instance of the enterprise version, you cannot directly downgrade the instance to the starter version. Instead, you will need to back up data of the instance, unsubscribe from it, buy a new instance, and migrate the backup data to the new instance.
  • If your business volume keeps increasing and the instance version you have bought cannot meet your requirements, you can upgrade the instance version. To upgrade the instance version, log in to the DataArts Studio console, locate the target DataArts Studio instance, click Upgrade, and buy a package with higher specifications.

Version Scenarios

The version mode of DataArts Studio has changed in some regions to provide flexible resource configuration and lightweight data governance capabilities. For details about the old and new version modes, see Version Mode.
  • Now you can purchase DataArts Studio instances of the starter, expert, or enterprise version.
  • The version mode change does not affect the DataArts Studio instances you have purchased before, which may be of the starter, basic, advanced, professional, or enterprise version.

Compared with the old version mode, the new version mode provides more favorable prices and more flexible resource scaling. If you want to experience the new version mode, you are advised to buy a new DataArts Studio instance, migrate service data from the original instance to the new instance by referring to DataArts Studio Data Migration Configuration, and then unsubscribe from the original instance.

Table 1 lists the recommended application scenarios of each version.

Table 1 Recommended application scenarios for each DataArts Studio version

Version Mode

Version

Application Scenario

New version mode

Starter

In the initial construction phase, the data lake project mainly manages data ETL tasks in big data development scenarios and does not involve data governance.

Expert

Small- and medium-sized enterprises (SMEs) have full-time data development and governance personnel and require lightweight data governance capabilities, such as data quality, data assets, and data services. Cost-effectiveness is preferred.

Enterprise

A complete data management team and system are available. For medium- and large-sized enterprises, the enterprise information architecture, data standards, data models, and data metrics need to be implemented to match the complete DAYU data governance methodology.

Old version mode

Starter

A primary data lake project with no full-time data development engineers and no data governance needs

Basic

One or two full-time data development engineers, and up to 1,000 data tables

Advanced

Five to ten full-time data development engineers, clear data standards and efficient data quality management, and up to 2,000 data tables

Professional

Large or medium enterprises with a team of 10 to 30 full-time data development engineers and well-designed systems

Enterprise

Large enterprises and enterprises with multiple branches

Version Specifications (New Version Mode)

The new version mode provides the basic, expert, and enterprise version. Table 2 provides the modules and functions supported by each version.

Table 2 Modules supported by DataArts Studio

DataArts Studio Module

Starter

Expert

Enterprise

DataArts Migration (CDM Jobs)

DataArts Migration (Offline Jobs)

DataArts Migration (Real-Time Jobs)

Management Center

DataArts Architecture

x

x

DataArts Factory

DataArts Quality

x

This module is available but does not support comparison jobs or quality reports.

DataArts Catalog

x

Supported. However, data directories (categories, tags, and collection tasks) cannot be exported through resource migration in the management center.

DataArts DataService

x

DataArts Security

x

Basic data security features

Advanced data security features

Table 3 lists advanced data security features. Except those, all the other data security capabilities are basic features.

Table 3 Advanced data security features

Capability

Sub-capability

Advanced Feature

Internal Test/Commercial Use

Access permissions management

Permission review

  • Requests for field permissions
  • Review policies
  • Validity period management
  • Permission notifications
  • Database creation requests
  • Database creation review

Commercial use in all regions

Permission applications

  • Account mapping
  • Future table permissions

Commercial use in all regions

DataArts resource permissions

  • Level-1 directory permissions
  • Download permission control

Commercial use in all regions

Sensitive data identification

Data identification rules

  • Combination rules
  • Ingested data detection rules

Commercial use in all regions

Data security operations

Table permission reports

  • Table permission view

Commercial use in all regions

Member permission view

  • Member permission view

Commercial use in all regions

In addition, DataArts Studio provides different specifications for different versions. For details, see Table 4. If the specifications cannot meet your business growth requirements, you can buy specifications incremental packages on the console.

Table 4 DataArts Studio version specifications (a single instance)

DataArts Studio Specifications

Starter

Expert

Enterprise

DataArts Studio CDM cluster[1]

Number of clusters: 1

Name: cdm.medium

vCPUs: 4 | Memory: 8 GB

Number of clusters: 1

Name: cdm.medium

vCPUs: 4 | Memory: 8 GB

Number of clusters: 1

Name: cdm.medium

vCPUs: 4 | Memory: 8 GB

Job node scheduling times/day[2]

5,000/day

5,000/day

5,000/day

Number of technical assets[3]

Not supported

500

5000

Number of data models[4]

Not supported

Not supported

100

Annotation:

[1] DataArts Studio CDM cluster: This is a free cluster provided together with the DataArts Studio instance. It can be used as an agent for the data connections in Management Center. However, you are not advised to use the node in a data migration job when the node is used as an agent. To buy a CDM cluster used to run CDM jobs, buy a CDM incremental package. For details, see Buying a CDM Incremental Package.

[2] Job scheduling times/day: This specification is calculated based on the total number of scheduling times of data development jobs, quality jobs, reconciliation jobs, service scenarios, and metadata collection jobs executed every day. You can expand the capacity using the job node scheduling times/day incremental package, for details, see Buying a Job Node Scheduling Times/Day Incremental Package. The number of scheduling times of data development job per day is measured by node (including the Dummy node), covering PatchData tasks, dry run, and scheduling calendar dry run but not test or retry upon failures. For example, if a job contains two DWS SQL nodes and one Dummy node, starts to be executed at 00:00 every day, is scheduled every 10 hours, and a PatchData task is performed on the current day to patch data of the last 10 days, then the number of scheduling times of the job is 66 (2 x 3 + 2 x 3 x 10) for the current day and 6 (2 x 3) for every following day.

In addition, if the total number of used scheduling times, scheduling times in use, and scheduling times to be used for job nodes on the current day exceeds the specifications of this version, a message is displayed indicating that the number of job node scheduling times/day exceeds the quota when a batch processing job is scheduled or a real-time job is started.

[3] Number of technical assets: This specification refers to the number of tables and OBS files in the Data Catalog. You can expand the capacity using the incremental package of technology asset quantity. For details about how to purchase the package, see Buying an Incremental Package for Technical Asset Quantity.

[4] Number of data models: This specification indicates the number of logical models, physical models, dimension tables, fact tables, and SDR tables in the data architecture. You can expand the capacity using the data model quantity incremental package. For details about how to purchase the package, see Buying an Incremental Package for Data Model Quantity.

Version Specifications (Old Version Mode)

Table 5 Modules supported by DataArts Studio

DataArts Studio Module

Starter

Basic

Advanced

Professional

Enterprise

DataArts Migration (CDM Jobs)

DataArts Migration (Offline Jobs)

DataArts Migration (Real-Time Jobs)

Management Center

DataArts Architecture

x

DataArts Factory

DataArts Quality

x

DataArts Catalog

x

DataArts DataService

x

DataArts Security

x

Table 6 DataArts Studio version specifications (a single instance)

DataArts Studio Specifications

Starter

Basic

Advanced

Professional

Enterprise

DataArts Studio CDM cluster[1]

Number of clusters: 1

Name: cdm.medium

vCPUs: 4 | Memory: 8 GB

Number of clusters: 1

Name: cdm.medium

vCPUs: 4 | Memory: 8 GB

Number of clusters: 1

Name: cdm.large

vCPUs: 8 | Memory: 16 GB

Number of clusters: 1

Name: cdm.xlarge

vCPUs: 16 | Memory: 32 GB

Number of clusters: 1

Name: cdm.xlarge

vCPUs: 16 | Memory: 32 GB

Job node scheduling times/day[2]

5,000/day

20,000/day

40,000/day

80,000/day

200,000/day

Number of technical assets[3]

Not supported

1,000

2,000

4,000

10,000

Number of data models[4]

Not supported

1,000

2,000

4,000

10,000

Notes:

[1] DataArts Studio CDM cluster: Due to specifications restrictions, the free CDM cluster provided by a DataArts Studio instance can only be used for informal scenarios such as testing and trial use. To run your migration workloads, buy a CDM incremental package. In addition, you are not advised to use a CDM cluster that serves as a data connection agent to run data migration jobs. For details, see Buying a CDM Incremental Package.

[2] Job node scheduling times/day: It refers to the total number of scheduling times of the data development jobs, quality jobs, comparison jobs, scenarios, and metadata collection jobs per day. The number of scheduling times of data development job per day is measured by node (including the Dummy node), covering PatchData and dry run but not test or retry upon failures. For example, if a job contains two DWS SQL nodes and one Dummy node, starts to be executed at 00:00 every day, is scheduled every 10 hours, and a PatchData task is performed on the current day to patch data of the last 10 days, then the number of scheduling times of the job is 66 (2 x 3 + 2 x 3 x 10) for the current day and 6 (2 x 3) for every following day.

In addition, if the total number of used scheduling times, scheduling times in use, and scheduling times to be used for job nodes on the current day exceeds the specifications of this version, a message is displayed indicating that the number of job node scheduling times/day exceeds the quota when a batch processing job is scheduled or a real-time job is started.

[3] Number of technical assets: number of tables and OBS files in DataArts Catalog

[4] Number of data models: number of logical models, physical models, dimension tables, fact tables, and summary tables in DataArts Architecture.

Version Mode

The version mode of DataArts Studio has changed in some regions to provide flexible resource configuration and lightweight data governance capabilities.
  • Now you can purchase DataArts Studio instances of the starter, expert, or enterprise version.
  • The version mode change does not affect the DataArts Studio instances you have purchased before, which may be of the starter, basic, advanced, professional, or enterprise version.

Compared with the old version mode, the new version mode provides more favorable prices and more flexible resource scaling. If you want to experience the new version mode, you are advised to buy a new DataArts Studio instance, migrate service data from the original instance to the new instance by referring to DataArts Studio Data Migration Configuration, and then unsubscribe from the original instance.

For details about the changes in the mode of the new version compared with that of the old version, see Table 7.

Table 7 Comparison between the old and new version modes

Difference

Old Version Mode

New Version Mode

Provided Version

  • Starter version: data integration and development
  • Basic version: data integration, development, and governance
  • Advanced version: data integration, development, and governance
  • Professional version: data integration, development, and governance
  • Enterprise version: data integration, development, and governance
  • Starter version: data integration and development
  • Expert version: data integration, development, and lightweight data governance
  • Enterprise version: data integration, development, and governance

Data governance capabilities

Not supported. Except the basic version, all versions provide full-function data governance capabilities, which are costly.

Supported. The expert version provides lightweight data governance capabilities to meet data governance requirements of small- and medium-sized enterprises.

Supported Incremental Packages

Only the function incremental package is provided.

  • CDM incremental package
  • DataArts DataService incremental package

Function incremental packages and specifications incremental packages are available. (For details about how to buy them, see Buying a DataArts Studio Incremental Package.

Function incremental packages:

  • CDM incremental package
  • DataArts Migration resource group incremental package
  • DataArts DataService Exclusive cluster incremental package

Specifications incremental packages:

  • Incremental package of job scheduling times/day
  • Incremental package of technical asset quantity
  • Incremental package of data model quantity