Help Center/ DataArts Studio/ User Guide/ DataArts Studio Introduction
Updated on 2022-09-23 GMT+08:00

DataArts Studio Introduction

DataArts Studio is a one-stop data operations platform that provides intelligent data lifecycle management. It supports intelligent construction of industrial knowledge libraries and incorporates data foundations such as big data storage, computing, and analysis engines. With DataArts Studio, your enterprise can easily construct end-to-end intelligent data systems. These systems can help eliminate data silos, unify data standards, accelerate data monetization, and promote digital transformation.

DataArts Studio Users

DataArts Studio provides four preset roles. They have different DataArts Studio permissions. For details, see DataArts Studio Permissions.

  • Admin

    This role is granted with the management, decision-making, and review permissions. Management personnel who are familiar with enterprise businesses can be assigned the admin role. An admin has the permissions of both developers and reviewers. Users who have the admin role can perform any operations in DataArts Studio. For example, they can manage workspaces and data assets, and configure jobs. In DataArts Architecture and DataArts DataService, operations like publishing and suspending data models and APIs must be reviewed by admins to guarantee data quality.

  • Developer

    Data modeling engineers and developers who are familiar with script development can be assigned the developer role. Developers who have the developer role can develop jobs in all DataArts Studio products. They can easily build end-to-end data systems with intelligence and at full speed.

    Developers can use a software development kit (SDK) to call DataArts DataService APIs provided by DataArts Studio to analyze data after intelligent data systems are built.

    Developers do not have the permissions required for reviewing operations and managing workspaces and workspace members. But they have most of DataArts Studio permissions.

  • Operator

    This role is granted with the permissions required to view job details, schedule O&M tasks, and monitor resources. O&M personnel can be assigned this role. DataArts Studio DataArts Catalog visualizes all data links. Data quality can be verified, controlled, and traced. O&M personnel who have the operator role can schedule and monitor jobs from end to end. Data collection, consumption, and O&M are all one-stop services.

  • Viewer

    This role is granted with the read-only permissions.

DataArts Studio Development Process

To use DataArts Studio, perform the following steps:
Table 1 DataArts Studio development process

Process

Description

Task

Helpful Link

Preparations

If you access DataArts Studio for the first time, register an account, buy a DataArts Studio instance, create a workspace and a user, authorize DataArts Studio permissions to the user, and add workspace members and roles.

Prepare before you use DataArts Studio.

DataArts Studio Preparations

Obtain the address of the data source to be connected and ensure that the host where the data source is located can communicate with the platform.

Prepare data sources.

Preparing a Data Source

Select a cloud service as the data lake. The data lake is used to store both original and real-time data, for the purposes of data development, governance, and operations.

Prepare data lakes.

Preparing a Data Lake

Select cloud services for data storage, query, and analysis as required. Then, create data connections required for the cloud services.

Create data connections.

Creating Data Connections

DataArts Migration

Use DataArts Studio to upload data from data sources to the cloud.

DataArts Migration migrates data between homogeneous and heterogeneous data sources such as self-built and cloud-based file systems, relational databases, data warehouses, NoSQLs, big data cloud services, and object storage.

DataArts Migration

Supported Data Sources

Creating a CDM Cluster

Creating Links

Table/File Migration Jobs

Metadata Collection

Collect metadata of raw data for data management and monitoring.

Metadata collection

Metadata Collection

DataArts Architecture

Use DataArts Architecture to create entity-relationship (ER) models and dimensional models to standardize and visualize data development and output data governance methods that can guide development personnel to work with ease.

In DataArts Architecture, you can create dimensions, fact tables, summary tables, and metrics that fit your needs.

Design the data architecture implementation process.

DataArts Architecture Use Process

Add reviewers.

Adding a Reviewer

Design subjects.

Designing Subjects

Manage lookup tables.

Creating Lookup Tables

Formulate data standards.

Creating Data Standards

Create ER models.

ER Modeling

Create dimensional models.

Dimensional Modeling

DataArts Factory

Use DataArts Factory to manage diverse big data services.

The one-stop big data development environment enables a variety of operations such as data management, data integration, script development, job development, job scheduling, O&M, and monitoring, facilitating data analysis and processing.

Manage data.

Data Management Process

Develop scripts.

Script Development Process

Develop jobs.

Job Development Process

Perform O&M and scheduling.

Overview

DataArts Quality

Use DataArts Quality to monitor business and technical metrics. Screen out unqualified data in a single column or cross columns, rows, and tables from the following perspectives: integrity, validity, timeliness, consistency, accuracy, and uniqueness. Use the automatically generated quality rules to cleanse and standardize data repeatedly.

Monitor business metrics.

Creating a Metric

Creating a Rule

Creating a Scenario

Monitor data quality.

Creating Rule Templates

Creating Quality Jobs

Creating a Comparison Job

DataArts Catalog

Use DataArts Studio DataArts Catalog to manage data permissions. DataArts Catalog provides data maps.

N/A

Overview

Overview

DataArts DataService

Use DataArts DataService to centrally manage API services, create data APIs based on tables, and register APIs with DataArts DataService itself for unified management and publication.

Develop APIs.

Preparations

Creating an API

Debugging an API

Publishing an API

Managing APIs

Creating Throttling Policies

Call APIs.

Calling APIs