Updated on 2024-06-12 GMT+08:00

Dataset Overview

Data management is being upgraded and is invisible to users who have not used data management.

Dataset Types

ModelArts supports the following types of datasets:

  • Images: in .jpg, .png, .jpeg, or .bmp format for image classification and object detection
  • Audio: in .wav format for sound classification, speech labeling, and speech paragraph labeling
  • Text: in .txt or .csv format for text classification, named entity recognition, and text triplet labeling
  • Free format: allows data in any format. Labeling is not available for free format data. The free format applies if labeling is not required or needs to be customized. Select this format if your data is in multiple formats or your data is not in any of the preceding formats.
    Figure 1 Example of a dataset in free format

Dataset Functions

Different types of datasets support different functions, such as auto labeling and team labeling. For details, see Table 1.

Table 1 Functions supported by different types of datasets

Dataset Type

Labeling Type

Creating a Dataset

Importing Data

Exporting Data

Publishing a Dataset

Modifying a Dataset

Managing Dataset Versions

Auto Grouping

Data Features

Image

Image classification

Supported

Supported

Supported

Supported

Supported

Supported

Supported

Supported

Object detection

Supported

Supported

Supported

Supported

Supported

Supported

Supported

Supported

Image segmentation

Supported

Supported

Supported

Supported

Supported

Supported

Supported

N/A

Audio

Sound classification

Supported

Supported

N/A

Supported

Supported

Supported

N/A

N/A

Speech labeling

Supported

Supported

N/A

Supported

Supported

Supported

N/A

N/A

Speech paragraph labeling

Supported

Supported

N/A

Supported

Supported

Supported

N/A

N/A

Text

Text classification

Supported

Supported

N/A

Supported

Supported

Supported

N/A

N/A

Named entity recognition

Supported

Supported

N/A

Supported

Supported

Supported

N/A

N/A

Text triplet

Supported

Supported

N/A

Supported

Supported

Supported

N/A

N/A

Video

Video labeling

Supported

Supported

N/A

Supported

Supported

Supported

N/A

N/A

Free format

Free format

Supported

N/A

_

Supported

Supported

Supported

N/A

N/A

Table

Table

Supported

Supported

N/A

Supported

Supported

Supported

N/A

N/A

Specifications Restrictions

  • The maximum numbers of samples and labels in a single text or audio database other than a table dataset are 1,000,000 and 10,000, respectively.
  • The maximum size of a sample in a single text or audio database other than an image dataset is 5 GB.
  • The maximum size of an image for object detection or image classification is 25 MB.
  • The maximum size of a manifest file is 5 GB.
  • The maximum size of a text file in a line is 100 KB.
  • The maximum size of a labeling result file is 100 MB.