Creating a Dataset Import Task
You can import new data from OBS through an OBS path or a manifest file.
dataset.import_data(path=None, anntation_config=None, **kwargs)
Table 1 lists the import modes supported by datasets.
Dataset Type | From an OBS Path | From a Manifest File | Remarks |
|---|---|---|---|
Image classification | Supported | Supported | None |
Object detection | Supported | Supported | None |
Image segmentation | Supported | Supported | None |
Text classification | Supported | Supported | None |
Named entity recognition | Not supported | Supported | None |
Text triplet | Not supported | Supported | None |
Sound classification | Supported | Supported | None |
Speech labeling | Not supported | Supported | None |
Speech paragraph labeling | Not supported | Supported | None |
Table dataset | Supported | Not supported | The schema of the newly imported table data is the same as that of the dataset. |
Video labeling | Not supported | Supported | None |
Sample Code
- Example 1: Import an object detection dataset from an OBS path.
from modelarts.session import Session from modelarts.dataset import Dataset session = Session() dataset = Dataset(session, dataset_id) annotation_config = dict() annotation_config['scene'] = "object_detection" annotation_config['format_name'] = "ModelArts PASCAL VOC 1.0" import_resp = dataset.import_data(path="/obs-gaia-test/data/image/image-detection/", annotation_config=annotation_config)
- Example 2: Import an object detection dataset from a manifest file.
annotation_config = dict() # Task with data imported from a manifest file. annotation_config is used to import labels. import_resp = dataset.import_data( path="/obs-gaia-test/data/output/work_path/dataset-5932-Qdd1RUZ3wqBQrwrTr3v/annotation/V001/V001.manifest",annotation_config=annotation_config) - Example 3: Import a table dataset from an OBS path.
import_resp = dataset.import_data( path="/obs-gaia-test/data/table/table1/", with_column_header=True)
Parameters
Parameter | Mandatory | Type | Description |
|---|---|---|---|
path | Yes | String | OBS path or manifest file path for importing data
|
annotation_config | No | Data labeling format. If this parameter is set to None, no labels will be imported. If data is to be imported from a manifest file, import an empty dict object so that labels can be imported. The following labeling formats are supported:
| |
with_column_header | No | Boolean | Whether the first row of a table is the table header. This parameter is mandatory for table datasets.
|
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.

