Creating a Dataset Import Task
You can import new data from OBS through an OBS path or a manifest file.
dataset.import_data(path=None, anntation_config=None, **kwargs)
Table 1 lists the import modes supported by datasets.
Dataset Type |
From an OBS Path |
From a Manifest File |
Remarks |
---|---|---|---|
Image classification |
Supported |
Supported |
None |
Object detection |
Supported |
Supported |
None |
Image segmentation |
Supported |
Supported |
None |
Text classification |
Supported |
Supported |
None |
Named entity recognition |
Not supported |
Supported |
None |
Text triplet |
Not supported |
Supported |
None |
Sound classification |
Supported |
Supported |
None |
Speech labeling |
Not supported |
Supported |
None |
Speech paragraph labeling |
Not supported |
Supported |
None |
Table dataset |
Supported |
Not supported |
The schema of the newly imported table data is the same as that of the dataset. |
Video labeling |
Not supported |
Supported |
None |
Sample Code
- Example 1: Import an object detection dataset from an OBS path.
from modelarts.session import Session from modelarts.dataset import Dataset session = Session() dataset = Dataset(session, dataset_id) annotation_config = dict() annotation_config['scene'] = "object_detection" annotation_config['format_name'] = "ModelArts PASCAL VOC 1.0" import_resp = dataset.import_data(path="/obs-gaia-test/data/image/image-detection/", annotation_config=annotation_config)
- Example 2: Import an object detection dataset from a manifest file.
annotation_config = dict() # Task with data imported from a manifest file. annotation_config is used to import labels. import_resp = dataset.import_data( path="/obs-gaia-test/data/output/work_path/dataset-5932-Qdd1RUZ3wqBQrwrTr3v/annotation/V001/V001.manifest",annotation_config=annotation_config)
- Example 3: Import a table dataset from an OBS path.
import_resp = dataset.import_data( path="/obs-gaia-test/data/table/table1/", with_column_header=True)
Parameters
Parameter |
Mandatory |
Type |
Description |
---|---|---|---|
path |
Yes |
String |
OBS path or manifest file path for importing data
|
annotation_config |
No |
Data labeling format. If this parameter is set to None, no labels will be imported. If data is to be imported from a manifest file, import an empty dict object so that labels can be imported. The following labeling formats are supported:
|
|
with_column_header |
No |
Boolean |
Whether the first row of a table is the table header. This parameter is mandatory for table datasets.
|
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.