Creating a Dataset Import Task
You can import new data from OBS through an OBS path or a manifest file.
dataset.import_data(path=None, anntation_config=None, **kwargs)
Table 1 lists the import modes supported by datasets.
Dataset Type |
From an OBS Path |
From a Manifest File |
Remarks |
---|---|---|---|
Image classification |
Supported |
Supported |
None |
Object detection |
Supported |
Supported |
None |
Image segmentation |
Supported |
Supported |
None |
Text classification |
Supported |
Supported |
None |
Named entity recognition |
Not supported |
Supported |
None |
Text triplet |
Not supported |
Supported |
None |
Sound classification |
Supported |
Supported |
None |
Speech labeling |
Not supported |
Supported |
None |
Speech paragraph labeling |
Not supported |
Supported |
None |
Table dataset |
Supported |
Not supported |
The schema of the newly imported table data is the same as that of the dataset. |
Video labeling |
Not supported |
Supported |
None |
Sample Code
- Example 1: Import an object detection dataset from an OBS path.
from modelarts.session import Session from modelarts.dataset import Dataset session = Session() dataset = Dataset(session, dataset_id) annotation_config = dict() annotation_config['scene'] = "object_detection" annotation_config['format_name'] = "ModelArts PASCAL VOC 1.0" import_resp = dataset.import_data(path="/obs-gaia-test/data/image/image-detection/", annotation_config=annotation_config)
- Example 2: Import an object detection dataset from a manifest file.
annotation_config = dict() # Task with data imported from a manifest file. annotation_config is used to import labels. import_resp = dataset.import_data( path="/obs-gaia-test/data/output/work_path/dataset-5932-Qdd1RUZ3wqBQrwrTr3v/annotation/V001/V001.manifest",annotation_config=annotation_config)
- Example 3: Import a table dataset from an OBS path.
import_resp = dataset.import_data( path="/obs-gaia-test/data/table/table1/", with_column_header=True)
Parameters
Parameter |
Mandatory |
Type |
Description |
---|---|---|---|
path |
Yes |
String |
OBS path or manifest file path for importing data
|
annotation_config |
No |
Data labeling format. If this parameter is set to None, no labels will be imported. If data is to be imported from a manifest file, import an empty dict object so that labels can be imported. The following labeling formats are supported:
|
|
with_column_header |
No |
Boolean |
Whether the first row of a table is the table header. This parameter is mandatory for table datasets.
|
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot