Processing Other Datasets
Before processing a dataset of other types, import the data to the platform. For details, see Importing Data to the Pangu Platform.
Creating an Other-Type Dataset Processing Task
To create a processing task for a dataset of other types, perform the following steps:
- Log in to ModelArts Studio Large Model Development Platform. In the My Spaces area, click the required workspace.
Figure 1 My Spaces
- In the navigation pane, choose Data Engineering > Data Processing > Processing Tasks. Click Create Processing Job in the upper right corner.
- On the Create Processing Job page, select the other-type dataset to be processed and click Next.
- Go to the processing step arrangement page.
- In the Adding Operator pane on the left, select the required operators. You can select one or more operators based on the actual scenario.
For other types of datasets, the platform supports only custom operators. For details about how to create a custom operator, see Custom Data Processing Operators.
- On the processing step orchestration page on the right, set operator parameters. You can drag the icon on the right of an operator to adjust the operator execution sequence.
- During orchestration, you can click Save new template in the upper right corner to save the current orchestration process as a template. During the creation of subsequent data processing tasks, you can select a processing template.
If you select a processing template, the processing steps you have already orchestrated will be deleted and replaced by the steps in the template.
Figure 2 Selecting a processing template
- After the processing steps are orchestrated, click Start Process to start the processing task.
After the data processing job is successfully executed, the status changes from Running to Succeeded, indicating that the data has been processed.
- After data processing is complete, click Generate in the Operation column on the Processing Task page to generate a processed dataset.
To view the processed dataset, choose Data Engineering > Data Management > Datasets, and click the Processed Dataset tab.
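The operator execution sequence set during orchestration can change the result. The following is a minimal conceptual sketch in plain Python, not the platform's custom operator interface: the operator functions (strip_whitespace, drop_empty_text), the record layout with a "text" field, and the run_pipeline helper are all hypothetical names introduced only for illustration. It shows the same two operators producing different output when their order is swapped.

```python
from typing import Callable, List, Optional

# Hypothetical record and operator shapes, for illustration only.
Record = dict
Operator = Callable[[Record], Optional[Record]]


def strip_whitespace(record: Record) -> Optional[Record]:
    """Trim leading and trailing whitespace from every string field."""
    return {k: v.strip() if isinstance(v, str) else v for k, v in record.items()}


def drop_empty_text(record: Record) -> Optional[Record]:
    """Discard the record (return None) if its 'text' field is empty."""
    return record if record.get("text") else None


def run_pipeline(records: List[Record], operators: List[Operator]) -> List[Record]:
    """Apply the operators to each record in the given order."""
    processed = []
    for record in records:
        current: Optional[Record] = record
        for operator in operators:
            current = operator(current)
            if current is None:  # an operator dropped the record
                break
        if current is not None:
            processed.append(current)
    return processed


records = [{"text": "  hello  "}, {"text": "   "}]

# Trim first, then filter: the whitespace-only record becomes empty and is dropped.
trim_then_filter = run_pipeline(records, [strip_whitespace, drop_empty_text])

# Filter first, then trim: the whitespace-only record passes the filter and survives.
filter_then_trim = run_pipeline(records, [drop_empty_text, strip_whitespace])

print(trim_then_filter)
print(filter_then_trim)
```

In this sketch, trimming before filtering leaves one record, while filtering before trimming leaves two, one of which has an empty text field. The same reasoning applies when arranging custom operators on the orchestration page: check that each operator receives data in the state it expects.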