Updated on 2025-08-28 GMT+08:00

How Do I Split an Image Dataset into Training and Validation Sets in ModelArts?

ModelArts does not let you manually assign individual images to the training set or the validation set. Instead, you set split ratios, and the system randomly allocates the images to the training and validation sets based on those ratios.
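To make the idea of a random, ratio-based split concrete, the following is a minimal local sketch in Python. It is not the ModelArts implementation, which this page does not describe. The function name split_dataset, the seed parameter, and the rounding rule for the cut point are all assumptions made for illustration.

```python
import random


def split_dataset(image_names, train_ratio, seed=None):
    """Randomly partition image_names into (training, validation) lists.

    Illustrative only: rounding the cut point is an assumption,
    not documented ModelArts behavior.
    """
    if not 0 <= train_ratio <= 1:
        raise ValueError("train_ratio must be between 0 and 1")
    shuffled = list(image_names)           # copy so the input is left untouched
    random.Random(seed).shuffle(shuffled)  # seeded shuffle for repeatable results
    cut = round(len(shuffled) * train_ratio)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical file names, used only to exercise the function.
images = [f"img_{i:03d}.jpg" for i in range(10)]
train, val = split_dataset(images, train_ratio=0.8, seed=42)
print(len(train), len(val))  # 8 2
```

Every image lands in exactly one of the two sets, and which images land where depends only on the shuffle, not on any manual choice.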

Setting Split Ratios

During dataset publishing, you can configure data splitting only for datasets used for image classification, object detection, text classification, or sound classification tasks.

Data splitting is disabled by default. To use it, enable the function and then set the split ratios.

Enter a value ranging from 0 to 1 for the training set ratio. After the training set ratio is set, the validation set ratio is automatically filled. The sum of the training set ratio and the validation set ratio is 1.
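The relationship between the two ratios can be sketched as below. This is a hypothetical helper, not a ModelArts API. It assumes the endpoints 0 and 1 are accepted, which this page does not state, and it uses Decimal so that the complement of 0.8 comes out as exactly 0.2 rather than a binary floating-point approximation.

```python
from decimal import Decimal


def validation_ratio(train_ratio):
    """Return the validation set ratio implied by a training set ratio.

    train_ratio is passed as a string such as "0.8" so that Decimal
    keeps the value exact. Treating 0 and 1 as valid is an assumption.
    """
    train = Decimal(train_ratio)
    if not Decimal(0) <= train <= Decimal(1):
        raise ValueError("training set ratio must be between 0 and 1")
    return Decimal(1) - train  # the two ratios always sum to 1


print(validation_ratio("0.8"))   # 0.2
print(validation_ratio("0.75"))  # 0.25
```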

The training set ratio is the proportion of sample data used for model training, and the validation set ratio is the proportion of sample data used for model validation. The ratios you choose affect the performance of training templates.