Data Augmentation Based on Model Evaluation
Scenario
Datasets are critical for model training. The ModelArts platform requires at least five images for each class to be trained. However, in industrial-grade scenarios, datasets usually need to be expanded further. This section describes a method for expanding a labeled dataset.
Principles
Dataset details
This is a classification problem. The model must identify the defects on the surface of industrial parts and determine whether they are defective parts. The following figure shows an example.
This figure shows the surfaces of two PV modules: the left one is normal and the right one is abnormal. A model is required to distinguish the two classes of images to help locate abnormal PV modules. The training set has 754 normal samples and 358 abnormal samples; the validation set has 754 normal samples and 357 abnormal samples. In total there are about 2,000 samples, far fewer than are typically needed to reach an accuracy of 95% or higher. Training a ResNet50 model on this dataset with PyTorch gives an overall accuracy of about 86.06%, with a recall of 97.3% for the normal class and 62.9% for the abnormal class. This result does not meet the customer's expectation.
Few-shot learning (FSL) is a common approach when only a very small amount of training data is available; it focuses on the data and on the features the model extracts from images. In this dataset, the images are 300 x 300 grayscale front views of PV module surfaces, and they are already efficiently pre-processed. Flipping such images has little impact on the overall image structure, so they can be flipped to increase data diversity. The following figure shows the flipping result.
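The flip-based expansion can be sketched with plain NumPy. This is a toy stand-in (random arrays in place of the real grayscale images, and a hypothetical `expand_by_flipping` helper), not the actual loading pipeline, which would read the images with PIL or torchvision:

```python
import numpy as np

def expand_by_flipping(images):
    """Double a dataset of grayscale images (H x W arrays) by adding
    a horizontally flipped copy of each image."""
    flipped = [np.fliplr(img) for img in images]
    return images + flipped

# Small random stand-ins for the 1,100 grayscale PV-module images.
dataset = [np.random.rand(8, 8) for _ in range(1100)]
augmented = expand_by_flipping(dataset)
print(len(augmented))  # 1,100 originals + 1,100 flips = 2,200
```

Vertical flips (`np.flipud`) could be added the same way if they also preserve the image structure.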
In this way, the number of images in the dataset increases from 1,100 to 2,200. With no other straightforward way to expand the dataset, use the ModelArts model evaluation function to evaluate the generalization capability of the model on the data: call the model evaluation API, that is, the analyse API under deep_moxing.model_analysis.
import time

import torch

# The analyse API is provided by the ModelArts SDK under deep_moxing.model_analysis.
from deep_moxing.model_analysis.api import analyse

# AverageMeter, ProgressMeter, and accuracy are the helper utilities from the
# PyTorch ImageNet training example.
def validate(val_loader, model, criterion, args):
    batch_time = AverageMeter('Time', ':6.3f')
    losses = AverageMeter('Loss', ':.4e')
    top1 = AverageMeter('Acc@1', ':6.2f')
    top5 = AverageMeter('Acc@5', ':6.2f')
    progress = ProgressMeter(
        len(val_loader),
        [batch_time, losses, top1, top5],
        prefix='Test: ')
    pred_list = []
    target_list = []

    # switch to evaluate mode
    model.eval()

    with torch.no_grad():
        end = time.time()
        for i, (images, target) in enumerate(val_loader):
            if args.gpu is not None:
                images = images.cuda(args.gpu, non_blocking=True)
                target = target.cuda(args.gpu, non_blocking=True)

            # compute output
            output = model(images)
            loss = criterion(output, target)

            # Collect the model inference results (pred) and the ground-truth
            # labels (target) for the two classes.
            pred_list += output.cpu().numpy()[:, :2].tolist()
            target_list += target.cpu().numpy().tolist()

            # measure accuracy and record loss
            acc1, acc5 = accuracy(output, target, topk=(1, 5))
            losses.update(loss.item(), images.size(0))
            top1.update(acc1[0], images.size(0))
            top5.update(acc5[0], images.size(0))

            # measure elapsed time
            batch_time.update(time.time() - end)
            end = time.time()

            if i % args.print_freq == 0:
                progress.display(i)

        # TODO: this should also be done with the ProgressMeter
        print(' * Acc@1 {top1.avg:.3f} Acc@5 {top5.avg:.3f}'
              .format(top1=top1, top5=top5))

        # Obtain the image storage paths.
        name_list = val_loader.dataset.samples
        for idx in range(len(name_list)):
            name_list[idx] = name_list[idx][0]
        analyse(task_type='image_classification', save_path='/home/image_labeled/',
                pred_list=pred_list, label_list=target_list, name_list=name_list)
    return top1.avg
Most of the preceding code is the validation code from the PyTorch ImageNet training example. It collects three lists: pred (model inference results), target (ground-truth image classes), and name (image storage paths), and then calls the analyse API. A JSON file is generated in the save_path directory; save it to the ModelArts training output directory, and you can then view the model analysis result on the evaluation result page. A JSON file generated offline can also be uploaded to the online directory for visualization. Table 1 and Table 2 list the sensitivity analysis results.
Table 1 Brightness sensitivity (precision by feature range)

| Feature Distribution | Class 0 | Class 1 |
|---|---|---|
| 0% - 20% | 0.7273 | 0.8864 |
| 20% - 40% | 0.8446 | 0.6892 |
| 40% - 60% | 0.9077 | 0.4615 |
| 60% - 80% | 0.9496 | 0.5116 |
| 80% - 100% | 0.966 | 0.5625 |
| Standard deviation | 0.0864 | 0.1516 |
Table 2 Clarity sensitivity (precision by feature range)

| Feature Distribution | Class 0 | Class 1 |
|---|---|---|
| 0% - 20% | 0.7556 | 0.8333 |
| 20% - 40% | 0.8489 | 0.6466 |
| 40% - 60% | 0.9239 | 0.6316 |
| 60% - 80% | 0.9492 | 0.8 |
| 80% - 100% | 0.9631 | 0.5946 |
| Standard deviation | 0.0771 | 0.0963 |
The two preceding tables show the test precision for images in different feature value ranges. For example, the precision on class 0 images whose brightness falls in the 0% - 20% range is much lower than in the other ranges. Because the model is mainly used to detect class 1 (abnormal) images, the key observation is that class 1 is sensitive to both brightness and clarity: the standard deviation of its per-range precision is large, meaning the model cannot properly handle images whose brightness or clarity varies.
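The idea behind these tables can be sketched as follows: bin the validation images by quantiles of a feature value (for example, mean brightness), compute the accuracy within each bin, and use the spread across bins as a sensitivity score. This is a simplified NumPy illustration of the principle, not the actual analyse implementation; the function name and binning details are assumptions:

```python
import numpy as np

def sensitivity_by_feature(feature_values, correct, n_bins=5):
    """Bin samples by quantiles of a feature (e.g. mean brightness) and
    report per-bin accuracy; a large spread across bins suggests the
    model is sensitive to that feature."""
    feature_values = np.asarray(feature_values, dtype=float)
    correct = np.asarray(correct, dtype=float)  # 1 = correct prediction, 0 = wrong
    edges = np.quantile(feature_values, np.linspace(0.0, 1.0, n_bins + 1))
    accs = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (feature_values >= lo) & (feature_values <= hi)
        accs.append(correct[mask].mean())
    return accs, float(np.std(accs))

# Toy example: the model fails on the darkest images only.
brightness = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0]
correct    = [0,   0,   1,   1,   1,   1,   1,   1,   1,   1]
accs, std = sensitivity_by_feature(brightness, correct)
print(accs, std)  # the lowest-brightness bin has much lower accuracy
```

A small per-bin accuracy in one range, or a large standard deviation overall, points at the feature to target with augmentation.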
ModelArts provides the data augmentation function to directly expand datasets.
After data augmentation, the dataset contains 2,210 normal images and 1,174 abnormal images. Import the dataset into PyTorch for training. The result is as follows:
| Item | Accuracy | Recall of Normal Class | Recall of Abnormal Class |
|---|---|---|---|
| Original images | 86.06% | 97.3% | 62.9% |
| Images expanded from 1,100 to 3,384 | 86.31% | 97.6% | 62.5% |
The preceding results show that the precision does not improve significantly, so analyze the dataset again. Industrial datasets often have unbalanced samples; although the ratio here is close to 2:1, the detection requirement is highest for the abnormal class, so the model should be biased toward it. The class 1 sensitivity results also reflect this imbalance between normal and abnormal samples. Therefore, the two augmentation methods targeting the sensitive features (brightness and clarity) are applied only to the abnormal images. This yields about 3,000 images (1,508 normal and 1,432 abnormal), so the two classes are balanced. Train ResNet50 on this dataset; the following table lists the precision.
| Item | Accuracy | Recall of Normal Class | Recall of Abnormal Class |
|---|---|---|---|
| Original images | 86.06% | 97.3% | 62.9% |
| Images expanded from 1,100 to 2,940 | 89.13% | 97.2% | 71.3% |
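The class-targeted augmentation described above can be sketched with NumPy. This is an illustrative toy (hypothetical helper names, brightness scaling on arrays in [0, 1]), not the ModelArts data augmentation function itself; a clarity-style perturbation such as blurring would be applied the same way:

```python
import numpy as np

rng = np.random.default_rng(0)

def augment_brightness(img, low=0.7, high=1.3):
    """Randomly scale pixel intensities to vary brightness, keeping
    values in the valid [0, 1] range."""
    return np.clip(img * rng.uniform(low, high), 0.0, 1.0)

def expand_abnormal_class(images, labels, abnormal_label=1, copies=2):
    """Add perturbed copies of abnormal-class images only, so the two
    classes end up closer to balanced."""
    new_images, new_labels = list(images), list(labels)
    for img, lab in zip(images, labels):
        if lab == abnormal_label:
            for _ in range(copies):
                new_images.append(augment_brightness(img))
                new_labels.append(lab)
    return new_images, new_labels

# Toy example: one normal (0) and one abnormal (1) image.
imgs = [np.full((4, 4), 0.5), np.full((4, 4), 0.5)]
labs = [0, 1]
out_imgs, out_labs = expand_abnormal_class(imgs, labs)
print(out_labs)  # only the abnormal image gains extra copies
```

Augmenting only the minority class both increases its sample count and exposes the model to the brightness variation it was shown to be sensitive to.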
Summary
On the validation set (754 normal and 357 abnormal samples), the top-1 accuracy increases by nearly 3% and the recall of the abnormal class increases by 8.4%. Direct dataset expansion is therefore effective, and model evaluation helps you choose an effective augmentation method. It is also important to eliminate problems in the original dataset, such as unbalanced samples.
Based on the results of this test, we modified some training policies and used a more powerful network, and the final model training results met the customer's requirements.
Dataset reference:
Buerhop-Lutz, C.; Deitsch, S.; Maier, A.; Gallwitz, F.; Berger, S.; Doll, B.; Hauch, J.; Camus, C. & Brabec, C. J. A Benchmark for Visual Identification of Defective Solar Cells in Electroluminescence Imagery. European PV Solar Energy Conference and Exhibition (EU PVSEC), 2018. DOI: 10.4229/35thEUPVSEC20182018-5CV.3.15
Deitsch, S.; Buerhop-Lutz, C.; Maier, A. K.; Gallwitz, F. & Riess, C. Segmentation of Photovoltaic Module Cells in Electroluminescence Images. CoRR, 2018, abs/1806.06530
Deitsch, S.; Christlein, V.; Berger, S.; Buerhop-Lutz, C.; Maier, A.; Gallwitz, F. & Riess, C. Automatic classification of defective photovoltaic module cells in electroluminescence images. Solar Energy, Elsevier BV, 2019, 185, 455-468. DOI: 10.1016/j.solener.2019.02.067