当前位置：网站首页>【AutoAugment】《AutoAugment：Learning Augmentation Policies from Data》

【AutoAugment】《AutoAugment：Learning Augmentation Policies from Data》

2022-07-02 07:39:00 【bryant_ meng】

Insert picture description here

arXiv-2018

List of articles

1 Background and Motivation
2 Related Work
3 Advantages / Contributions
4 Method
5 Experiments
- 5.1 Datasets
- 5.2 Experiments and Results
6 Conclusion（own） / Future work

1 Background and Motivation

CV Most communities focus on designing better network structures ,Less attention has been paid to finding better data augmentation methods that incorporate more invariances（teach a model about invariances in the data domain）

The author aims to automate（Reinforcement Learning） the process of finding an effective data augmentation policy for a target dataset

2 Related Work

A little

3 Advantages / Contributions

automate Data augmentation , We have achieved SOTA, And can generalize well across different models and datasets（ Data distribution still needs to have a certain correlation , The augmentation strategy proposed earlier can shield the model , Across datasets ）

Despite the observed transferability, we find that policies learned on data distributions closest to the target yield the best performance

4 Method

Reinforcement learning （Proximal Policy Optimization algorithm） Search data expansion strategy ,

The search space is

https://pillow.readthedocs.io/en/stable/reference/ImageOps.html（PIL Realization ）
Insert picture description here
ShearX/Y, TranslateX/Y, Rotate, AutoContrast, Invert, Equalize, Solarize, Posterize, Contrast, Color, Brightness, Sharpness, Cutout, Sample Pairing

invert According to the given probability value, the pixel values of some or all channels are changed from v Set to 255-v
Equalize Histogram equalization

Solarize Within the specified threshold , Invert all pixels （ Above the threshold , be 255-v）.

from PIL import Image, ImageOps  
      
# creating a image1 object 
im1 = Image.open("1.jpg")  
im2 = ImageOps.solarize(im1, threshold = 130)  
im2.show()

Posterize： Retain Image The height of the pixel value of each channel bits position
```
from PIL import Image, ImageOps     
im1 = Image.open("1.jpg")  
im2 = ImageOps.posterize(im1, bits=2)  
im2.show()
```
128-64-32-16-8-4-2-1
bits=1, The maximum value of each channel of the image is 128
bits=2, The maximum value of each channel of the image is 128+64 = 192
By analogy bits 1~8 The maximum value of the corresponding image 128-192-224-240-248-252-254-255

Insert picture description here

16 Kind of data augmentation Method （ Different probabilities probability——11 individual values uniform spacing, Different parameter configurations magnitude——10 Level uniform spacing）, Some augmentation methods do not magnitude,eg：invert

Every sub-policies Two data augmentation methods （16 choose 2） Serial combination ——each sub-policy consisting of two image operations to be applied in sequence

A total of 5 Kind of sub-policies, The search space is roughly $16 \times 11 \times 10)^2)^5 = (16 \times 11 \times 10)^{10} = 2.9 \times 10^{32}$

Augmented form

During training ,a sub-policie is randomly chosen（5 choose 1） for each image in each mini-batch
Insert picture description here

Reward mechanism ,child model（a neural network trained as part of the search process） Of acc

5 Experiments

5.1 Datasets

CIFAR-10,5W,reduced CIFAR-10（which consists of 4,000 randomly chosen examples, to save time for training child models during the augmentation search process）
CIFAR-100
SVHN,reduced SVHN dataset of 1,000 examples sampled randomly from the core training set.
ImageNet,reduced ImageNet,with 120 classes (randomly chosen) and 6,000 samples
Stanford Cars
FGVC Aircraft

5.2 Experiments and Results

1）CIFAR-10 and CIFAR-100 Results
Insert picture description here

The more selected augmentation methods are Equalize, AutoContrast, Color, and Brightness, and ShearX/Y Less

Let's see the results

Insert picture description here

Let's look at the comparison with semi supervised methods

The author only used 4000 Zhang labeled samples, Semi supervised method use an additional 46,000 unlabeled samples in their training

The effect of the author is better

2）SVHN Results
Insert picture description here
Invert, Equalize, ShearX/Y, and Rotate More people were selected

the specific color of numbers is not as important as the relative color of the number and its background.

Insert picture description here
AutoAugment leads to more significant improvements on the reduced dataset than the full dataset（ Ha ha ha , Data itself is king , Data expansion is also to increase the diversity of data ）