当前位置:网站首页>【Cutout】《Improved Regularization of Convolutional Neural Networks with Cutout》
【Cutout】《Improved Regularization of Convolutional Neural Networks with Cutout》
2022-07-02 07:45:00 【bryant_ meng】
arXiv-2017
List of articles
1 Background and Motivation
With the development of deep learning technology ,CNN Emerging in many computer vision tasks , but increased representational power also comes increased probability of overfitting, leading to poor generalization.
To improve the generalization performance of the model , simulation object occlusion, The author puts forward Cutout Methods of data enhancement ——randomly masking out square regions of input during training,take more of the image context into consideration when making decisions.
This technique encourages the network to better utilize the full context of the image, rather than relying on the presence of a small set of specific visual features(which may not always be present).
2 Related Work
- Data Augmentation for Images
- Dropout in Convolutional Neural Networks
- Denoising Autoencoders & Context Encoders(self-supervised, Dig out some , Network supplement , To strengthen the characteristics )
3 Advantages / Contributions
It is proposed in supervised learning Cutout Data enhancement method (dropout A form of , There are similar methods in self-monitoring )
4 Method
Initial edition :remove maximally activated features
The final version : Random center point , The square blocks ( Can be outside the picture , After being intercepted by the image boundary, it is not square )
It needs to be centralized when using ( That is, subtract the mean )
the dataset should be normalized about zero so that modified images will not have a large effect on the expected batch statistics.
5 Experiments
5.1 Datasets and Metrics
- CIFAR-10(32x32)
- CIFAR-100(32x32)
- SVHN(Street View House Numbers,32x32)
- STL-10(96x96)
The evaluation index is top1 error
5.2 Experiments
1)CIFAR10 and CIFAR100
Individual experiments are repeated 5 Time ,±x
The following figure explores cutout Different from patch length Influence ,
2)STL-10
3)Analysis of Cutout’s Effect on Activations
introduce cutout After shallow activation, both are improved , Deep level in the tail end of the distribution.
The latter observation illustrates that cutout is indeed encouraging the network to take into account a wider variety of features when making predictions, rather than relying on the presence of a smaller number of features
Then focus on the single sample
6 Conclusion(own) / Future work
code:https://github.com/uoguelph-mlrg/Cutout
memory footprint Memory footprint
Related work introduction drop out when , This sentence appears in the article :All activations are kept when evaluating the network, but the resulting output is scaled according to the dropout probability
dropout It works on FC The effect is better than Conv Excellent , The author's explanation is :1)convolutional layers already have much fewer parameters than fully-connected layers; 2)neighbouring pixels in images share much of the same information( It's harmless to lose some )
cutout—— The continuous area only acts on the input layer dropout technology
Dropout Technology in : Visual interpretation and in DNN/CNN/RNN Application in
边栏推荐
- Record of problems in the construction process of IOD and detectron2
- MMDetection模型微调
- 【DIoU】《Distance-IoU Loss:Faster and Better Learning for Bounding Box Regression》
- CONDA common commands
- Open failed: enoent (no such file or directory) / (operation not permitted)
- [in depth learning series (8)]: principles of transform and actual combat
- Proof and understanding of pointnet principle
- Thesis tips
- 【Wing Loss】《Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks》
- PHP returns the abbreviation of the month according to the numerical month
猜你喜欢
Point cloud data understanding (step 3 of pointnet Implementation)
ModuleNotFoundError: No module named ‘pytest‘
【TCDCN】《Facial landmark detection by deep multi-task learning》
【Programming】
How to turn on night mode on laptop
Tencent machine test questions
[introduction to information retrieval] Chapter 1 Boolean retrieval
【MagNet】《Progressive Semantic Segmentation》
Faster-ILOD、maskrcnn_ Benchmark trains its own VOC data set and problem summary
Mmdetection trains its own data set -- export coco format of cvat annotation file and related operations
随机推荐
Implementation of purchase, sales and inventory system with ssm+mysql
Faster-ILOD、maskrcnn_ Benchmark training coco data set and problem summary
iOD及Detectron2搭建过程问题记录
使用百度网盘上传数据到服务器上
【Cascade FPD】《Deep Convolutional Network Cascade for Facial Point Detection》
[introduction to information retrieval] Chapter 7 scoring calculation in search system
Thesis tips
Mmdetection model fine tuning
生成模型与判别模型的区别与理解
传统目标检测笔记1__ Viola Jones
[torch] the most concise logging User Guide
MoCO ——Momentum Contrast for Unsupervised Visual Representation Learning
【TCDCN】《Facial landmark detection by deep multi-task learning》
PHP returns the abbreviation of the month according to the numerical month
【多模态】CLIP模型
【Mixup】《Mixup:Beyond Empirical Risk Minimization》
parser. parse_ Args boolean type resolves false to true
Calculate the difference in days, months, and years between two dates in PHP
SSM laboratory equipment management
SSM personnel management system