当前位置:网站首页>Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers
Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers
2022-08-04 06:19:00 【Learning Adventures】






Data augmentation: 1. Do not overdo it, otherwise it will only increase the training time and will not increase the generalization ability; 2.Add extraneous data



L2 regularity: tend to respond to the common characteristics of training set samples; make the model prefer samples with small parameters to reduce the risk of overfitting



Several common optimizers


For sparse data, try to choose an optimization method with an adaptive learning rate. It does not need to be adjusted manually. It is better to use the default value.
Stochastic gradient descent usually takes longer to train and is prone to saddle points, but results are more reliable with good initialization and learning rate scheduling.
Overall, Adam is by far the best choice.

边栏推荐
- 【CV-Learning】Convolutional Neural Network
- Linear Regression 02---Boston Housing Price Prediction
- DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better 图像去模糊
- yoloV5 使用——训练速度慢,加速训练
- 光条提取中的连通域筛除
- 【深度学习日记】第一天:Hello world,Hello CNN MNIST
- MNIST手写数字识别 —— Lenet-5首个商用级别卷积神经网络
- 图像resize
- 【CV-Learning】语义分割
- 中国联通、欧莱雅和钉钉都在争相打造的秘密武器?虚拟IP未来还有怎样的可能
猜你喜欢

Brief description of database and common operation guide

浅谈游戏音效测试点

【论文阅读】Multi-View Spectral Clustering with Optimal Neighborhood Laplacian Matrix

Lee‘s way of Deep Learning 深度学习笔记

Usage of RecyclerView

剪映专业版字幕导出随笔

典型CCN网络——efficientNet(2019-Google-已开源)

【论文阅读】Further Non-local and Channel Attention Networks for Vehicle Re-identification

TensorFlow2 study notes: 6. Overfitting and underfitting, and their mitigation solutions

CSDN大礼包--高校圆桌派大礼包
随机推荐
[CV-Learning] Convolutional Neural Network Preliminary Knowledge
CSDN大礼包--高校圆桌派大礼包
学习资料re-id
PyTorch
The use of the attribute of the use of the animation and ButterKnife
深度学习理论——过拟合、欠拟合、正则化、优化器
Deep Learning Theory - Initialization, Parameter Adjustment
[Introduction to go language] 12. Pointer
迅雷关闭自动更新
Usage of Thread, Handler and IntentService
Pytorch语义分割理解
Brief description of database and common operation guide
卷积神经网络入门详解
【论文阅读】SPANET: SPATIAL PYRAMID ATTENTION NETWORK FOR ENHANCED IMAGE RECOGNITION
Copy攻城狮5分钟在线体验 MindIR 格式模型生成
图像合并水平拼接
MNIST手写数字识别 —— ResNet-经典卷积神经网络
Comparison of oracle's number and postgresql's numeric
2020-10-29
TensorFlow2 study notes: 6. Overfitting and underfitting, and their mitigation solutions