当前位置:网站首页>Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers
Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers
2022-08-04 06:19:00 【Learning Adventures】
Data augmentation: 1. Do not overdo it, otherwise it will only increase the training time and will not increase the generalization ability; 2.Add extraneous data
L2 regularity: tend to respond to the common characteristics of training set samples; make the model prefer samples with small parameters to reduce the risk of overfitting
Several common optimizers
For sparse data, try to choose an optimization method with an adaptive learning rate. It does not need to be adjusted manually. It is better to use the default value.
Stochastic gradient descent usually takes longer to train and is prone to saddle points, but results are more reliable with good initialization and learning rate scheduling.
Overall, Adam is by far the best choice.
边栏推荐
- Vision Transformer 论文 + 详解( ViT )
- [CV-Learning] Semantic Segmentation
- Deep Adversarial Decomposition: A Unified Framework for Separating Superimposed Images
- 【CV-Learning】Object Detection & Instance Segmentation
- Brief description of database and common operation guide
- TensorFlow2 study notes: 6. Overfitting and underfitting, and their mitigation solutions
- 亚马逊云科技Build On-Amazon Neptune基于知识图谱的推荐模型构建心得
- [Introduction to go language] 12. Pointer
- 机器学习——分类问题对于文字标签的处理(特征工程)
- MNIST手写数字识别 —— 从感知机到卷积神经网络
猜你喜欢
【CV-Learning】卷积神经网络
【CV-Learning】图像分类
剪映专业版字幕导出随笔
"A minute" Copy siege lion log 】 【 run MindSpore LeNet model
亚马逊云科技Build On-Amazon Neptune基于知识图谱的推荐模型构建心得
CSDN大礼包--高校圆桌派大礼包
深度学习理论——过拟合、欠拟合、正则化、优化器
[Deep Learning 21 Days Learning Challenge] 2. Complex sample classification and recognition - convolutional neural network (CNN) clothing image classification
PP-LiteSeg
[Introduction to go language] 12. Pointer
随机推荐
深度确定性策略梯度(DDPG)
Androd Day02
[Introduction to go language] 12. Pointer
动手学深度学习_softmax回归
Usage of RecyclerView
Qt日常学习
Dictionary feature extraction, text feature extraction.
Pytorch问题总结
yoloV5 使用——训练速度慢,加速训练
[CV-Learning] Linear Classifier (SVM Basics)
Android foundation [Super detailed android storage method analysis (SharedPreferences, SQLite database storage)]
图像resize
【Copy攻城狮日志】“一分钟”跑通MindSpore的LeNet模型
SQL注入详解
(Navigation page) OpenStack-M version - manual construction of two nodes - with video from station B
图像形变(插值方法)
Linear Regression 02---Boston Housing Price Prediction
【CV-Learning】语义分割
【CV-Learning】Image Classification
空洞卷积