当前位置:网站首页>Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers
Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers
2022-08-04 06:19:00 【Learning Adventures】
Data augmentation: 1. Do not overdo it, otherwise it will only increase the training time and will not increase the generalization ability; 2.Add extraneous data
L2 regularity: tend to respond to the common characteristics of training set samples; make the model prefer samples with small parameters to reduce the risk of overfitting
Several common optimizers
For sparse data, try to choose an optimization method with an adaptive learning rate. It does not need to be adjusted manually. It is better to use the default value.
Stochastic gradient descent usually takes longer to train and is prone to saddle points, but results are more reliable with good initialization and learning rate scheduling.
Overall, Adam is by far the best choice.
边栏推荐
猜你喜欢
深度学习理论——过拟合、欠拟合、正则化、优化器
AWS使用EC2降低DeepRacer的训练成本:DeepRacer-for-cloud的实践操作
Logistic Regression --- Introduction, API Introduction, Case: Cancer Classification Prediction, Classification Evaluation, and ROC Curve and AUC Metrics
亚马逊云科技 Build On 2022 - AIot 第二季物联网专场实验心得
迅雷关闭自动更新
Pytorch问题总结
【CV-Learning】Convolutional Neural Network
MNIST手写数字识别 —— 图像分析法实现二分类
浅谈外挂常识和如何防御
深度确定性策略梯度(DDPG)
随机推荐
Halcon缺陷检测
The difference between oracle temporary table and pg temporary table
Android connects to mysql database using okhttp
【CV-Learning】线性分类器(SVM基础)
详解近端策略优化
target has libraries with conflicting names: libcrypto.a and libssl.a.
机器学习——分类问题对于文字标签的处理(特征工程)
(Navigation page) OpenStack-M version - manual construction of two nodes - with video from station B
动手学深度学习_多层感知机
深度学习理论 —— 初始化、参数调节
postgres recursive query
TensorFlow2 study notes: 7. Optimizer
ConnectionRefusedError: [Errno 111] Connection refused问题解决
[CV-Learning] Semantic Segmentation
fill_between in Matplotlib; np.argsort() function
Usage of Thread, Handler and IntentService
(导航页)OpenStack-M版-双节点手工搭建-附B站视频
yolov3 data reading (2)
双向LSTM
基于BiGRU和GAN的数据生成方法