当前位置：网站首页>Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers

Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers

2022-08-04 06:19:00 【Learning Adventures】

Data augmentation: 1. Do not overdo it, otherwise it will only increase the training time and will not increase the generalization ability; 2.Add extraneous data

L2 regularity: tend to respond to the common characteristics of training set samples; make the model prefer samples with small parameters to reduce the risk of overfitting

Several common optimizers

For sparse data, try to choose an optimization method with an adaptive learning rate. It does not need to be adjusted manually. It is better to use the default value.

Stochastic gradient descent usually takes longer to train and is prone to saddle points, but results are more reliable with good initialization and learning rate scheduling.

Overall, Adam is by far the best choice.

原网站

版权声明
本文为[Learning Adventures]所创，转载请带上原文链接，感谢
https://yzsam.com/2022/216/202208040525571131.html

当前位置：网站首页>Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers

Deep Learning Theory - Overfitting, Underfitting, Regularization, Optimizers

边栏推荐

猜你喜欢

随机推荐