当前位置:网站首页>[basic knowledge of deep learning - 38] the difference between L1 regularization and L2 regularization
[basic knowledge of deep learning - 38] the difference between L1 regularization and L2 regularization
2022-07-27 19:47:00 【Yanyu up】
Concept
- L1 Regularization means that the regularization term added to the loss function is the model parameter L1 norm .
- L2 Regularization means that the regularization term added to the loss function is the model parameter L2 norm .
Purpose
- By adding a weight regularization term to the loss function , It can prevent the model from becoming too complex or the parameters of the model from being too large , So as to improve the generalization performance of the model .
L1 Regularization and L2 Difference of regularization
- add L1 Norm is easy to get the coefficient solution of the characteristic , That is, many parameters are 0, add L2 The characteristic solution obtained by norm is relatively smooth , But it can also ensure that the solution is close to 0 More of them . From the perspective of reciprocal ,L1 The norm has a mutation at zero , That is, the minimum point , Therefore, it is easy to optimize to this extreme point .
Bloggers will continue to update some basic knowledge related to in-depth learning, as well as problems and insights encountered in work , Please pay attention if you like 、 give the thumbs-up 、 Collection .
边栏推荐
- Fzu1669 right angled triangle
- 发布自己的npm组件库
- Release Samsung 3J1 sensor: the code implies that the safety of pixel 7 face recognition will be greatly increased
- SumMenuDemo(子菜单)
- Map和Set
- c语言:6、指针的简单使用与注意事项
- SQLServer 2008中事务日志已满问题处理
- Can set be used to define local variables in OPDS SQL
- Incluxdb series (III) detailed explanation of incluxdb configuration file
- 【深度学习基础知识 - 49】Kmeans
猜你喜欢

GridView(实现表格显示图标)

c语言:12、gdb工具调试c程序

GestureDetector(手势识别)

SystemService(系统服务)

C language: 11. Pipeline

Golang sets the domestic image, vscode configures the golang development environment, and vscode debugs the golang code

Flink简介以及运行架构
![[basic knowledge of deep learning - 46] Bayesian theorem and conditional probability formula](/img/9f/b9d7503404e068495fd8613df29366.png)
[basic knowledge of deep learning - 46] Bayesian theorem and conditional probability formula

BroadcastReceiver(广播)

C language: 10. Input stream, output stream, error stream
随机推荐
开启和禁用hyper-v
Intel releases horse ridge chip: 22nm process, which can control multiple qubits
Flink 算子简介
c语言:6、指针的简单使用与注意事项
【深度学习基础知识 - 47】贝叶斯网络与朴素贝叶斯
细数国产接口协作平台的六把武器!
一种比读写锁更快的锁,还不赶紧认识一下
爱立信承认在中国等五国行贿,向美支付10.6亿美元罚款
c语言:15、结构体
c语言:13、指针与内存
OPPO发布首款AR眼镜,宣布未来3年投入500亿进行研发
The valuation exceeds 15.6 billion yuan! Huaqin communication completed the round B financing of 1billion yuan! Qualcomm venture capital, Intel Capital led investment
下放三星3J1传感器:代码暗示Pixel 7人脸识别安全性将大增
【深度学习基础知识 - 38】L1正则化和L2正则化的区别
Samsung will promote a number of risc-v architecture chips, and 5g millimeter wave RF chips will be the first to be adopted
零知识证明的硬件加速
Low code implementation exploration (45) business parameters
[basic knowledge of deep learning - 47] Bayesian networks and naive Bayes
Intel launched the world's smallest high-resolution lidar, priced at only $349
Intel's process roadmap for the next 10 years is exposed: 1.4nm process will be launched in 2029! How?