[Deep Learning Fundamentals - 39] Comparison of BN, LN, and WN
2022-07-27 19:47:00 【Yanyu up】
BN
- Batch normalization (BN) standardizes each feature of a layer's input to zero mean and unit variance across the mini-batch (followed by a learned scale and shift) before the activation function receives it. This keeps the activation's inputs in its sensitive region, where gradients are large, which mitigates vanishing gradients and shortens training time.
- BN is better suited to larger batch sizes, and the data distribution should be relatively similar from batch to batch.
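The normalization step described above can be sketched in NumPy. This is a minimal training-mode sketch, assuming a 2-D input of shape (batch, features); the function name `batch_norm`, the parameters `gamma`, `beta`, and `eps`, and the sample data are illustrative choices, not taken from the original post. Running statistics for inference are left out.

```python
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    """Training-mode batch normalization for x of shape (batch, features)."""
    mean = x.mean(axis=0)                    # per-feature mean over the batch
    var = x.var(axis=0)                      # per-feature (biased) variance over the batch
    x_hat = (x - mean) / np.sqrt(var + eps)  # standardize each feature
    return gamma * x_hat + beta              # learned scale and shift

# Illustrative data: 64 samples, 4 features, deliberately off-center.
rng = np.random.default_rng(0)
x = rng.normal(loc=3.0, scale=2.0, size=(64, 4))
y = batch_norm(x, gamma=np.ones(4), beta=np.zeros(4))
```

Because the statistics are taken along axis 0, their quality depends on the batch: with only a few samples the mean and variance estimates become noisy, which is one way to see why BN favors larger batches.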
WN
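- Weight normalization (WN) acts on the network's weights rather than on its activations: each weight vector is reparameterized as a direction and a separately learned magnitude. This controls the scale of the weights so they do not grow too large, which keeps the network from becoming overly complex and can help prevent overfitting.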
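As a rough illustration, weight normalization is commonly written as w = g * v / ||v||, where v carries the direction and g the magnitude. The sketch below assumes one weight vector per output unit, stored as a row; the names `weight_norm`, `v`, and `g` and the sample values are illustrative, not from the original post.

```python
import numpy as np

def weight_norm(v, g):
    """Reparameterize each row of v as w = g * v / ||v||."""
    norm = np.linalg.norm(v, axis=1, keepdims=True)  # L2 norm of each row
    return g[:, None] * v / norm                     # rescale each row to length g

# Illustrative parameters: 3 output units, 5 inputs each.
rng = np.random.default_rng(0)
v = rng.normal(size=(3, 5))        # direction parameters
g = np.array([0.5, 1.0, 2.0])      # magnitude parameters, one per output unit
w = weight_norm(v, g)
row_norms = np.linalg.norm(w, axis=1)  # equals g up to floating-point error
```

Whatever values v takes during training, the length of each effective weight vector is set by g alone, which is the sense in which the weights are kept from growing unchecked.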
LN
- Layer normalization (LN), in contrast to BN, does not depend on the whole batch. Instead, it normalizes the inputs of a given layer across the features of each individual sample, which makes it better suited to small batches, RNNs, and MLPs.
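The contrast with BN comes down to which axis the statistics are computed over. A minimal NumPy sketch, assuming an input whose last axis holds the features; the name `layer_norm`, its parameters, and the sample data are illustrative, not from the original post:

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    """Layer normalization over the last (feature) axis of x."""
    mean = x.mean(axis=-1, keepdims=True)    # per-sample mean over features
    var = x.var(axis=-1, keepdims=True)      # per-sample (biased) variance over features
    x_hat = (x - mean) / np.sqrt(var + eps)  # standardize each sample independently
    return gamma * x_hat + beta              # learned scale and shift

# Illustrative data: a batch containing a single sample with 8 features.
rng = np.random.default_rng(0)
x1 = rng.normal(loc=3.0, scale=2.0, size=(1, 8))
y1 = layer_norm(x1, gamma=np.ones(8), beta=np.zeros(8))
```

Since no statistic crosses the batch axis, the result for one sample is the same whether it is processed alone or inside a large batch, which is why LN works with small batches and with the step-by-step processing of RNNs.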
The author will keep posting deep learning fundamentals, along with problems and insights encountered at work. If you find these posts useful, please follow, like, and bookmark.