当前位置:网站首页>[basic knowledge of deep learning - 39] comparison of BN, LN and WN
[basic knowledge of deep learning - 39] comparison of BN, LN and WN
2022-07-27 19:47:00 【Yanyu up】
BN
- batch normalization, Make it normalize the distribution of data into the standard normal distribution before the activation function receives the input , Make the input value of the activation function fall in the input sensitive area , That is, areas with large gradients . So that the gradient does not disappear 、 Reduce training time .
- BN More suitable batch Larger scenes , And the data distribution should be relatively close .
WN
- weigit normalization, The main thing is to regularize the weight of the network , Make the network not too complicated , Don't weigh too much , Can prevent over fitting .
LN
- layer normalization, It is relative to the BN, Not dependent on the whole batch, Instead, it normalizes the input of a certain layer , More suitable for small batch、RNN、MLP Scene .
Bloggers will continue to update some basic knowledge related to in-depth learning, as well as problems and insights encountered in work , Please pay attention if you like 、 give the thumbs-up 、 Collection .
边栏推荐
- Introduction to Flink operator
- Binary search tree
- C language: 14. Preprocessing
- IIS 发生未知FastCGI错误:0x80070005
- 首发骁龙765G!Redmi K30 5G版发布:支持5G双模120Hz屏,定价1999元起
- redis底层数据结构详解
- Oppo released the first AR glasses and announced that it would invest 50billion in research and development in the next three years
- c语言:8、makeFile编写
- Transaction log full problem handling in sqlserver 2008
- ContextMenu(上下文菜单)
猜你喜欢

嵌入式C语言结构体

MFC高级控件之Tab控件( CTabCtrl )
![[basic knowledge of deep learning - 45] distance calculation methods commonly used in machine learning](/img/6c/b0c2ea667ac361c13d38c8f5e6e5f1.png)
[basic knowledge of deep learning - 45] distance calculation methods commonly used in machine learning

C language: 15. Structure

S32k series chips -- Introduction

c语言:10、输入流,输出流,错误流

c语言:7、c语言多源码文件使用方法

c语言:14、预处理

Under the heat wave of Web3.0, the ecological shock of Mensa struck

c语言:9、main函数中的return
随机推荐
C language: 11. Pipeline
To create a MySQL data source resource group, you must choose to create a new exclusive data integration resource group? Or use a common resource group? thank you
HDU1323_ Perfection [water question]
反超华为?爱立信已拿下超过75份5G商用合同
嵌入式C语言指针别名
C language: 12. GDB tool debugging C program
嵌入式C语言对次数固定的循环的优化
Map and set
Intel releases horse ridge chip: 22nm process, which can control multiple qubits
GestureOverlayView(手势识别2)
Embedded C language structure
Time complexity and space complexity
The first in the field of mobile phone chip design in the world! Ziguang zhanrui won the international certification of tmmi4
rxbinding
c语言:clion调试方法
c语言:8、makeFile编写
【深度学习基础知识 - 46】贝叶斯定理与条件概率公式
DatePicker(日期选择器)与TimePicker(时间选择器)
High cost, difficult to implement, slow to take effect, what about open source security?
BroadcastReceiver(广播)