Machine learning and deep learning -- normalization processing
2022-07-28 04:36:00 【A large piece of meat floss】
One 、 Normalization in machine learning
1、 Normalization
(1) Normalization speeds up gradient descent's search for the optimal solution
(2) Normalization may improve accuracy
Detailed explanation:
(1) Normalization speeds up gradient descent's search for the optimal solution

a. The left figure shows the loss contours without normalization. Because the two features vary over very different ranges, the contours are long and narrow, and gradient descent tends to follow a zigzag path toward the optimum, so it needs many iterations to converge.
b. The right figure shows the contours after the data is normalized. They are much rounder, so gradient descent converges faster.
Therefore, if a machine learning model uses gradient descent to find the optimal solution, normalization is usually necessary; otherwise training takes more iterations to converge, or may fail to converge at all.
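The effect described above can be reproduced on a small synthetic problem. The sketch below is only an illustration under stated assumptions: the data, the linear model without intercept, the stopping tolerance, and the step size (one over the largest curvature of the loss) are all made up for this example and are not taken from the original figures. It counts how many iterations batch gradient descent needs with and without standardizing the features.

```python
import numpy as np

def gradient_descent_steps(X, y, tol=1e-6, max_steps=200_000):
    """Batch gradient descent on mean squared error for a linear model
    without intercept. Returns the iteration at which the gradient norm
    first falls below tol (or max_steps if it never does)."""
    n = len(y)
    hessian = 2.0 / n * X.T @ X
    # Illustrative step size: 1 / largest curvature of the quadratic loss,
    # which keeps the iteration stable for both versions of the data.
    lr = 1.0 / np.linalg.eigvalsh(hessian).max()
    w = np.zeros(X.shape[1])
    for step in range(1, max_steps + 1):
        grad = 2.0 / n * X.T @ (X @ w - y)
        if np.linalg.norm(grad) < tol:
            return step
        w -= lr * grad
    return max_steps

# Hypothetical data: two features whose ranges differ by a factor of 50.
rng = np.random.default_rng(0)
n_samples = 200
x1 = rng.uniform(0.0, 1.0, n_samples)
x2 = rng.uniform(0.0, 50.0, n_samples)
X = np.column_stack([x1, x2])
y = 3.0 * x1 + 0.1 * x2 + rng.normal(0.0, 0.1, n_samples)

# Center both versions so the only difference is the per-feature scaling.
X_centered = X - X.mean(axis=0)
y_centered = y - y.mean()
X_standardized = X_centered / X.std(axis=0)

steps_raw = gradient_descent_steps(X_centered, y_centered)
steps_norm = gradient_descent_steps(X_standardized, y_centered)
print("iterations without scaling:     ", steps_raw)
print("iterations with standardization:", steps_norm)
```

The unscaled run needs far more iterations because the loss surface is stretched along the small-range feature, which is the narrow-contour situation from the left figure.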
(2) Normalization may improve accuracy
Some classifiers, such as KNN, need to compute the distance between samples (for example, the Euclidean distance). If one feature has a very large value range, the distance is dominated by that feature. This may not match reality: a feature with a small value range may actually be the more important one.
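A small numeric example makes the distance problem concrete. The samples below are hypothetical (age in years, income in currency units) and were chosen only to show how the nearest neighbor of a query can change once both features are scaled to the same range with min-max normalization.

```python
import numpy as np

# Hypothetical samples: column 0 = age (years), column 1 = income.
X = np.array([
    [25.0, 50_000.0],   # row 0: the query sample
    [60.0, 50_500.0],   # row 1: very different age, almost the same income
    [27.0, 53_000.0],   # row 2: similar age, somewhat different income
    [40.0, 90_000.0],   # row 3
    [20.0, 20_000.0],   # row 4
])

def nearest_to_first(data):
    """Return the row index of the sample closest to row 0 (Euclidean distance)."""
    dists = np.linalg.norm(data[1:] - data[0], axis=1)
    return int(np.argmin(dists)) + 1

raw_neighbor = nearest_to_first(X)

# Min-max scale each column to [0, 1] so both features contribute comparably.
X_scaled = (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))
scaled_neighbor = nearest_to_first(X_scaled)

print("nearest neighbor on raw features:   row", raw_neighbor)     # row 1
print("nearest neighbor on scaled features: row", scaled_neighbor)  # row 2
```

On the raw data the income column decides everything, so the 60-year-old in row 1 looks closest to the 25-year-old query. After scaling, row 2 is the nearest neighbor.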
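2、 Normalization methods
(1) Linear normalization (min-max scaling): subtract the minimum of the feature and divide by its range (maximum minus minimum)
(2) Standard deviation normalization: subtract the mean of the feature and divide by its standard deviation (not the variance)
(3) Nonlinear normalization: map the values through a nonlinear function, for example a logarithm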
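The three methods can be sketched in a few lines of NumPy. The function names are made up for this example, and the log-based version is only one possible choice of nonlinear normalization, assumed here for illustration; it requires non-negative inputs.

```python
import numpy as np

def min_max_normalize(x):
    """Linear normalization: (x - min) / (max - min), result in [0, 1]."""
    x = np.asarray(x, dtype=float)
    return (x - x.min()) / (x.max() - x.min())

def z_score_normalize(x):
    """Standard deviation normalization: (x - mean) / std."""
    x = np.asarray(x, dtype=float)
    return (x - x.mean()) / x.std()

def log_normalize(x):
    """One possible nonlinear normalization: log(1 + x) / log(1 + max).
    Assumes all values are >= 0 and the maximum is > 0."""
    x = np.asarray(x, dtype=float)
    return np.log1p(x) / np.log1p(x.max())

values = np.array([1.0, 10.0, 100.0, 1000.0])
print("min-max:", min_max_normalize(values))
print("z-score:", z_score_normalize(values))
print("log:    ", log_normalize(values))
```

Min-max scaling and the z-score are both linear maps, so they preserve the relative spacing of the values. The log version compresses large values, which can help when the data spans several orders of magnitude.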
Two 、 The BN layer in deep learning
1. BN layer
BN stands for Batch Normalization (a normalization layer).
The problem the BN layer solves: in a deep neural network, every layer transforms the data. Even if the input data is normalized, the distribution of the data is likely to shift dramatically as the network gets deeper, which makes the model harder to train.
Adding BN layers keeps the distribution of the data approximately consistent throughout the training of the deep network.
In short, the BN layer standardizes the data so that the inputs to each layer have a stable distribution, for example with mean 0 and variance 1.
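As a rough illustration of that standardization step, here is a minimal NumPy sketch of the training-time forward pass of batch normalization for a (batch, features) array. It is a simplified assumption-laden version, not a framework implementation: `gamma` and `beta` stand for the learnable scale and shift of standard batch normalization, `eps` is a small constant for numerical stability, and the running statistics used at inference time are left out.

```python
import numpy as np

def batch_norm_forward(x, gamma, beta, eps=1e-5):
    """Training-mode batch normalization over the batch axis (axis 0)."""
    mean = x.mean(axis=0)                     # per-feature mean of the batch
    var = x.var(axis=0)                       # per-feature variance of the batch
    x_hat = (x - mean) / np.sqrt(var + eps)   # roughly zero mean, unit variance
    return gamma * x_hat + beta               # learnable scale and shift

# Hypothetical activations whose distribution has drifted (mean 5, std 3).
rng = np.random.default_rng(0)
activations = rng.normal(loc=5.0, scale=3.0, size=(64, 4))

out = batch_norm_forward(activations, gamma=np.ones(4), beta=np.zeros(4))
print("mean per feature:", out.mean(axis=0))
print("std per feature: ", out.std(axis=0))
```

With `gamma` set to ones and `beta` to zeros, each feature of the output has mean close to 0 and standard deviation close to 1, regardless of how far the incoming activations had drifted.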
2. What the BN layer does
(1) Speeds up the training and convergence of the network
(2) Helps prevent vanishing and exploding gradients
(3) Helps reduce overfitting