当前位置:网站首页>Machine learning and deep learning -- normalization processing
Machine learning and deep learning -- normalization processing
2022-07-28 04:36:00 【A large piece of meat floss】
One 、 Normalization in machine learning
1、 Normalization
(1) After normalization, the speed of gradient descent to find the optimal solution is accelerated
(2) Normalization has the potential to improve accuracy
Detailed explanation :
(1) After normalization, the speed of gradient descent to find the optimal solution is accelerated

\qquad a. The left figure is the contour line without normalization , It can be seen from the left figure that there is a big difference between the two feature change ranges , The contour line formed is very sharp , When the gradient descent method is used to find the optimal solution , It's very likely to go “ And ” Font route , So it needs many iterations to converge ;
\qquad b. Right picture , Normalize the data , At this time, the contour lines of the two features appear smooth , In the process of gradient descent, it can converge faster ;
\qquad Therefore, if the machine learning model uses gradient descent to find the optimal solution , Normalization is very necessary , Otherwise, it will take more iterations to converge or even fail to converge .
(2) Normalization may improve accuracy
Some classifiers need to calculate the distance between samples ( For example, European distance ), for example KNN. If a range of eigenvalues is very large , Then the calculation of distance depends on this feature , But it may not conform to the actual situation , For example, the characteristics of small cure range may be more important in the actual situation .
2、 Normalization method
(1) Linear normalization
(2) Normalization of standard deviation : Characteristics minus mean divided by variance
(3) Nonlinear normalization
Two 、 In depth learning BN layer
1.BN layer
\qquad BN,Batch Normalization( Normalized layer )
\qquad BN Problems solved by layer : In deep neural networks , Each layer will operate on the data , Even if the data is normalized initially , But with the deepening of neural network layers , The distribution of data is likely to change dramatically , At this time, it brings difficulty to the training of network model .
\qquad At this point, you need to join BN layer , Make the data keep the same distribution in the process of deep neural network training .
\qquad In short :BN Layer is to deal with data in a normative way , Make the distribution of data input values of each layer smooth , For example, the average value is 0, The variance of 1 Is a normal distribution .
2.BN Layer action
(1) Speed up the training and convergence of the network
(2) Prevent gradient disappearance and gradient explosion
(3) Prevent over fitting
边栏推荐
- Tiantian AMADA CNC bending machine touch screen maintenance rgm21003 host circuit board maintenance
- 01 node express system framework construction (express generator)
- 【sylar】框架篇-Chapter6-协程调度模块
- Cloud native Devops status survey questionnaire solicitation: kodelurover launched jointly with oschina
- Space complexity calculation super full sorting!! (calculation of hand tearing complexity
- Pyqt based grouping tool
- Reading of seq2path: generating sentimental tuples as paths of a tree
- Select sorting method
- Simple summary of Modbus Protocol
- 031 log4j log framework
猜你喜欢

空间复杂度计算超全整理!!(一起手撕复杂度计算

Information system project manager (2022) - key content: Information System Security Management (20)

Reading of seq2path: generating sentimental tuples as paths of a tree

C语言初阶——循环语句(while,for,do while)

Jupyter notebook installation code prompt function

Destructor of member function

Glusterfs file is not mounted, permission: R-S

Password key hard coding check

High number_ Chapter 4__ curvilinear integral

There are so many ways to view the web source code! Do you know?
随机推荐
Information system project manager (2022) - key content: intellectual property rights and standards and specifications (22)
There are so many ways to view the web source code! Do you know?
Harmony's Application on the shelves reported an error. The solution of "please use the API of the released version to develop the application and apply for listing"
DNS series (III): how to avoid DNS spoofing
01 node express system framework construction (express generator)
Constructor of member function
Introduction to this pointer
Reading the paper "learning span level interactions for aspect sentimental triple extraction"
[kinematics] simulation of orbital angular momentum based on MATLAB [including Matlab source code 1971]
Transformer landing | next vit realizes the real-time landing of industrial tensorrt, surpassing RESNET and cswin
【Oracle】083错题集
Odoo action analysis (action.client, action.act_window, action.server)
[Sylar] framework Chapter 8 timer module
[Sylar] practical part - redis based parameter query service
The unsatisfied analysis of setup and hold timing is the solution
[Sylar] framework -chapter14 tcpserver module
[Sylar] framework -chapter11 socket module
重要的 SQL Server 函数 - 日期函数
[mathematical modeling] Based on MATLAB seismic exploration Marmousi model [including Matlab source code, 1977]
登录之后右上角改变 进入登录状态