当前位置:网站首页>Machine learning and deep learning -- normalization processing
Machine learning and deep learning -- normalization processing
2022-07-28 04:36:00 【A large piece of meat floss】
One 、 Normalization in machine learning
1、 Normalization
(1) After normalization, the speed of gradient descent to find the optimal solution is accelerated
(2) Normalization has the potential to improve accuracy
Detailed explanation :
(1) After normalization, the speed of gradient descent to find the optimal solution is accelerated

\qquad a. The left figure is the contour line without normalization , It can be seen from the left figure that there is a big difference between the two feature change ranges , The contour line formed is very sharp , When the gradient descent method is used to find the optimal solution , It's very likely to go “ And ” Font route , So it needs many iterations to converge ;
\qquad b. Right picture , Normalize the data , At this time, the contour lines of the two features appear smooth , In the process of gradient descent, it can converge faster ;
\qquad Therefore, if the machine learning model uses gradient descent to find the optimal solution , Normalization is very necessary , Otherwise, it will take more iterations to converge or even fail to converge .
(2) Normalization may improve accuracy
Some classifiers need to calculate the distance between samples ( For example, European distance ), for example KNN. If a range of eigenvalues is very large , Then the calculation of distance depends on this feature , But it may not conform to the actual situation , For example, the characteristics of small cure range may be more important in the actual situation .
2、 Normalization method
(1) Linear normalization
(2) Normalization of standard deviation : Characteristics minus mean divided by variance
(3) Nonlinear normalization
Two 、 In depth learning BN layer
1.BN layer
\qquad BN,Batch Normalization( Normalized layer )
\qquad BN Problems solved by layer : In deep neural networks , Each layer will operate on the data , Even if the data is normalized initially , But with the deepening of neural network layers , The distribution of data is likely to change dramatically , At this time, it brings difficulty to the training of network model .
\qquad At this point, you need to join BN layer , Make the data keep the same distribution in the process of deep neural network training .
\qquad In short :BN Layer is to deal with data in a normative way , Make the distribution of data input values of each layer smooth , For example, the average value is 0, The variance of 1 Is a normal distribution .
2.BN Layer action
(1) Speed up the training and convergence of the network
(2) Prevent gradient disappearance and gradient explosion
(3) Prevent over fitting
边栏推荐
- CMake使用基础汇总
- Elementary level of C language -- while, for, do while
- Solana「迷惑行为」:造手机、开门店
- Cmake usage base summary
- [blood vessel detection] Based on MATLAB mom method, combined with Hessian and curve fitting, blood vessel diameter measurement [including Matlab source code, 1970]
- 重要的 SQL Server 函数 - 数字函数
- The unsatisfied analysis of setup and hold timing is the solution
- Use Baidu developer tool 4.0 to build a dedicated applet IDE
- pytorch_ Lightning in lightning_ The output of hparams.yaml in logs is null
- How much does it cost to build a self built server for ark survival evolution?
猜你喜欢

Advanced architects, 16 common principles of microservice design and Governance

How to select reliable securities analysts?

Important SQL server functions - other functions

MySQL:数据类型和运算符

空间复杂度计算超全整理!!(一起手撕复杂度计算

10 more advanced open source command line tools

Internet of things industrial serial port to WiFi module wireless routing WiFi module selection

Glusterfs file is not mounted, permission: R-S
![[kinematics] simulation of orbital angular momentum based on MATLAB [including Matlab source code 1971]](/img/5e/dfe029490183ee74687606941ce98e.jpg)
[kinematics] simulation of orbital angular momentum based on MATLAB [including Matlab source code 1971]

Information system project manager (2022) - key content: Information System Security Management (20)
随机推荐
《KG-BERT: BERT for Knowledge Graph Completion》
High number_ Chapter 4__ Curvilinear integral_ Exercise solution
Work fishing clock simulator wechat applet source code
[performance optimization methodology series] III. core idea of performance optimization (2)
【sylar】框架篇-Chapter12-ByteArray 模块
Important SQL server functions - other functions
Elementary level of C language -- while, for, do while
[Sylar] framework -chapter11 socket module
Information system project manager (2022) - key content: Project Contract Management (13)
[Sylar] framework -chapter12 bytearray module
重要的 SQL Server 函数 - 数字函数
Learn regular expressions (regexp)
Reading the paper "learning span level interactions for aspect sentimental triple extraction"
【sylar】框架篇-Chapter11-Socket 模块
24-Openwrt dnsmasq
Password key hard coding check
RN interface jump description
031 log4j log framework
Reading of the paper "attentional encoder network for targeted sentimental classification"
CMake使用基础汇总