当前位置:网站首页>Machine learning and deep learning -- normalization processing
Machine learning and deep learning -- normalization processing
2022-07-28 04:36:00 【A large piece of meat floss】
One 、 Normalization in machine learning
1、 Normalization
(1) After normalization, the speed of gradient descent to find the optimal solution is accelerated
(2) Normalization has the potential to improve accuracy
Detailed explanation :
(1) After normalization, the speed of gradient descent to find the optimal solution is accelerated

\qquad a. The left figure is the contour line without normalization , It can be seen from the left figure that there is a big difference between the two feature change ranges , The contour line formed is very sharp , When the gradient descent method is used to find the optimal solution , It's very likely to go “ And ” Font route , So it needs many iterations to converge ;
\qquad b. Right picture , Normalize the data , At this time, the contour lines of the two features appear smooth , In the process of gradient descent, it can converge faster ;
\qquad Therefore, if the machine learning model uses gradient descent to find the optimal solution , Normalization is very necessary , Otherwise, it will take more iterations to converge or even fail to converge .
(2) Normalization may improve accuracy
Some classifiers need to calculate the distance between samples ( For example, European distance ), for example KNN. If a range of eigenvalues is very large , Then the calculation of distance depends on this feature , But it may not conform to the actual situation , For example, the characteristics of small cure range may be more important in the actual situation .
2、 Normalization method
(1) Linear normalization
(2) Normalization of standard deviation : Characteristics minus mean divided by variance
(3) Nonlinear normalization
Two 、 In depth learning BN layer
1.BN layer
\qquad BN,Batch Normalization( Normalized layer )
\qquad BN Problems solved by layer : In deep neural networks , Each layer will operate on the data , Even if the data is normalized initially , But with the deepening of neural network layers , The distribution of data is likely to change dramatically , At this time, it brings difficulty to the training of network model .
\qquad At this point, you need to join BN layer , Make the data keep the same distribution in the process of deep neural network training .
\qquad In short :BN Layer is to deal with data in a normative way , Make the distribution of data input values of each layer smooth , For example, the average value is 0, The variance of 1 Is a normal distribution .
2.BN Layer action
(1) Speed up the training and convergence of the network
(2) Prevent gradient disappearance and gradient explosion
(3) Prevent over fitting
边栏推荐
- pytorch_ Lightning in lightning_ The output of hparams.yaml in logs is null
- Idea start project MVN command terminal cannot recognize "MVN" item as cmdlet
- Constructor of member function
- 高数_第4章__曲线积分
- bubble sort
- [Sylar] framework Chapter 23 summary of module chapter
- 【sylar】框架篇-Chapter21-环境变量模块
- Niuke, convert string to integer
- Information system project manager (2022) - key content: Project Risk Management (11)
- Study notes of Gu Yujia on July 27, 2022
猜你喜欢

Jupyter Notebook安装代码提示功能

High number_ Chapter 4__ curvilinear integral

重要的 SQL Server 函数 - 其他函数

Information system project manager (2022) - key content: Project Risk Management (11)

Render the data obtained from the database to the table in elementui

Information system project manager (2022) - key content: Information System Security Management (20)

将数据库拿到的数据渲染到elementUI 中的table中去

Reading of a unified generic framework for aspect based sentimental analysis

Internet of things industrial serial port to WiFi module wireless routing WiFi module selection

Reading the paper "learning span level interactions for aspect sentimental triple extraction"
随机推荐
[Sylar] framework Chapter 8 timer module
Explain initialization list
空间复杂度计算超全整理!!(一起手撕复杂度计算
[Sylar] practical part - redis based parameter query service
Bio annotation of emotion analysis aste triples extraction
[Sylar] framework chapter -chapter10-address module
Simple summary of Modbus Protocol
Study of the Intel aria 10 Avalon mm DMA interface for PCI Express solutions User Guide
Zhejiang University and other recent review papers on deep learning new drug design
The unsatisfied analysis of setup and hold timing is the solution
Blooming old trees -- quickly build a map bed application with imageprocessor
【实战】使用 Web Animations API 实现一个精确计时的时钟
Fearless of side impact damage, Chery arize 8 fully protects the safety of passengers
重要的 SQL Server 函数 - 其他函数
Efficient practice intelligent analysis of massive log data in aviation industry
【sylar】框架篇-Chapter11-Socket 模块
Seamless support for hugging face community, colossal AI low-cost and easy acceleration of large model
【sylar】实战篇-基于 redis 的参数查询服务
could only be written to 0 of the 1 minReplication nodes. There are 0 datanode(s) running and 0 node
Power consumption: leakage power