当前位置:网站首页>Five minutes of machine learning every day: why do we need to normalize the characteristics of numerical types?
Five minutes of machine learning every day: why do we need to normalize the characteristics of numerical types?
2022-07-04 14:45:00 【Phantom wind_ huanfeng】
This paper mainly
Gradient descent algorithm in the case of multiple features , Use feature scaling techniques , It can make the gradient descent algorithm faster , The number of cycles required for gradient descent is smaller .
If there is a machine learning problem with multiple eigenvalues , We need to ensure that these features have similar scales , This will help the gradient descent algorithm converge faster .
For example
If there are two characteristics in the housing price problem , features x1 Indicates the area value (01000), features x2 Indicates the number of bedrooms (15), We can see the characteristics x1 The value of is far greater than x2 Of , At this time, we draw the outline of the machine learning problem, which is like this :
We can see that the image of the loss function will appear very flat , The gradient descent algorithm needs many iterations to converge . If you want to solve this problem , One technique we need to use is feature scaling .
Feature scaling
Feature scaling is trying to scale all features to -1 To 1 Between ( Of course -1.5 To 0.5 It's OK , This can be done in a small range , Of course, if it is 0.00000001 To 0.00001 This is not a very good feature scaling ). The specific solution is to let us define x1= Building area /2000、x2= Number of bedrooms /5
边栏推荐
- Sqlserver functions, creation and use of stored procedures
- 产业互联网则具备更大的发展潜能,具备更多的行业场景
- Count the running time of PHP program and set the maximum running time of PHP
- Digi XBee 3 rf: 4 protocols, 3 packages, 10 major functions
- 【云原生】我怎么会和这个数据库杠上了?
- LVGL 8.2 Line wrap, recoloring and scrolling
- Docker compose public network deployment redis sentinel mode
- WT588F02B-8S(C006_03)单芯片语音ic方案为智能门铃设计降本增效赋能
- Xcode abnormal pictures cause IPA packet size problems
- AI and Life Sciences
猜你喜欢
曝光一下阿里的工资待遇和职位级别
[MySQL from introduction to proficiency] [advanced chapter] (IV) MySQL permission management and control
leetcode:6110. 网格图中递增路径的数目【dfs + cache】
韩国AI团队抄袭震动学界!1个导师带51个学生,还是抄袭惯犯
Comment configurer un accord
一文概览2D人体姿态估计
LVGL 8.2 LED
实战解惑 | OpenCV中如何提取不规则ROI区域
Classify boost libraries by function
Real time data warehouse
随机推荐
STM32F1与STM32CubeIDE编程实例-MAX7219驱动8位7段数码管(基于GPIO)
ML之shap:基于boston波士顿房价回归预测数据集利用shap值对XGBoost模型实现可解释性案例
函数计算异步任务能力介绍 - 任务触发去重
Why do domestic mobile phone users choose iPhone when changing a mobile phone?
關於miui12.5 紅米k20pro用au或者povo2出現問題的解决辦法
Leetcode T48: rotating images
LVGL 8.2 text shadow
【算法leetcode】面试题 04.03. 特定深度节点链表(多语言实现)
LVGL 8.2 keyboard
C language course design questions
Industrial Internet has greater development potential and more industry scenarios
Visual Studio调试方式详解
leetcode:6109. Number of people who know the secret [definition of DP]
How to match chords
WT588F02B-8S(C006_03)单芯片语音ic方案为智能门铃设计降本增效赋能
Progress in architecture
EventBridge 在 SaaS 企业集成领域的探索与实践
Chapter 17 process memory
【云原生】我怎么会和这个数据库杠上了?
Query optimizer for SQL optimization