Detailed explanation of linear regression in machine learning
2022-07-01 10:24:00 【HUIM_ Wang】
Linear regression algorithm:
The focus here is the gradient descent algorithm.
How to evaluate the model: the loss function
The simplest and most common loss function is the mean squared error (MSE).
The formula is as follows:
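In its standard form the mean squared error can be written as below. The symbol names are chosen here for illustration: y_i is the actual value of sample i, \hat{y}_i is the model's prediction for it, and n is the number of samples.

```latex
\mathrm{MSE} = \frac{1}{n} \sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2
```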
Take house-price prediction as an example. With the hypothetical model y = 1, the resulting MSE of 60132 has no practical meaning on its own; it is only useful for comparison, where a smaller value is better. Ideally the MSE is as close to 0 as possible, but for this data sample it cannot be exactly 0.
Suppose we are predicting house prices: the feature is the floor area and the target is the price. We need to fit a line y = mx + b, that is, compute the weights m and b.
Step one: assume m = 0, so that y = b and b is the only adjustable parameter. Compute the MSE for each candidate b; the b that gives the smallest MSE is the best fit.
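Step one can be sketched as a brute-force search. This is only an illustration: the `prices` list is made-up data (the article's data set is not shown), chosen so that its mean is 241, and the helper names `mse` and `mse_for_constant` are hypothetical.

```python
# Hypothetical house prices; NOT the article's data set.
prices = [200.0, 230.0, 245.0, 260.0, 270.0]


def mse(targets, predictions):
    """Mean squared error: the average of the squared differences."""
    n = len(targets)
    return sum((y - p) ** 2 for y, p in zip(targets, predictions)) / n


def mse_for_constant(b, targets):
    """MSE of the model y = b (the m = 0 case)."""
    return mse(targets, [b] * len(targets))


# Try integer candidates for b one by one and keep the one with the smallest MSE.
candidates = range(1, 501)
best_b = min(candidates, key=lambda b: mse_for_constant(b, prices))
print(best_b, mse_for_constant(best_b, prices))
```

For a constant model the best b is simply the mean of the targets, which is why the search on this made-up data stops at 241. The search range `1..500` is itself a guess, which is the weakness the next paragraphs address.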

From the figure, the "optimal" b should be near 241, where MSE = 612; this value matches the house prices best. (This approach has to try values of b one by one to find the smallest MSE, which is tedious. In the figure the search started from b = 1, but the range of b is not known in advance, and the search becomes awkward if b turns out to be negative.)
To find the best b quickly and accurately, we need a new concept: the learning rate.
Let's look at a set of pictures first:
We need to find the point b with the smallest MSE.



Whatever value b starts from, the computer has to find the minimum MSE dynamically by following the trend of the curve, that is, its derivative (slope).

In the figure above, the derivative that is smallest in magnitude is -8, which corresponds to the smallest MSE; here b = 241.
Even so, this b is only the lowest point we can spot by eye, and the changes to b were entered by hand, which a computer cannot do on its own. We need some signal that tells the computer how to adjust b.
If the curve is steep and falling, as in figure b, the slope is strongly negative and the next guess should lie to the right, not the left. In the situation of figure a, the slope is large and positive, so b should shift to the left. In short: if the slope is negative, move the next guess to the right; if it is positive, move it to the left; continue until the curve flattens out and the derivative is close to 0.
We mentioned the learning rate above. Combined with the derivative of the MSE with respect to b, it lets us find the best b quickly: the larger the magnitude of the derivative, the larger the change to b; the closer the derivative is to 0, the smaller the change.
The learning rate is a number, for example 0.0001. It is used like this: new b = previous b - previous derivative * learning rate. Repeating this update quickly finds the b whose derivative is closest to 0.
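The update rule above can be sketched for the m = 0 model as follows. The data, the starting point `b_start`, the step count and the function names are all assumptions made for illustration. For the model y = b, the derivative of the MSE with respect to b is (2/n) * sum(b - y_i).

```python
# Hypothetical house prices; NOT the article's data set.
prices = [200.0, 230.0, 245.0, 260.0, 270.0]


def mse_derivative_b(b, targets):
    """Derivative of the MSE with respect to b for the model y = b."""
    n = len(targets)
    return 2.0 * sum(b - y for y in targets) / n


def descend_b(targets, learning_rate, steps, b_start=0.0):
    """Repeat: new b = previous b - previous derivative * learning rate."""
    b = b_start
    for _ in range(steps):
        b = b - mse_derivative_b(b, targets) * learning_rate
    return b


b = descend_b(prices, learning_rate=0.1, steps=200)
print(b, mse_derivative_b(b, prices))
```

A negative derivative makes the subtraction push b to the right and a positive one pushes it to the left, so the sign rule described earlier falls out of the formula without any special cases.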
The smaller the learning rate, for example 0.000001, the more slowly b changes; the larger the learning rate, for example 0.1 or 0.8, the faster b changes. Slower change means more iterations and more computation, but a more accurate final b; faster change means less computation, but b may not end up at the best value.
The figures below show learning rates of 0.00001, 0.001, 0.01 and 0.1:




With a learning rate of 0.2, b gradually approaches 245 and the derivative gradually approaches 0.
The learning rate must not be too large, otherwise b only moves further and further away from the correct point.
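The effect of the learning rate can be seen in a small experiment. This is a sketch on made-up data, not a reproduction of the article's figures; the rates tried, the 50-step budget and the names `derivative` and `run` are assumptions. On this toy loss any rate above 1 overshoots further on every step; the threshold depends on the data.

```python
# Hypothetical house prices; NOT the article's data set. Their mean is 241.
prices = [200.0, 230.0, 245.0, 260.0, 270.0]


def derivative(b):
    """Derivative of the MSE with respect to b for the model y = b."""
    return 2.0 * sum(b - y for y in prices) / len(prices)


def run(learning_rate, steps=50, b_start=0.0):
    """Apply the gradient descent update a fixed number of times."""
    b = b_start
    for _ in range(steps):
        b -= derivative(b) * learning_rate
    return b


# Compare a very small, a small, a moderate and a too-large learning rate.
results = {lr: run(lr) for lr in (0.00001, 0.01, 0.2, 1.5)}
for lr, b in results.items():
    print(f"learning rate {lr}: b = {b:.3f}, distance from 241 = {abs(b - 241):.3f}")
```

With the same step budget, the smallest rate has barely moved, the moderate rate has settled at the minimum, and the too-large rate has run away from it.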
To keep the model simple, everything above used y = mx + b with m = 0 and solved only for the adjustable parameter (weight) b. (This is an extreme case; normally m ≠ 0.)
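So in general we take the derivative of the MSE with respect to m and with respect to b separately.
The derivative with respect to b is obtained in the same way as the one with respect to m.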
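A minimal sketch of the general case, assuming the standard MSE gradients for the line y = mx + b: dMSE/dm = (2/n) * sum(x_i * (m*x_i + b - y_i)) and dMSE/db = (2/n) * sum(m*x_i + b - y_i). The data is made up so that it lies exactly on y = 2x + 1, and the learning rate, step count and function names are illustrative choices.

```python
# Hypothetical data lying exactly on the line y = 2x + 1.
areas = [1.0, 2.0, 3.0, 4.0]
prices = [3.0, 5.0, 7.0, 9.0]


def gradients(m, b, xs, ys):
    """Partial derivatives of the MSE with respect to m and b."""
    n = len(xs)
    errors = [m * x + b - y for x, y in zip(xs, ys)]
    d_m = 2.0 * sum(e * x for e, x in zip(errors, xs)) / n
    d_b = 2.0 * sum(errors) / n
    return d_m, d_b


def fit_line(xs, ys, learning_rate=0.05, steps=5000):
    """Update m and b together, each with its own derivative."""
    m, b = 0.0, 0.0
    for _ in range(steps):
        d_m, d_b = gradients(m, b, xs, ys)
        m -= d_m * learning_rate
        b -= d_b * learning_rate
    return m, b


m, b = fit_line(areas, prices)
print(m, b)
```

Both derivatives are computed from the same m and b before either parameter is updated, so each step uses a consistent gradient. With real floor areas in the tens or hundreds, the features would usually need scaling or a much smaller learning rate for the updates to converge.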