当前位置:网站首页>[ml] Li Hongyi III: gradient descent & Classification (Gaussian distribution)
[ml] Li Hongyi III: gradient descent & Classification (Gaussian distribution)
2022-07-02 23:26:00 【Exotic moon】

The normal of the contour line is perpendicular to the tangent ,:
Every time calculation
One time return , Then according to the gradient of return point ; Then calculate a new return point according to the result , Then gradient down .....

When doing gradient descent , Careful adjustment learning rate


It is better to have one for every parameter learning rate, Recommended adagrad

case: Use the root mean square of all differential values in the past



explain :

When only one parameter is considered :
When considering multiple parameters , The above discussion is not necessarily true :
Just look at w1( Blue ):a Than b Far away , Then the greater the differential value
Just look at w2( green ):c Than d Far away , Then the greater the differential value
But not when combined :a The differential value of is significantly higher than c Small , however a Farther from the origin , So cross parameter comparison , The above is not true !

So the right thing to do is : Use first-order differential value / Quadratic differential value , To calculate the distance from the lowest point

It's a constant ,
Express a differential ,
To replace quadratic differentiation ( In order to calculate )

Stochastic gradient descent :


use vector To describe an input ( Baokemeng )


Maximum likelihood estimation : Calculate the probability of the source from the results
Gaussian distribution
, Suppose it's a function, Each point is sampled from the Gaussian distribution .
The closer each point is to the center of the yellow , The greater the probability of sampling


Because the probability of each blue dot is independent , So yellow dot sample all 79 The probability of a blue dot is equal to each person who wants to multiply
Calculate the maximum likelihood value : Take the average ( Because Gaussian distribution is normal distribution )
Calculate the most likely sample After the maximum likelihood of all blue dots , Start sorting :

The classification effect of water system and common system is not very good , The correct rate is only 47%, What about Shengwei ?
Add attribute to 7 only
7 Dimensional effect is still not ideal , Reduce two function Parameters of 
After sharing the covariance matrix , Improved accuracy
3 Step summary :

Posterior probability :
Simplify consensus :
边栏推荐
- (stinger) use pystinger Socks4 to go online and not go out of the network host
- 公司里只有一个测试是什么体验?听听他们怎么说吧
- Win11如何开启目视控制?Win11开启目视控制的方法
- Makefile configuration of Hisilicon calling interface
- (毒刺)利用Pystinger Socks4上线不出网主机
- Pandora IOT development board learning (HAL Library) - Experiment 3 key input experiment (learning notes)
- YOLOX加强特征提取网络Panet分析
- Interface switching based on pyqt5 toolbar button -1
- php 获取真实ip
- 简述中台的常识
猜你喜欢

Golang common settings - modify background

Compose 中的 'ViewPager' 详解 | 开发者说·DTalk

C#中Linq用法汇集

数字图像处理实验目录

Convolution和Batch normalization的融合

golang入门:for...range修改切片中元素的值的另类方法

Win11如何开启目视控制?Win11开启目视控制的方法

Detailed explanation of 'viewpager' in compose | developer said · dtalk

基于Pyqt5工具栏按钮可实现界面切换-2

Li Kou brush questions (2022-6-28)
随机推荐
BBR encounters cubic
(毒刺)利用Pystinger Socks4上线不出网主机
Simple square wave generating circuit [51 single chip microcomputer and 8253a]
php 获取真实ip
Application of containerization technology in embedded field
2022年最新最全软件测试面试题大全
Win11如何开启目视控制?Win11开启目视控制的方法
用matlab调用vs2015来编译vs工程
在SOUI里使用真窗口时使用SOUI的滚动条
ADC of stm32
The use of 8255 interface chip and ADC0809
Doorplate making C language
Numerical solution of partial differential equations with MATLAB
Win11启用粘滞键关闭不了怎么办?粘滞键取消了但不管用怎么解决
RuntimeError: no valid convolution algorithms available in CuDNN
Convolution和Batch normalization的融合
CDN acceleration requires the domain name to be filed first
The difference between new and make in golang
Go basic data type
Brief introduction to common sense of Zhongtai