当前位置:网站首页>2021 Li Hongyi machine learning (1): basic concepts
2021 Li Hongyi machine learning (1): basic concepts
2022-07-05 02:38:00 【Three ears 01】
2021 Li hongyi machine learning (1): Basic concepts
B On the site 2021 Li Hongyi's learning notes of machine learning course , For reuse .
1 Basic concepts
Machine learning is ultimately about finding a function .
1.1 Different function categories
- Return to Regression—— Output is numeric
- classification Classification—— The output is in different categories classes, Do multiple choice questions
- Structural learning Structured Learning—— Generate a structured file ( Draw a picture 、 Write an article ), Let the machine learn to create
1.2 How to find functions (Training):
- First , Write a function with unknown parameters ;
- secondly , Definition loss( A function related to parameters ,MAE—— Absolute error ,MSE—— Mean square error );
- Last , Optimize , Find the loss Minimum parameters —— gradient descent
1) Randomly select the initial value of the parameter ;
2) Calculation ∂ L ∂ w ∣ w = w 0 \left.\frac{\partial L}{\partial w}\right|_{w=w^{0}} ∂w∂L∣∣w=w0, Then step down the gradient , The step size is l r × ∂ L ∂ w ∣ w = w 0 \left.lr\times\frac{\partial L}{\partial w}\right|_{w=w^{0}} lr×∂w∂L∣∣w=w0
3) Update parameters
This method has a huge drawback : Usually we will find Local minima, But what we want is global minima
1.3 Model
Linear model linear model There's a big limit , Cannot simulate polyline 、 Curve , This restriction is called model bias, So we need to improve .
How to improve :Piecewise Linear Curves
Many such sets can be fitted into curves .
1.3.1 sigmoid
It can be used sigmoid function y = c 1 1 + e − ( b + w x 1 ) = c s i g m o i d ( b + w x 1 ) y=c \frac{1}{1+e^{-\left(b+w x_{1}\right)}}=c sigmoid(b+wx_1) y=c1+e−(b+wx1)1=csigmoid(b+wx1) Fit the blue broken line :
y = b + ∑ i c i sigmoid ( b i + ∑ j w i j x j ) y=b+\sum_{i} c_{i} \operatorname{sigmoid}\left(b_{i}+\sum_{j} w_{i j} x_{j}\right) y=b+i∑cisigmoid(bi+j∑wijxj)
All unknown parameters in this , Use both θ \theta θ Express :
Use all at once θ \theta θ To calculate , Make a gradient descent , Such a large amount of data , Therefore, small batches are used batch:
every last data The number of updates depends on the total amount of data and batch Number :
1.3.2 ReLU
In front of it is soft sigmoid, That's the curve , In fact, you can use two ReLU Quasi synthesis hard sigmoid, That's the broken line :
above sigmoid The formula becomes :
1.3.3 Yes sigmoid The calculation of can be done several more times
There are many such layers , It is called neural network Neural Network, Later called Deep learning=Many hidden layers
边栏推荐
- Kotlin - coroutine
- 8. Commodity management - commodity classification
- Blue bridge - maximum common divisor and minimum common multiple
- Privatization lightweight continuous integration deployment scheme -- 01 environment configuration (Part 1)
- spoon插入更新oracle数据库,插了一部分提示报错Assertion botch: negative time
- Medusa installation and simple use
- Word processing software
- Variables in postman
- Data guard -- theoretical explanation (III)
- 低度酒赛道进入洗牌期,新品牌如何破局三大难题?
猜你喜欢
The most powerful new household god card of Bank of communications. Apply to earn 2100 yuan. Hurry up if you haven't applied!
Privatization lightweight continuous integration deployment scheme -- 01 environment configuration (Part 1)
. Net starts again happy 20th birthday
Hmi-31- [motion mode] solve the problem of picture display of music module
He was laid off.. 39 year old Ali P9, saved 150million
Elfk deployment
[技术发展-26]:新型信息与通信网络的数据安全
【LeetCode】98. Verify the binary search tree (2 brushes of wrong questions)
Avoid material "minefields"! Play with super high conversion rate
【LeetCode】111. Minimum depth of binary tree (2 brushes of wrong questions)
随机推荐
Spoon inserts and updates the Oracle database, and some prompts are inserted with errors. Assertion botch: negative time
Design and implementation of kindergarten management system
Asynchronous and promise
[understanding of opportunity -38]: Guiguzi - Chapter 5 flying clamp - warning one: there is a kind of killing called "killing"
Data guard -- theoretical explanation (III)
平台入驻与独立部署优缺点对比
RichView TRVStyle MainRVStyle
Vb+access hotel service management system
Pytorch register_ Hook (operate on gradient grad)
Start the remedial work. Print the contents of the array using the pointer
Some query constructors in laravel (2)
Exploration of short text analysis in the field of medical and health (I)
Redis distributed lock, lock code logic
Openresty ngx Lua Execution stage
Structure of ViewModel
ELK日志分析系统
The perfect car for successful people: BMW X7! Superior performance, excellent comfort and safety
100 basic multiple choice questions of C language (with answers) 04
【附源码】基于知识图谱的智能推荐系统-Sylvie小兔
Icu4c 70 source code download and compilation (win10, vs2022)