当前位置:网站首页>Learning notes of statistical learning methods -- Chapter 1 Introduction to statistical learning methods
Learning notes of statistical learning methods -- Chapter 1 Introduction to statistical learning methods
2022-07-05 21:25:00 【Raymond。】
Statistical learning methods learning notes -- Chapter one Introduction to statistical learning methods
- 1.1 Statistical learning
- 1.1.1 Basic steps of statistical learning
- 1.1.2 Statistical learning classification
- 1.1.3 Three elements of statistical learning method
- 1.1.4 Model evaluation and model selection
- 1.1.5 Regularization and cross validation -- Prevent over fitting
- 1.1.6 Generalization ability
- 1.1.7 Generation model and discrimination model
- 1.2 Supervised learning
1.1 Statistical learning
1.1.1 Basic steps of statistical learning
Steps to achieve statistical learning methods :
- Get a limited set of training data
- Determine the hypothetical space containing all possible models , That is, the set of learning models
- Determine the criteria for model selection , Learning strategies
- Determine the algorithm for solving the optimal model , Learning algorithm
- Choosing the best model by learning method
- Use the learned optimal model to predict and analyze new data
1.1.2 Statistical learning classification
Statistical learning includes supervised learning , Unsupervised learning , Semi supervised learning and intensive learning . Focus on supervised learning .
1.1.3 Three elements of statistical learning method
Statistical learning method = Model + Strategy + Algorithm
Model
A set of all possible mappings from input variables to output variablesStrategy
The optimal model .
How to measure the quality of the model : Loss function L(Y, f(X))( Measure the quality of a forecast ) And risk function ( The prediction of the model is good or bad in the average sense ).Algorithm
The calculation method of solving the optimal model .
1.1.4 Model evaluation and model selection
Training error and test error
Over fitting
The training error is small , The test error is large .( The noise is also studied and predicted )
1.1.5 Regularization and cross validation -- Prevent over fitting
Regularization
Add regularization term or penalty term to empirical risk , Measure the complexity of the model .Cross validation
When the data is enough , Divide the data into training sets , Verification set ( For model selection ) And test set .
1.1.6 Generalization ability
1.1.7 Generation model and discrimination model
- Generate models
Joint distribution by data learning P(X,Y), Find the conditional probability distribution P(Y|X).
Common generation models : Naive Bayes and hidden Markov model - Discriminant model
Learning decision function directly from data f(X) Or conditional probability distribution P(Y|X).
Common discriminant models :k a near neighbor , perceptron , Decision tree , Logistic regression model , Maximum entropy model , Support vector machine , Lifting method and condition random field .
1.2 Supervised learning
1.2.1 Basic concepts
- input space , Feature space and output space
Input ( Output ) Space is input ( Output ) All possible values . Each specific input is an instance , Usually represented by eigenvectors , The space where all eigenvectors exist is the eigenspace . - Classification of prediction problems
The problem that input and output are continuous variables is called regression problem .
The output variables are finite discrete variables, which is called classification problem .
The prediction problem in which both input and output variables are variable sequences is called marking problem . - Hypothetical space
The purpose of supervised learning is to learn a mapping from input to output , A map is represented by a model , The set of all mappings is called the hypothesis space . - Supervised learning model classification
It can be divided into probability models ( By conditional probability distribution P(Y|X) Express ) And non probabilistic models ( Decision function Y=f(X) Express ). The specific model is determined by the specific learning method .
1.2.2 Formalization of problems
- The process
The learning process ( Completed by the learning system ) And the prediction process ( Completed by the prediction system )
1.2.3 Application of supervised learning
Classification problem
The model is a classifier .AEC Dimension
The return question
边栏推荐
- PVC plastic sheets BS 476-6 determination of flame propagation properties
- Aitm2-0002 12s or 60s vertical combustion test
- Which securities company is better and which platform is safer for stock account opening
- Get JS of the previous day (timestamp conversion)
- MySQL 千万数据量深分页优化, 拒绝线上故障!
- 股票开户选择哪家证券公司比较好哪家平台更安全
- one hundred and twenty-three thousand four hundred and fifty-six
- 【日常训练】729. 我的日程安排表 I
- uni-app 蓝牙通信
- MySQL ifnull usage function
猜你喜欢
![[case] Application of element display and hiding -- element mask](/img/6e/6ea484a6e5d547e01dd8820af8e314.png)
[case] Application of element display and hiding -- element mask

Wood board ISO 5660-1 heat release rate mapping test

Teach yourself to train pytorch model to Caffe (III)

Zhang Lijun: penetrating uncertainty depends on four "invariants"

秋招将临 如何准备算法面试、回答算法面试题

Deployment of Jenkins under win7

学习机器人无从下手?带你体会当下机器人热门研究方向有哪些

显示屏DIN 4102-1 Class B1防火测试要求

力扣------经营摩天轮的最大利润

Learning robots have no way to start? Let me show you the current hot research directions of robots
随机推荐
Some things make feelings nowhere to put
Écrire une interface basée sur flask
Parker driver maintenance COMPAX controller maintenance cpx0200h
Feng Tang's "spring breeze is not as good as you" digital collection, logged into xirang on July 8!
Which securities company is better and which platform is safer for stock account opening
大二下个人发展小结
Influence of oscilloscope probe on signal source impedance
MySQL deep paging optimization with tens of millions of data, and online failure is rejected!
ArcGIS栅格重采样方法介绍
Haas506 2.0 development tutorial - Alibaba cloud OTA - PAC firmware upgrade (only supports versions above 2.2)
Golang(1)|从环境准备到快速上手
示波器探头对测量带宽的影响
【案例】定位的运用-淘宝轮播图
Hdu2377bus pass (build more complex diagram +spfa)
Why can't Chinese software companies produce products? Abandon the Internet after 00; Open source high-performance API gateway component of station B | weekly email exclusive to VIP members of Menon w
【案例】元素的显示与隐藏的运用--元素遮罩
Clion configures Visual Studio (MSVC) and JOM multi-core compilation
Mode - "Richter replacement principle"
字典树简单入门题(居然是蓝题?)
Determine the best implementation of horizontal and vertical screens