Learning notes on Statistical Learning Methods -- Chapter 1: Introduction to statistical learning methods
2022-07-05 21:25:00 【Raymond。】
- 1.1 Statistical learning
- 1.1.1 Basic steps of statistical learning
- 1.1.2 Statistical learning classification
- 1.1.3 Three elements of statistical learning method
- 1.1.4 Model evaluation and model selection
- 1.1.5 Regularization and cross-validation -- preventing overfitting
- 1.1.6 Generalization ability
- 1.1.7 Generative and discriminative models
- 1.2 Supervised learning
1.1 Statistical learning
1.1.1 Basic steps of statistical learning
Steps in applying a statistical learning method:
- Obtain a finite set of training data
- Determine the hypothesis space containing all candidate models, that is, the set of models to learn from
- Determine the criterion for model selection, that is, the learning strategy
- Implement the algorithm for solving for the optimal model, that is, the learning algorithm
- Select the optimal model using the learning method
- Use the learned optimal model to predict or analyze new data
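The steps above can be sketched on a toy problem. Everything in this sketch is an assumption made for illustration and does not come from the notes: the made-up data, the one-parameter model family f_w(x) = w * x, the squared loss, and the grid search.

```python
# Step 1: a finite training set (made-up numbers, roughly y = 2x).
train = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2), (4.0, 7.8)]

# Step 2: hypothesis space, here the models f_w(x) = w * x
# for w = 0.0, 0.1, ..., 4.0.
hypothesis_space = [w / 10 for w in range(41)]

# Step 3: strategy, here the average squared loss on the training set.
def empirical_risk(w, data):
    return sum((y - w * x) ** 2 for x, y in data) / len(data)

# Steps 4 and 5: algorithm, here an exhaustive search over the
# hypothesis space for the model with the smallest empirical risk.
best_w = min(hypothesis_space, key=lambda w: empirical_risk(w, train))

# Step 6: use the selected model on a new input.
def predict(x):
    return best_w * x

print(best_w, predict(5.0))
```

Real methods differ mainly in how large the hypothesis space is and how the search is done; exhaustive search only works here because the space has 41 members.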
1.1.2 Statistical learning classification
Statistical learning includes supervised learning, unsupervised learning, semi-supervised learning, and reinforcement learning. These notes focus on supervised learning.
1.1.3 Three elements of statistical learning method
Statistical learning method = model + strategy + algorithm
Model
The hypothesis space: the set of all possible mappings from input variables to output variables.
Strategy
The criterion for selecting the optimal model. Model quality is measured by the loss function L(Y, f(X)), which measures how good a single prediction is, and by the risk function, which measures how good the model's predictions are on average.
Algorithm
The computational method for solving for the optimal model.
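To make the loss function and the averaged risk concrete, here is a minimal sketch. The two losses are the standard 0-1 loss and squared loss; the function names, the toy data set, and the `sign_classifier` model are hypothetical. The average is taken over a sample, so it is the empirical risk rather than the expected risk.

```python
def zero_one_loss(y, y_hat):
    """0-1 loss: 1 for a wrong prediction, 0 for a correct one."""
    return 0 if y == y_hat else 1

def squared_loss(y, y_hat):
    """Squared loss for real-valued outputs."""
    return (y - y_hat) ** 2

def empirical_risk(loss, f, data):
    """Average loss of model f over a sample of (x, y) pairs."""
    return sum(loss(y, f(x)) for x, y in data) / len(data)

# Hypothetical labeled sample; the last point is on the "wrong" side.
data = [(-2.0, "neg"), (-1.0, "neg"), (1.0, "pos"), (3.0, "pos"), (-0.5, "pos")]

def sign_classifier(x):
    """A fixed toy model: predict by the sign of the input."""
    return "pos" if x > 0 else "neg"

# One of the five points is misclassified.
risk = empirical_risk(zero_one_loss, sign_classifier, data)
print(risk)
```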
1.1.4 Model evaluation and model selection
Training error and test error
Overfitting
The training error is small but the test error is large: the model has also learned the noise in the training data and reproduces it when predicting.
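The gap between training error and test error can be seen on hand-made data. In this sketch both data sets follow y = 2x with noise of +1 or -1; the data, the "memorizer" (a nearest-neighbor lookup that reproduces the training labels exactly), and the one-parameter linear model are assumptions chosen for illustration.

```python
# Hand-made data: y = 2x plus noise of +1 or -1.
train = [(1, 3), (2, 3), (3, 7), (4, 7), (5, 11)]
test = [(1.4, 1.8), (2.4, 5.8), (3.4, 5.8), (4.4, 9.8)]

def mse(f, data):
    """Mean squared error of model f on (x, y) pairs."""
    return sum((y - f(x)) ** 2 for x, y in data) / len(data)

def memorizer(x):
    """Overly flexible model: return the label of the nearest training point."""
    nearest = min(train, key=lambda point: abs(point[0] - x))
    return nearest[1]

# Simple model: least-squares line through the origin, f(x) = w * x.
w = sum(x * y for x, y in train) / sum(x * x for x, _ in train)

def linear(x):
    return w * x

for name, f in [("memorizer", memorizer), ("linear", linear)]:
    print(name, "train:", round(mse(f, train), 3), "test:", round(mse(f, test), 3))
```

The memorizer reaches zero training error because it stores the noise along with the signal, and it pays for that on the test set. The linear model has a larger training error but a smaller test error.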
1.1.5 Regularization and cross-validation -- preventing overfitting
Regularization
Add a regularization term (penalty term) that measures model complexity to the empirical risk.
Cross-validation
When data is plentiful, split it into a training set, a validation set (used for model selection), and a test set.
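A minimal sketch of both ideas follows. It assumes a one-parameter model f_w(x) = w * x, a squared-weight penalty lam * w**2 as the complexity measure, and a 60/20/20 split. None of these choices come from the notes, and the function names are hypothetical.

```python
def split(data, train_frac=0.6, val_frac=0.2):
    """Split data into training, validation, and test sets.
    In practice the data would be shuffled first."""
    n_train = round(len(data) * train_frac)
    n_val = round(len(data) * val_frac)
    return data[:n_train], data[n_train:n_train + n_val], data[n_train + n_val:]

def regularized_risk(w, data, lam):
    """Empirical risk (mean squared loss) plus the penalty lam * w**2."""
    empirical = sum((y - w * x) ** 2 for x, y in data) / len(data)
    return empirical + lam * w ** 2

def fit(data, lam):
    """Closed-form minimizer of regularized_risk for f_w(x) = w * x."""
    sxy = sum(x * y for x, y in data)
    sxx = sum(x * x for x, _ in data)
    return sxy / (sxx + len(data) * lam)

data = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2), (4.0, 7.8)]
print(fit(data, lam=0.0))  # no penalty
print(fit(data, lam=1.0))  # the penalty shrinks the weight toward zero
```

The penalty weight `lam` is the kind of setting a validation set is used to choose: fit on the training set for several values of `lam`, then keep the one with the smallest validation error.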
1.1.6 Generalization ability
1.1.7 Generative and discriminative models
- Generative models
Learn the joint distribution P(X,Y) from the data, then derive the conditional probability distribution P(Y|X) from it.
Common generative models: naive Bayes and the hidden Markov model.
- Discriminative models
Learn the decision function f(X) or the conditional probability distribution P(Y|X) directly from the data.
Common discriminative models: k-nearest neighbors, the perceptron, decision trees, logistic regression, the maximum entropy model, support vector machines, boosting, and conditional random fields.
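The generative route, from the joint distribution to the conditional one, comes down to P(Y|X) = P(X,Y) / P(X). The sketch below applies that identity to a small made-up joint table; the weather and activity values and the function name are hypothetical.

```python
# A made-up joint distribution P(X, Y) over weather X and activity Y.
joint = {
    ("sunny", "play"): 0.4,
    ("sunny", "stay"): 0.1,
    ("rainy", "play"): 0.1,
    ("rainy", "stay"): 0.4,
}

def conditional(joint, x):
    """Return P(Y | X = x) as a dict, using P(Y|X) = P(X,Y) / P(X)."""
    p_x = sum(p for (xi, _), p in joint.items() if xi == x)
    return {y: p / p_x for (xi, y), p in joint.items() if xi == x}

p_y_given_sunny = conditional(joint, "sunny")
print(p_y_given_sunny)

# A prediction is the most probable output under P(Y | X = x).
print(max(p_y_given_sunny, key=p_y_given_sunny.get))
```

A discriminative model skips the joint table and estimates the conditional distribution, or a decision function, directly.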
1.2 Supervised learning
1.2.1 Basic concepts
- Input space, feature space, and output space
The input (output) space is the set of all possible input (output) values. Each specific input is an instance, usually represented by a feature vector; the space containing all feature vectors is the feature space.
- Types of prediction problems
A prediction problem in which both the input and output variables are continuous is called a regression problem.
A prediction problem in which the output variable takes finitely many discrete values is called a classification problem.
A prediction problem in which both the input and output variables are sequences of variables is called a tagging problem.
- Hypothesis space
The purpose of supervised learning is to learn a mapping from input to output. The mapping is represented by a model, and the set of all candidate mappings is called the hypothesis space.
- Types of supervised learning models
Models divide into probabilistic models (represented by the conditional probability distribution P(Y|X)) and non-probabilistic models (represented by the decision function Y=f(X)). Which form is used depends on the specific learning method.
1.2.2 Formalization of the problem
- The process
Supervised learning consists of a learning process (carried out by the learning system) and a prediction process (carried out by the prediction system).
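The two processes can be sketched as two functions: one that turns a training set into a model, and one that applies the model to a new input. The nearest-centroid classifier on one-dimensional inputs, the function names, and the data are assumptions for illustration, not the method described in the notes.

```python
def learn(training_set):
    """Learning system: build a model from (x, y) pairs.
    Here the model is the mean input value (centroid) of each class."""
    sums, counts = {}, {}
    for x, y in training_set:
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}

def predict(model, x_new):
    """Prediction system: return the class whose centroid is closest to x_new."""
    return min(model, key=lambda label: abs(model[label] - x_new))

training_set = [(1.0, "a"), (2.0, "a"), (8.0, "b"), (9.0, "b")]
model = learn(training_set)
print(model)
print(predict(model, 3.0), predict(model, 7.0))
```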
1.2.3 Application of supervised learning
Classification problem
The model is a classifier.
Regression problem