当前位置:网站首页>Watermelon Book + pumpkin book chapter 1-2
Watermelon Book + pumpkin book chapter 1-2
2022-07-27 12:16:00 【phac123】
Watermelon book && The first two chapters of pumpkin book
1. Watermelon Chapter 1
1.1 Basic terminology

1.2 After-school exercises

I feel like I didn't do anything right , Not finished , Still need to continue thinking
2. Watermelon Book II
2.1 Basic terminology

2.2 Evaluation methods
2.2.1 Set aside method

2.2.2 Cross validation

2.2.3 Self help law
Every time from the dataset D Extract a data from , Hypothesis extraction m Time , Extraction of m The data obtained at this time forms a data set A; The remaining data is probably 0.368 Forming data sets B.
shortcoming : The self-help method does not have stratified sampling , Changed the data distribution , It introduces estimation bias .
2.3 Performance estimation
2.3.1 Precision and recall
Precision rate :P
p = The prediction is positive and correct /( All that are predicted to be positive )
Recall rate :R
R = The prediction is positive and correct / ( The number of positive samples in the original data set )
2.4 After-school exercises
3. Pumpkin book chapter 1
3.1 The formula 1.1 understand
Purpose : Calculate the samples other than the training set in the learner 
The error of the .( Under the learner A Express )
computing method :
sum = 0
for(i = h) {
//h It is the hypothesis generated by the learner , Go through all the assumptions
for(j = f(x)) {
// All over x,x It belongs to data other than training set data
sum += (h(x) != f(x) ? p(x) : 0) * p(h|X, A); //p(x)
}
}
h It's a learning device A Through samples X The resulting assumptions ; It is worth noting that h There are many possibilities .
therefore sigma P(h | X, εa) = 1
3.2 The formula 1.2 understand
understand : Formula one considers only one objective function , Formula 2 considers many, many objective functions ;( I want to explain “ There is no free lunch ” That's the truth )
There are two things to understand when pushing the formula to the middle :
- sigma Symbols want to exchange order , The premise is that the variables are independent of each other
- Because of the assumption that f It's evenly distributed , And it is a binary classification problem , All... No matter h There will be everything 1/2 The objective function value of quantity is equal to h Function values are equal .
Last , Found out “ No free lunch ” Theorem ——No Free Lunch Theorem.
4. Pumpkin book chapter 2
4.1 The formula 2.20 understand
Understand this formula , In fact, we want to know what is AUC, As long as we know ROC How did the curve get ,AUC It's natural to know ;
ROC The way the curve is drawn : First, the predicted value of the sample is used as the keyword to sort from large to small ; Of the axis x The axis is the false positive case rate ,y The axis is the real case rate ; Draw the first dot with 1 Is the threshold , The second point takes the predicted value of the first sample as the threshold ; When you encounter positive sample points, point up a point , When you encounter negative sample points, draw a point to the right ( This is just experience ); Rigorous drawing needs to follow the formula .( At the same time, a positive , A negative , Draw one on the top right )
4.2 The formula 2.21 understand
Understanding method 1 : It can be understood according to the pumpkin book
Understanding method 2 : I find it understandable without simplification ; First of all, we still need to put 1/2 extracted ; Then it can be understood that the unit distance is 1, Next to the last bracket 1/m+ and 1/m- Is the correction of unit distance ;( In essence, it is the trapezoidal area calculation method )
4.3 The formula 2.27 understand ( await a vacancy or job opening )
Forget the confidence interval
4.4 The formula 2.41 understand ( await a vacancy or job opening )
边栏推荐
- Leetcode 01: t1. sum of two numbers; T1108. IP address invalidation; T344. Reverse string
- Principle of PWM and generation of PWM wave
- Sword finger offer note: t45. arrange the array into the smallest number
- LNMP architecture setup (deploy discuz Forum)
- 配置更改删除了路由过滤器,分布路由器不堪重负:加拿大网络大瘫痪
- 广东财政多举措助力稳住粮食安全“压舱石”
- Simple blockchain day based on bolt database (2)
- Chapter 8 multithreading
- Go Beginner (5)
- Guangdong's finance has taken many measures to help stabilize the "ballast stone" of food security
猜你喜欢

Keil MDK compilation appears..\user\stm32f10x H (428): error: # 67: expected a "}" wrong solution

Shell编程之正则表达式(Shell脚本文本三剑客之grep)

Could not load dynamic library ‘libcudnn.so.8‘;

JS parasitic combinatorial inheritance

EfficientNet

Strictly control outdoor operation time! Foshan housing and Urban Rural Development Bureau issued a document: strengthening construction safety management during high temperature

STS下载教程(include官网无法下载解决方案)
Synchronous use reference of the new version of data warehouse (for beginners)

Adobe audit prompts that the sampling rate of audio input does not match the output device - problem solving
Ali II: what if the AOF file in redis is too large?
随机推荐
Difference quotient approximation of wechat quotient
Lonely young people can't quit jellycat
Weibo comment crawler + visualization
Sword finger offer notes: t58 - I. flip word order
npm踩坑
Multi activity disaster recovery construction after station B 713 accident | takintalks share
While loop instance in shell
JS parasitic combinatorial inheritance
[machine learning whiteboard derivation series] learning notes --- conditional random fields
Unity shader - Laser special effect shader[easy to understand]
[product] about wechat product analysis
查看系统下各个进程打开的文件描述符数量
Sword finger offer note: T39. Numbers that appear more than half of the time in the array
Keil MDK compilation appears..\user\stm32f10x H (428): error: # 67: expected a "}" wrong solution
Leetcode 02: sword finger offer 58 - I. flip the word order (simple); T123. Verify palindrome string; T9. Palindromes
快抖抢救“失意人”
我在英国TikTok做直播电商
CLS monitoring alarm: ensure high availability of online services in real time
Chapter 8 multithreading
Wechat applet must use interface "suggestions collection"