当前位置:网站首页>Watermelon Book + pumpkin book chapter 1-2
Watermelon Book + pumpkin book chapter 1-2
2022-07-27 12:16:00 【phac123】
Watermelon book && The first two chapters of pumpkin book
1. Watermelon Chapter 1
1.1 Basic terminology

1.2 After-school exercises

I feel like I didn't do anything right , Not finished , Still need to continue thinking
2. Watermelon Book II
2.1 Basic terminology

2.2 Evaluation methods
2.2.1 Set aside method

2.2.2 Cross validation

2.2.3 Self help law
Every time from the dataset D Extract a data from , Hypothesis extraction m Time , Extraction of m The data obtained at this time forms a data set A; The remaining data is probably 0.368 Forming data sets B.
shortcoming : The self-help method does not have stratified sampling , Changed the data distribution , It introduces estimation bias .
2.3 Performance estimation
2.3.1 Precision and recall
Precision rate :P
p = The prediction is positive and correct /( All that are predicted to be positive )
Recall rate :R
R = The prediction is positive and correct / ( The number of positive samples in the original data set )
2.4 After-school exercises
3. Pumpkin book chapter 1
3.1 The formula 1.1 understand
Purpose : Calculate the samples other than the training set in the learner 
The error of the .( Under the learner A Express )
computing method :
sum = 0
for(i = h) {
//h It is the hypothesis generated by the learner , Go through all the assumptions
for(j = f(x)) {
// All over x,x It belongs to data other than training set data
sum += (h(x) != f(x) ? p(x) : 0) * p(h|X, A); //p(x)
}
}
h It's a learning device A Through samples X The resulting assumptions ; It is worth noting that h There are many possibilities .
therefore sigma P(h | X, εa) = 1
3.2 The formula 1.2 understand
understand : Formula one considers only one objective function , Formula 2 considers many, many objective functions ;( I want to explain “ There is no free lunch ” That's the truth )
There are two things to understand when pushing the formula to the middle :
- sigma Symbols want to exchange order , The premise is that the variables are independent of each other
- Because of the assumption that f It's evenly distributed , And it is a binary classification problem , All... No matter h There will be everything 1/2 The objective function value of quantity is equal to h Function values are equal .
Last , Found out “ No free lunch ” Theorem ——No Free Lunch Theorem.
4. Pumpkin book chapter 2
4.1 The formula 2.20 understand
Understand this formula , In fact, we want to know what is AUC, As long as we know ROC How did the curve get ,AUC It's natural to know ;
ROC The way the curve is drawn : First, the predicted value of the sample is used as the keyword to sort from large to small ; Of the axis x The axis is the false positive case rate ,y The axis is the real case rate ; Draw the first dot with 1 Is the threshold , The second point takes the predicted value of the first sample as the threshold ; When you encounter positive sample points, point up a point , When you encounter negative sample points, draw a point to the right ( This is just experience ); Rigorous drawing needs to follow the formula .( At the same time, a positive , A negative , Draw one on the top right )
4.2 The formula 2.21 understand
Understanding method 1 : It can be understood according to the pumpkin book
Understanding method 2 : I find it understandable without simplification ; First of all, we still need to put 1/2 extracted ; Then it can be understood that the unit distance is 1, Next to the last bracket 1/m+ and 1/m- Is the correction of unit distance ;( In essence, it is the trapezoidal area calculation method )
4.3 The formula 2.27 understand ( await a vacancy or job opening )
Forget the confidence interval
4.4 The formula 2.41 understand ( await a vacancy or job opening )
边栏推荐
- Keil MDK compilation appears..\user\stm32f10x H (428): error: # 67: expected a "}" wrong solution
- [machine learning whiteboard derivation series] learning notes --- conditional random fields
- MySQL paging query instance_ MySQL paging query example explanation "suggestions collection"
- Shell编程之正则表达式(Shell脚本文本三剑客之grep)
- Difference between verification and calibration
- deeplab系列详解(简单实用年度总结)
- Principle of PWM and generation of PWM wave
- [网摘][医学影像] 常用的DICOM缩略图解释以及Viewer converter 转换工具
- Principle of control system based on feedback rate
- Guangdong's finance has taken many measures to help stabilize the "ballast stone" of food security
猜你喜欢

快抖抢救“失意人”

Shell script text three swordsmen sed

Interaction free shell programming

How to make a graph? Multiple subgraphs in a graph are histogram (or other graphs)

Temporary use of solo, difficult choice of Blog

STS download tutorial (the solution cannot be downloaded on the include official website)

Shell script text three swordsman awk

Beyond compare 3 next difference segment / down search arrow not found
Ali II: what if the AOF file in redis is too large?

5V升压9V芯片
随机推荐
NPM step pit
53 亿 BI 市场 TOP 10:帆软、微软、永洪、SAP、百度、IBM、SAS、思迈特、Salesforce、浪潮通软
compute_ class_ weight() takes 1 positional argument but 3 were given
Sword finger offer note: T39. Numbers that appear more than half of the time in the array
Greek alphabet reading
JS-寄生组合式继承
Sword finger offer notes: T53 - I. find numbers in the sorted array
Firewalld防火墙
LNMP architecture setup (deploy discuz Forum)
Unexpected harvest of epic distributed resources, from basic to advanced are full of dry goods, big guys are strong!
Solution: can not issue executeupdate() or executelargeupdate() for selections
w.r.t. ; i.e.; etc.; e. G. what does it mean
5V升压9V芯片
Sword finger offer note: t45. arrange the array into the smallest number
Write and read system temporary files: createtempfile and tempfilecontent[easy to understand]
Sword finger offer notes: T53 - ii Missing numbers from 0 to n-1
Simple blockchain day based on bolt database (2)
Leetcode 02: sword finger offer 58 - I. flip the word order (simple); T123. Verify palindrome string; T9. Palindromes
源码编译安装LAMP
After Party A's hard work, 49.08 million orders of China Mobile were scrapped