Performance measures for classification models
2022-07-07 16:11:00 【_ Spring_】
Confusion matrix
In binary classification, the actual data can be divided into positive examples and negative examples. Combining the category predicted by the model with the actual category of the data gives four possible outcomes: true positive (TP), false positive (FP), true negative (TN), and false negative (FN).
If the predicted category matches the actual category, the outcome is "true"; otherwise it is "false". In other words, a true positive means the model's prediction agrees with reality and the sample is a positive example. A false positive is predicted as positive, but the prediction is wrong (the sample is actually negative). A false negative is predicted as negative, but the prediction is wrong (the sample is actually positive).
Using TP, FP, TN, and FN to denote the number of samples with each outcome, the confusion matrix of the classification results is:
| | Predicted positive | Predicted negative |
|---|---|---|
| Actual positive | TP | FN |
| Actual negative | FP | TN |
Consider an example. There are 100 students: 70 boys and 30 girls. A model classifies each student as a boy or a girl, with boys treated as the positive class. The model predicts 60 boys (55 of them really are boys, the other 5 are girls) and 40 girls (25 of them really are girls, the other 15 are boys). The confusion matrix is then:
| | Predicted boy | Predicted girl |
|---|---|---|
| Actual boy | 55 | 15 |
| Actual girl | 5 | 25 |
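The counting behind this table can be sketched in a few lines of Python. This is only an illustration: the function name `confusion_counts` and the label strings are assumptions made for the example, and the label lists are rebuilt from the counts given above.

```python
def confusion_counts(y_true, y_pred, positive):
    """Count (TP, FN, FP, TN) for a binary problem, given the positive label."""
    tp = fn = fp = tn = 0
    for actual, predicted in zip(y_true, y_pred):
        if actual == positive:
            if predicted == positive:
                tp += 1  # actual positive, predicted positive
            else:
                fn += 1  # actual positive, predicted negative
        else:
            if predicted == positive:
                fp += 1  # actual negative, predicted positive
            else:
                tn += 1  # actual negative, predicted negative
    return tp, fn, fp, tn


# Rebuild the example: 70 boys (55 predicted boy, 15 predicted girl),
# then 30 girls (5 predicted boy, 25 predicted girl).
y_true = ["boy"] * 70 + ["girl"] * 30
y_pred = ["boy"] * 55 + ["girl"] * 15 + ["boy"] * 5 + ["girl"] * 25

tp, fn, fp, tn = confusion_counts(y_true, y_pred, positive="boy")
print(tp, fn, fp, tn)  # 55 15 5 25
```

The four numbers map onto the table row by row: the first row is (TP, FN) and the second row is (FP, TN).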
Commonly used evaluation metrics
Accuracy
Also called the accuracy rate.
It is the proportion of correctly classified samples among all samples.
In the confusion matrix, TP and TN are both correctly classified samples, so
$$Accuracy = \frac{TP+TN}{TP+FP+FN+TN}$$
For the boys-and-girls example above, accuracy = (55 + 25) / 100 = 0.80.
Precision
Also called the precision rate. It should not be confused with accuracy.
It is the proportion of real positive examples among the samples that the model predicts as positive, so
$$Precision = \frac{TP}{TP+FP}$$
A typical scenario is "how much of the retrieved information is the user really interested in".
For the boys-and-girls example above, precision = 55 / (55 + 5) = 0.917.
Recall
Also called the recall rate.
It is the proportion of all actual positive examples that the model predicts as positive, so
$$Recall = \frac{TP}{TP+FN}$$
A typical scenario is "of all the fugitives in the database, how many can the retrieval system find".
For the boys-and-girls example above, recall = 55 / (55 + 15) = 0.786.
F1 score
The commonly used F1 score is calculated as:
$$F1 = \frac{2 \cdot Precision \cdot Recall}{Precision + Recall} = \frac{2 \cdot TP}{\text{total number of samples} + TP - TN}$$
For the boys-and-girls example above, F1 = (2 × 0.917 × 0.786) / (0.917 + 0.786) = 0.846, or equivalently F1 = (2 × 55) / (100 + 55 − 25) = 0.846.
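The four worked calculations above can be checked with a short Python sketch. The function name `classification_metrics` is an assumption for illustration, and the sketch assumes every denominator is non-zero (it does no special handling for a model that predicts no positives).

```python
def classification_metrics(tp, fn, fp, tn):
    """Return (accuracy, precision, recall, f1) from confusion-matrix counts.

    Assumes tp + fp > 0, tp + fn > 0 and precision + recall > 0.
    """
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1


# Counts from the boys-and-girls example: TP=55, FN=15, FP=5, TN=25.
acc, precision, recall, f1 = classification_metrics(tp=55, fn=15, fp=5, tn=25)
print(round(acc, 3), round(precision, 3), round(recall, 3), round(f1, 3))
# 0.8 0.917 0.786 0.846

# The second form of F1 gives the same value: 2*TP / (total + TP - TN).
f1_alt = 2 * 55 / (100 + 55 - 25)
print(round(f1_alt, 3))  # 0.846
```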
PR curve
The PR curve plots precision on the vertical axis against recall on the horizontal axis.
The larger the area under the PR curve, the better the model's performance.
The point on the curve where precision and recall are equal is called the break-even point (BEP).
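One common way to obtain the points of a PR curve is to sort the samples by predicted score and sweep a decision threshold over them. The sketch below assumes that approach; the scores and labels are made-up toy data, and `pr_points` is a hypothetical helper name.

```python
def pr_points(y_true, scores):
    """Return (recall, precision) pairs, one per distinct score threshold.

    A sample is predicted positive when its score is >= the threshold.
    Labels are 1 for positive and 0 for negative.
    """
    total_pos = sum(y_true)
    points = []
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
        points.append((tp / total_pos, tp / (tp + fp)))
    return points


# Toy data (assumed for illustration): 3 positives and 3 negatives.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

points = pr_points(y_true, scores)
for recall, precision in points:
    print(f"recall={recall:.3f}  precision={precision:.3f}")

# Break-even point(s): where precision equals recall.
break_even = [(r, p) for r, p in points if abs(r - p) < 1e-9]
print(break_even)
```

On this toy data the break-even point falls at the threshold 0.7, where precision and recall are both 2/3.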
ROC curve
ROC stands for "Receiver Operating Characteristic".
The vertical axis is the true positive rate (TPR) and the horizontal axis is the false positive rate (FPR).
$$TPR = \frac{TP}{TP+FN}$$
$$FPR = \frac{FP}{TN+FP}$$
AUC
AUC stands for Area Under the ROC Curve, that is, the area under the ROC curve.
The larger the area, the better the model's performance.
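As a rough illustration, the ROC points can be computed by sweeping a threshold over the predicted scores, and the AUC can then be approximated with the trapezoidal rule. The helper names `roc_points` and `auc` and the toy data are assumptions made for this sketch.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) pairs, starting at (0, 0), one per score threshold.

    A sample is predicted positive when its score is >= the threshold.
    Labels are 1 for positive and 0 for negative.
    """
    pos = sum(y_true)
    neg = len(y_true) - pos
    points = [(0.0, 0.0)]
    for threshold in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc(points):
    """Area under a piecewise-linear curve, using the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2
    return area


# Toy data (assumed for illustration): 3 positives and 3 negatives.
y_true = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

roc = roc_points(y_true, scores)
print(roc[-1])              # (1.0, 1.0): everything predicted positive
print(round(auc(roc), 3))   # 0.889
```

The value 8/9 ≈ 0.889 matches a direct count: of the 9 positive-negative pairs in the toy data, 8 have the positive sample scored higher than the negative one.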
Cost curve
The metrics above focus mainly on TP, but in some scenarios FP and FN matter just as much.
Take a medical example: a cancer diagnosis classifier may classify a healthy person as a cancer patient, or classify a cancer patient as healthy. Both cases are misclassifications, but the consequences of the latter are more serious.
Therefore, to measure the different losses caused by different kinds of errors, each type of error can be assigned an unequal cost.
In that case, the quantity to minimize is the total cost rather than simply the number of errors.
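A minimal sketch of the total-cost idea, assuming a simple linear cost in which each false positive and each false negative has a fixed price. The cost values and the two sets of error counts below are hypothetical numbers chosen only for illustration.

```python
def total_cost(fp, fn, cost_fp, cost_fn):
    """Total cost when each FP costs cost_fp and each FN costs cost_fn."""
    return fp * cost_fp + fn * cost_fn


# Hypothetical costs: missing a cancer patient (FN) is assumed to be
# 10 times as costly as alarming a healthy person (FP).
COST_FP = 1
COST_FN = 10

# Two hypothetical classifiers with the same number of errors (25 each),
# hence the same accuracy on the same data set.
cost_a = total_cost(fp=20, fn=5, cost_fp=COST_FP, cost_fn=COST_FN)
cost_b = total_cost(fp=5, fn=20, cost_fp=COST_FP, cost_fn=COST_FN)

print(cost_a)  # 70
print(cost_b)  # 205
```

Both classifiers make 25 mistakes, yet under these assumed costs the first is far cheaper, which is the kind of difference that accuracy alone cannot show.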