当前位置:网站首页>Advantages and disadvantages of evaluation methods
Advantages and disadvantages of evaluation methods
2022-07-06 10:25:00 【How about a song without trace】
1、 Over fitting : When the learner learns the training samples well , It is possible to take the characteristics of the trained samples as the general properties of all potential samples , This will lead to the decline of Pan China capability ( Generalization ability refers to the ability of the learning model to be applied to unknown samples ).
2、 Under fitting : Low learning ability , I think the general characteristics are all characteristics .
Evaluation methods :
- Set aside method : If the training set contains the vast majority of samples , Then the trained sample may be close to the desired training model , But because of the small test set , The assessment results may not be accurate enough , The pattern of basic partitioned data sets :2:1,4:1 The front is used for training , The latter is used for testing .
- Cross validation : Equal division , Stratified sampling , Take the mean , The defect is : Large data sets are too expensive , Spend more time .
- Self help law : Loop from the overall data into the sample , Put it back again , The final initial data are 0.368 The sample of does not appear , Used for testing . The self-help method can be used to test from the samples that appear in the initial data set , Such a test is also known as out of package estimation . advantage : The self-help method is smaller in the data set , It's hard to divide training effectively \ Test sets are useful , Multiple different training sets can be generated from the initial data set , shortcoming : But it changes the distribution of data sets , This will introduce Estimated deviation .
But when the initial data volume is enough , Set aside method and cross validation method are more commonly used .
Participate in the final parameter model :
General rules of parameter adjustment : Select a range and a varying step size for each parameter , This is a compromise between computational overhead and performance .
Performance metrics : Measure the pan China capability of the model , Performance depends not only on Algorithms and data , It also determines mission requirements .
The most commonly used performance measure for regression tasks : Mean square error .
Recall rate (TP/(TP+FN))、 Precision rate (TP/(TP+FP)):TP Real examples FP False positive example TN True counter example FN False counter example .
F1 It is based on the harmonic average of recall and precision :2*TP/( Total number of samples +TP-TN)
ROC: Characteristics of test work . The horizontal axis TPR( Real examples )=TP/(TP+FN), The vertical axis FPR( False positive example ):FP/(TN+FP).
Normalization : Map values from different ranges of variation to the same fixed range , Common is [0,1], Also known as normalization .
deviation : The difference between the expected output and the real tag , Describe the fitting ability of the learning algorithm itself .
Generalization error can be decomposed into deviation 、 variance ( Have you measured the change of learning performance caused by the change of the same size training set , The impact of data perturbation is characterized )、 And noise ( The lower bound of the expected generalization error that any learning algorithm can achieve in the current task is expressed ) The sum of the .
边栏推荐
- 13 medical registration system_ [wechat login]
- Introduction tutorial of typescript (dark horse programmer of station B)
- MySQL storage engine
- If someone asks you about the consistency of database cache, send this article directly to him
- MySQL35-主从复制
- C miscellaneous lecture continued
- C杂讲 文件 初讲
- 该不会还有人不懂用C语言写扫雷游戏吧
- MySQL实战优化高手07 生产经验:如何对生产环境中的数据库进行360度无死角压测?
- MySQL combat optimization expert 10 production experience: how to deploy visual reporting system for database monitoring system?
猜你喜欢

西南大学:胡航-关于学习行为和学习效果分析

The programming ranking list came out in February. Is the result as you expected?

16 医疗挂号系统_【预约下单】

The 32 year old programmer left and was admitted by pinduoduo and foreign enterprises. After drying out his annual salary, he sighed: it's hard to choose

Installation of pagoda and deployment of flask project

Super detailed steps to implement Wechat public number H5 Message push

Super detailed steps for pushing wechat official account H5 messages

宝塔的安装和flask项目部署

MySQL底层的逻辑架构

四川云教和双师模式
随机推荐
四川云教和双师模式
17 medical registration system_ [wechat Payment]
cmooc互联网+教育
docker MySQL解决时区问题
MySQL real battle optimization expert 11 starts with the addition, deletion and modification of data. Review the status of buffer pool in the database
Flash operation and maintenance script (running for a long time)
颜值爆表,推荐两款JSON可视化工具,配合Swagger使用真香
华南技术栈CNN+Bilstm+Attention
数据库中间件_Mycat总结
Write your own CPU Chapter 10 - learning notes
MySQL ERROR 1040: Too many connections
A necessary soft skill for Software Test Engineers: structured thinking
Windchill配置远程Oracle数据库连接
使用OVF Tool工具从Esxi 6.7中导出虚拟机
What is the current situation of the game industry in the Internet world?
C杂讲 动态链表操作 再讲
寶塔的安裝和flask項目部署
西南大学:胡航-关于学习行为和学习效果分析
① BOKE
vscode 常用的指令