当前位置:网站首页>Advantages and disadvantages of evaluation methods
Advantages and disadvantages of evaluation methods
2022-07-06 10:25:00 【How about a song without trace】
1、 Over fitting : When the learner learns the training samples well , It is possible to take the characteristics of the trained samples as the general properties of all potential samples , This will lead to the decline of Pan China capability ( Generalization ability refers to the ability of the learning model to be applied to unknown samples ).
2、 Under fitting : Low learning ability , I think the general characteristics are all characteristics .
Evaluation methods :
- Set aside method : If the training set contains the vast majority of samples , Then the trained sample may be close to the desired training model , But because of the small test set , The assessment results may not be accurate enough , The pattern of basic partitioned data sets :2:1,4:1 The front is used for training , The latter is used for testing .
- Cross validation : Equal division , Stratified sampling , Take the mean , The defect is : Large data sets are too expensive , Spend more time .
- Self help law : Loop from the overall data into the sample , Put it back again , The final initial data are 0.368 The sample of does not appear , Used for testing . The self-help method can be used to test from the samples that appear in the initial data set , Such a test is also known as out of package estimation . advantage : The self-help method is smaller in the data set , It's hard to divide training effectively \ Test sets are useful , Multiple different training sets can be generated from the initial data set , shortcoming : But it changes the distribution of data sets , This will introduce Estimated deviation .
But when the initial data volume is enough , Set aside method and cross validation method are more commonly used .
Participate in the final parameter model :
General rules of parameter adjustment : Select a range and a varying step size for each parameter , This is a compromise between computational overhead and performance .
Performance metrics : Measure the pan China capability of the model , Performance depends not only on Algorithms and data , It also determines mission requirements .
The most commonly used performance measure for regression tasks : Mean square error .
Recall rate (TP/(TP+FN))、 Precision rate (TP/(TP+FP)):TP Real examples FP False positive example TN True counter example FN False counter example .
F1 It is based on the harmonic average of recall and precision :2*TP/( Total number of samples +TP-TN)
ROC: Characteristics of test work . The horizontal axis TPR( Real examples )=TP/(TP+FN), The vertical axis FPR( False positive example ):FP/(TN+FP).
Normalization : Map values from different ranges of variation to the same fixed range , Common is [0,1], Also known as normalization .
deviation : The difference between the expected output and the real tag , Describe the fitting ability of the learning algorithm itself .
Generalization error can be decomposed into deviation 、 variance ( Have you measured the change of learning performance caused by the change of the same size training set , The impact of data perturbation is characterized )、 And noise ( The lower bound of the expected generalization error that any learning algorithm can achieve in the current task is expressed ) The sum of the .
边栏推荐
- AI的路线和资源
- 高并发系统的限流方案研究,其实限流实现也不复杂
- C杂讲 双向循环链表
- MySQL实战优化高手07 生产经验:如何对生产环境中的数据库进行360度无死角压测?
- Time complexity (see which sentence is executed the most times)
- [paper reading notes] - cryptographic analysis of short RSA secret exponents
- 四川云教和双师模式
- 简单解决phpjm加密问题 免费phpjm解密工具
- MySQL实战优化高手04 借着更新语句在InnoDB存储引擎中的执行流程,聊聊binlog是什么?
- 西南大学:胡航-关于学习行为和学习效果分析
猜你喜欢
History of object recognition
保姆级手把手教你用C语言写三子棋
该不会还有人不懂用C语言写扫雷游戏吧
华南技术栈CNN+Bilstm+Attention
The 32 year old programmer left and was admitted by pinduoduo and foreign enterprises. After drying out his annual salary, he sighed: it's hard to choose
实现以form-data参数发送post请求
MySQL storage engine
The appearance is popular. Two JSON visualization tools are recommended for use with swagger. It's really fragrant
In fact, the implementation of current limiting is not complicated
C杂讲 动态链表操作 再讲
随机推荐
Installation of pagoda and deployment of flask project
美疾控中心:美国李斯特菌疫情暴发与冰激凌产品有关
[after reading the series] how to realize app automation without programming (automatically start Kwai APP)
Zsh configuration file
美新泽西州州长签署七项提高枪支安全的法案
MySQL实战优化高手02 为了执行SQL语句,你知道MySQL用了什么样的架构设计吗?
Ueeditor internationalization configuration, supporting Chinese and English switching
Pytorch LSTM实现流程(可视化版本)
oracle sys_ Context() function
Const decorated member function problem
C miscellaneous lecture continued
Use xtrabackup for MySQL database physical backup
MySQL实战优化高手05 生产经验:真实生产环境下的数据库机器配置如何规划?
Docker MySQL solves time zone problems
实现以form-data参数发送post请求
[programmers' English growth path] English learning serial one (verb general tense)
评估方法的优缺点
MySQL combat optimization expert 04 uses the execution process of update statements in the InnoDB storage engine to talk about what binlog is?
17 医疗挂号系统_【微信支付】
How to make shell script executable