当前位置:网站首页>Advantages and disadvantages of evaluation methods
Advantages and disadvantages of evaluation methods
2022-07-06 10:25:00 【How about a song without trace】
1、 Over fitting : When the learner learns the training samples well , It is possible to take the characteristics of the trained samples as the general properties of all potential samples , This will lead to the decline of Pan China capability ( Generalization ability refers to the ability of the learning model to be applied to unknown samples ).
2、 Under fitting : Low learning ability , I think the general characteristics are all characteristics .
Evaluation methods :
- Set aside method : If the training set contains the vast majority of samples , Then the trained sample may be close to the desired training model , But because of the small test set , The assessment results may not be accurate enough , The pattern of basic partitioned data sets :2:1,4:1 The front is used for training , The latter is used for testing .
- Cross validation : Equal division , Stratified sampling , Take the mean , The defect is : Large data sets are too expensive , Spend more time .
- Self help law : Loop from the overall data into the sample , Put it back again , The final initial data are 0.368 The sample of does not appear , Used for testing . The self-help method can be used to test from the samples that appear in the initial data set , Such a test is also known as out of package estimation . advantage : The self-help method is smaller in the data set , It's hard to divide training effectively \ Test sets are useful , Multiple different training sets can be generated from the initial data set , shortcoming : But it changes the distribution of data sets , This will introduce Estimated deviation .
But when the initial data volume is enough , Set aside method and cross validation method are more commonly used .
Participate in the final parameter model :
General rules of parameter adjustment : Select a range and a varying step size for each parameter , This is a compromise between computational overhead and performance .
Performance metrics : Measure the pan China capability of the model , Performance depends not only on Algorithms and data , It also determines mission requirements .
The most commonly used performance measure for regression tasks : Mean square error .
Recall rate (TP/(TP+FN))、 Precision rate (TP/(TP+FP)):TP Real examples FP False positive example TN True counter example FN False counter example .
F1 It is based on the harmonic average of recall and precision :2*TP/( Total number of samples +TP-TN)
ROC: Characteristics of test work . The horizontal axis TPR( Real examples )=TP/(TP+FN), The vertical axis FPR( False positive example ):FP/(TN+FP).
Normalization : Map values from different ranges of variation to the same fixed range , Common is [0,1], Also known as normalization .
deviation : The difference between the expected output and the real tag , Describe the fitting ability of the learning algorithm itself .
Generalization error can be decomposed into deviation 、 variance ( Have you measured the change of learning performance caused by the change of the same size training set , The impact of data perturbation is characterized )、 And noise ( The lower bound of the expected generalization error that any learning algorithm can achieve in the current task is expressed ) The sum of the .
边栏推荐
- MySQL ERROR 1040: Too many connections
- Routes and resources of AI
- Super detailed steps to implement Wechat public number H5 Message push
- 软件测试工程师必备之软技能:结构化思维
- 高并发系统的限流方案研究,其实限流实现也不复杂
- 基于Pytorch的LSTM实战160万条评论情感分类
- text 文本数据增强方法 data argumentation
- Security design verification of API interface: ticket, signature, timestamp
- [unity] simulate jelly effect (with collision) -- tutorial on using jellysprites plug-in
- 好博客好资料记录链接
猜你喜欢

实现以form-data参数发送post请求

UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xd0 in position 0成功解决

16 医疗挂号系统_【预约下单】

如何让shell脚本变成可执行文件

History of object recognition

Target detection -- yolov2 paper intensive reading

MySQL combat optimization expert 04 uses the execution process of update statements in the InnoDB storage engine to talk about what binlog is?

Solve the problem of remote connection to MySQL under Linux in Windows

基于Pytorch的LSTM实战160万条评论情感分类

Implement context manager through with
随机推荐
Write your own CPU Chapter 10 - learning notes
What should the redis cluster solution do? What are the plans?
Use xtrabackup for MySQL database physical backup
[paper reading notes] - cryptographic analysis of short RSA secret exponents
Super detailed steps to implement Wechat public number H5 Message push
Target detection -- yolov2 paper intensive reading
Not registered via @EnableConfigurationProperties, marked(@ConfigurationProperties的使用)
ByteTrack: Multi-Object Tracking by Associating Every Detection Box 论文阅读笔记()
The governor of New Jersey signed seven bills to improve gun safety
cmooc互联网+教育
软件测试工程师必备之软技能:结构化思维
MySQL combat optimization expert 07 production experience: how to conduct 360 degree dead angle pressure test on the database in the production environment?
C miscellaneous dynamic linked list operation
Redis集群方案应该怎么做?都有哪些方案?
Ueeditor internationalization configuration, supporting Chinese and English switching
C杂讲 浅拷贝 与 深拷贝
Complete web login process through filter
AI的路线和资源
Contest3145 - the 37th game of 2021 freshman individual training match_ C: Tour guide
Installation of pagoda and deployment of flask project