当前位置:网站首页>Detailed explanation of machine learning out of fold prediction | using out of fold prediction oof to evaluate the generalization performance of models and build integrated models
Detailed explanation of machine learning out of fold prediction | using out of fold prediction oof to evaluate the generalization performance of models and build integrated models
2022-06-29 13:46:00 【Yetingyun】
One 、 introduction
Model evaluation of machine learning algorithms usually uses Resampling technology , Such as K Crossover verification .
Machine learning models can be used K-Fold Cross validation technology to improve the prediction accuracy of the model . In the process of cross validation , The prediction is performed on the split test set that is not used for model training ( Model training has never seen it ). These forecasts are called discounted forecasts (out-of-fold predictions). External prediction plays an important role in machine learning , Sure Improve the generalization performance of the model , as well as Build an integration model .
Summarized below :
- Out of sample prediction is a kind of out of sample prediction for the data not used in the training model ;
- When predicting invisible data , Discounted forecasts are often used for model evaluation , Prove the generalization performance of the model ;
- Discounted forecasts can be used to build integrated models , It is called stack generalization or stack integration .
The following describes in detail the use of discount forecasting OOF Evaluate the generalization performance of the model and build an integrated model
Two 、 What is discount forecast ?
Use resampling techniques such as K-Fold It is a common method to evaluate the performance of machine learning algorithms on data sets .K-Fold The process includes dividing the training data set into K Group , And then use K Each of the group samples is used as a test set , The remaining samples are used as training sets . This means training and evaluating K A different model . This process can be summarized as follows :
- Randomly scrambling data sets ;
- Divide the dataset into K Group ( Sometimes, it may be necessary to divide by the distribution of labels );
- For each unique group : Use this group as a reserved data for testing , Use the remaining data set as the training set , Fit the model on the training set and evaluate it on the test set , heavy
边栏推荐
- @Table burst red
- 想做个答题类的微信小游戏?读这篇文章就够了
- 自主可控再下一城!首套国产ARTIQ架构量子计算测控系统发布
- 高校女生穿旗袍答辩!网友:导师说论文要是和旗袍一样漂亮就好了
- Tutorial on building pytoch model from zero (V) writing training process -- some basic configurations
- [cloud resident co creation] break through the performance bottleneck of image recognition through rust language computing acceleration technology
- 开户可以在网上开么?能安全吗
- Windwos10 installing sshd service
- Cvpr2022 | reexamine pooling: your receptive field is not the best
- Tree array application (acwing 24224244)
猜你喜欢

Write it down once Net analysis of a property management background service stuck

C语言内存函数

Mirror vulnerability scanner: trivy

Cvpr2022 | reexamine pooling: your receptive field is not the best

Online text filter less than specified length tool

记一次 .NET 某物管后台服务 卡死分析

使用 Gerrit + Zadig 实现主干开发主干发布(含字节飞书实践)

C语言字符函数

Cvpr2022 𞓜 thin domain adaptation

Yyds dry inventory solution sword finger offer: find the nearest common ancestor of two nodes in the binary tree
随机推荐
Redis deletion policy and eviction algorithm
WinDbg common commands
DeeCamp2022正式开营!李开复、张亚勤亲授大师课 | 创新事
思科模拟器简单校园网设计,期末作业难度
C语言字符函数
打造一个 API 快速开发平台,牛逼!
@Table burst red
*Clock in algorithm *leetcode 146 Analysis of LRU cache algorithm
华为机器学习服务语音识别功能,让应用绘“声”绘色
商品搜索引擎—推荐系统设计
Acwing game 57
The role of each part of Neural Network & thoroughly understand neural network
Aurora · Huffman tree generation (segment tree structure non pointer) (imitating adjacency table)
Xiaobai learns MySQL - incremental statistical SQL requirements - windowing function scheme
Tree array application (acwing 24224244)
How to make dapper support dateonly type
Korean AI team plagiarizes the shock academic world! One tutor with 51 students, or plagiarism recidivist
June training (day 29) - divide and rule
3个最佳实践助力企业改善供应链安全
weserver發布地圖服務