当前位置:网站首页>False label aggregation
False label aggregation
2022-08-03 20:47:00 【Mark_Aussie】
Pseudo Label is a concept in semi-supervised learning, which can help models learn better from unlabeled information.
Compared with full unsupervised learning, semi-supervised learning has partial labeled data and a large amount of unlabeled data, which is more suitable for real-world and competition scenarios.
In reality, there is less labeled data and more unlabeled data;
In the competition, the training set is marked, but the test set is not marked;
Pseudo-labeling is one of the methods in semi-supervised learning. The idea is as follows: first use the existing labeled data to train the model; use the trained model to predict the unlabeled data;The predicted labels and data are added to the training set for training;

Not all unlabeled data are predicted and trained together,
If there are relatively few labeled datasets in the initial stage, there should not be too many pseudo-labels added each time;
The above process of predicting and adding training to unlabeled data is performed iteratively, not once.
Pseudo tags are not a panacea in competitions. Generally, pseudo tags are suitable for:
Unstructured data, common use of deep learning;
When the accuracy of the model is high, the added pseudo tags are accurate;
In the competition, according to Kaggle's mechanism pseudo-tags can be divided into:
Non-Kernel competition: offline pseudo-label, offline prediction, pseudo-label training, and then prediction;
Kernel competition: online pseudo-label, online prediction, pseudo-label retraining prediction, then prediction;
Pseudo tags and soft tags:
Pseudo Label predicts unlabeled data and performs secondary training;
Soft Label converts labels to discrete values for secondary training;
Soft labels are generally used in model distillation and training of some datasets, allowing the model to learn the overall class distribution of the sample.At the same time, compared with the hard label (Hard Label), the soft label can prevent the model from overfitting, and can be used together with the mixup.
Soft labels and pseudo labels are used at the same time; in the picture below, the original label of the photo is car, but the photo also has the category of person. If the hard label is used directly for training, it will bring a certain amount of model noise.The predicted probability results of the model (probability distribution of each type) can be used to replace the labels of the original pictures for training, so that the labels of the pictures are more reasonable and the model training process will be more stable.

If there is no other way to increase points in the competition, it is recommended to try pseudo tags, otherwise it is not recommended to try;
Pseudo-tags are suitable for deep learning methods, and samples with high prediction execution are generally selected for training;
Whether pseudo-tags can be used depends on the organizer's regulations;
Semi-supervised training process:
Step 1: Train the model with labeled data
Step 2: Use the trained model to predict labels for unlabeled data
Step 3: Retrain the model using both the pseudo and labeled datasets, and the model is used for final predictions on the test data.
Reference: Kaggle Knowledge Point: Pseudo Label Pseudo Label -Motian Wheel
边栏推荐
猜你喜欢

信使mRNA甲基化偶联3-甲基胞嘧啶(m3C)|mRNA-m3C

史兴国对谈于佳宁:从经济模式到落地应用,Web3的中国之路怎么走?

tidyverse based on data.table?

华为设备配置VRRP与BFD联动实现快速切换

伪标签汇总

2021年数据泄露成本报告解读

RNA-ATTO 390|RNA-ATTO 425|RNA-ATTO 465|RNA-ATTO 488|RNA-ATTO 495|RNA-ATTO 520近红外荧光染料标记核糖核酸RNA

力扣707-设计链表——链表

博士申请 | 美国明尼苏达大学葛畅教授招收隐私数据管理方向全奖博士/硕士/博后/访问学者...

9月1日起我国给予多哥等16国98%税目产品零关税待遇
随机推荐
tidyverse based on data.table?
华为设备配置VRRP负载分担
算法--交错字符串(Kotlin)
leetcode 231. 2 的幂
RNA核糖核酸修饰RNA-HiLyte FluorTM 405荧光染料|RNA-HiLyte FluorTM 405
leetcode 2119. Numbers reversed twice
leetcode 461. 汉明距离
TweenMax.js向日葵表情变化
ES6--residual parameters
Often forget HiFlow 】 【 check-in?Use tencent cloud scenario connector to remind you every day.
Markdown syntax
简单又有效的基本折线图制作方法
尚医通项目总结
Interesting opencv - record image binarization and similarity
史兴国对谈于佳宁:从经济模式到落地应用,Web3的中国之路怎么走?
8.3模拟赛总结
15 years experience in software architect summary: in the field of ML, tread beginners, five hole
【使用 Pytorch 实现入门级的人工神经网络】
error: C1083: 无法打开包括文件: “QString”: No such error: ‘QDir‘ file not found
从开发到软件测试:除了扎实的测试基础,还有哪些必须掌握 ?