Transfer Learning - Domain Adaptation
2022-07-31 20:47:00 【Full stack programmer webmaster】
Hello everyone, good to see you again. I'm your friend, Full-Stack Jun.
Domain Adaptation
In classic machine learning problems, we usually assume that the training set and the test set follow the same distribution: we train the model on the training set and evaluate it on the test set. In practice, however, the test scenario is often uncontrollable, and the distribution of the test set can differ greatly from that of the training set. The model then performs poorly on the test set. The original text calls this an overfitting problem; more precisely, it is a distribution shift between training and test data. Take face recognition as an example: a model trained on East Asian face data and then used to recognize Western faces performs significantly worse than it does on East Asian faces. When the training and test distributions are inconsistent, a model trained on the training data by empirical risk minimization does not perform well at test time, and this is the motivation for transfer learning.
Domain adaptation is a representative method in transfer learning. It uses information-rich source domain samples to improve the performance of the target domain model. Two concepts are central to the domain adaptation problem: the source domain is a domain different from that of the test samples but with rich supervision information; the target domain is the domain the test samples come from, with no labels or only a few labels. The source domain and the target domain usually belong to the same kind of task but have different distributions. Depending on the types of the target and source domains, domain adaptation problems fall into four scenarios: unsupervised, supervised, heterogeneous distribution, and multiple source domain problems. By performing adaptation at different stages, researchers have proposed three kinds of domain adaptation methods: 1) Sample adaptation: resample the source domain samples with weights so that they approximate the distribution of the target domain. 2) Feature-level adaptation: project the source and target domains into a common feature subspace. 3) Model-level adaptation: modify the source domain error function so that it also takes the target domain error into account.
Sample adaptation:
The basic idea is to resample the source domain samples so that the resampled source samples and the target domain samples have roughly the same distribution, and then to retrain the classifier on the resampled set.
Instance-based TL
Find data in the source domain that is similar to the target domain and increase the weight of those samples, so that the reweighted source data matches the target domain data and the similar samples carry a larger share when predicting on the target domain. The advantage is that the method is simple and easy to implement. The disadvantage is that the choice of weights and the similarity measure depend on experience, and the data distributions of the source and target domains are often different.
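The reweighting idea above can be sketched in a few lines. This is a minimal illustration, not a method given in the original post: it assumes one-dimensional data, assumes each domain is roughly Gaussian, and estimates the weight of a source sample as the ratio of the two fitted densities. The function names `gaussian_pdf` and `importance_weights` are hypothetical.

```python
import math
import random
import statistics


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def importance_weights(source, target):
    """Weight each source sample by an estimate of p_target(x) / p_source(x).

    Assumption: both domains are modeled as 1-D Gaussians fitted to the data.
    The weights are normalized so that they sum to 1.
    """
    src_mean, src_std = statistics.fmean(source), statistics.pstdev(source)
    tgt_mean, tgt_std = statistics.fmean(target), statistics.pstdev(target)
    raw = [
        gaussian_pdf(x, tgt_mean, tgt_std) / gaussian_pdf(x, src_mean, src_std)
        for x in source
    ]
    total = sum(raw)
    return [w / total for w in raw]


random.seed(0)
source = [random.gauss(0.0, 1.0) for _ in range(2000)]  # source domain samples
target = [random.gauss(1.0, 1.0) for _ in range(2000)]  # shifted target domain

weights = importance_weights(source, target)

# Source samples that look like target samples now dominate the statistics.
source_mean = statistics.fmean(source)
target_mean = statistics.fmean(target)
weighted_source_mean = sum(w * x for w, x in zip(weights, source))
print(source_mean, weighted_source_mean, target_mean)
```

The weighted source mean should land much closer to the target mean than the plain source mean does. In a real pipeline the same weights would be passed to the training loss of the classifier, and the density ratio would usually be estimated with something more flexible than two Gaussians.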
Feature adaptation:
The basic idea is to learn a common feature representation. In the common feature space, the distributions of the source domain and the target domain should be as similar as possible.
Feature-based TL
Assuming that the source domain and the target domain share some common features, a feature transformation maps the features of both domains into the same space, so that the source domain data and the target domain data have the same distribution in that space. Traditional machine learning is then applied in that space. The advantage is that it is applicable to most methods and works well. The disadvantage is that the transformation is difficult to solve for and is prone to overfitting. Link: https://www.zhihu.com/question/41979241/answer/247421889
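As a rough illustration of mapping both domains into a common space, the sketch below standardizes each feature separately within each domain, which matches the per-feature mean and variance of the two domains. This is a deliberately simple stand-in chosen for this example, not the transformation the original post has in mind; the toy data and the helper name `standardize_columns` are assumptions.

```python
import statistics


def standardize_columns(rows):
    """Shift and scale every feature column to zero mean and unit variance.

    Applying this separately to each domain aligns their first- and
    second-order per-feature statistics (a very simple form of alignment).
    """
    columns = list(zip(*rows))
    means = [statistics.fmean(col) for col in columns]
    # Fall back to 1.0 for constant columns to avoid division by zero.
    stds = [statistics.pstdev(col) or 1.0 for col in columns]
    return [
        [(value - m) / s for value, m, s in zip(row, means, stds)]
        for row in rows
    ]


# Hypothetical toy data: the target features are offset and rescaled
# versions of the same underlying pattern seen in the source domain.
source = [[1.0, 10.0], [2.0, 12.0], [3.0, 14.0], [4.0, 16.0]]
target = [[11.0, 100.0], [13.0, 110.0], [15.0, 120.0], [17.0, 130.0]]

source_aligned = standardize_columns(source)
target_aligned = standardize_columns(target)

for s_row, t_row in zip(source_aligned, target_aligned):
    print(s_row, t_row)
```

After the transformation the two toy domains coincide, so a classifier trained on `source_aligned` can be applied to `target_aligned` directly. Real data rarely aligns this cleanly, which is why learned subspace or kernel-based transformations are used in practice.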
Model adaptation:
The basic idea is to adapt directly at the model level. There are two ways to do this. One is to build the model directly while adding a constraint that keeps the distance between the domains small. The other is iterative: samples classified with high confidence are added to the training set and the model is updated.
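The iterative variant can be sketched as a self-training loop. The example below is an illustration under stated assumptions, not the author's algorithm: it uses a one-dimensional nearest-centroid classifier, takes the gap between the two nearest centroid distances as the confidence score, and uses hypothetical toy data and a hypothetical threshold of 1.5.

```python
def centroids(points, labels):
    """Mean of the points of each class."""
    sums, counts = {}, {}
    for x, y in zip(points, labels):
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}


def predict(cents, x):
    """Return (label, margin); a larger margin means a more confident call."""
    dists = sorted((abs(x - c), y) for y, c in cents.items())
    return dists[0][1], dists[1][0] - dists[0][0]


def self_train(src_x, src_y, tgt_x, threshold, max_rounds=10):
    """Repeatedly move confidently classified target samples into the training set."""
    train_x, train_y = list(src_x), list(src_y)
    pool = list(tgt_x)
    for _ in range(max_rounds):
        cents = centroids(train_x, train_y)
        confident, rest = [], []
        for x in pool:
            label, margin = predict(cents, x)
            (confident if margin >= threshold else rest).append((x, label))
        if not confident:
            break  # nothing new to learn from
        for x, label in confident:
            train_x.append(x)
            train_y.append(label)
        pool = [x for x, _ in rest]
    return centroids(train_x, train_y)


# Labeled source domain and an unlabeled, shifted target domain (toy values).
src_x = [-0.5, 0.0, 0.5, 3.5, 4.0, 4.5]
src_y = [0, 0, 0, 1, 1, 1]
tgt_x = [1.0, 1.5, 2.5, 5.5, 6.0, 6.5]  # true labels: 0, 0, 0, 1, 1, 1

before = centroids(src_x, src_y)
after = self_train(src_x, src_y, tgt_x, threshold=1.5)

print([predict(before, x)[0] for x in tgt_x])  # source-only model
print([predict(after, x)[0] for x in tgt_x])   # after self-training
```

On this toy data the source-only model mislabels the target point 2.5, while the adapted centroids classify all six target points correctly. The threshold matters: if it is too low, wrong pseudo-labels enter the training set and the errors reinforce themselves.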
Model transfer (parameter-based TL)
This approach assumes that the source domain and the target domain share model parameters: a model previously trained on a large amount of source domain data is applied to the target domain for prediction. For example, suppose an image recognition system has been trained on tens of millions of images. When we face a problem in a new image domain, we do not need to collect tens of millions of images again. We only need to transfer the previously trained model to the new domain, where tens of thousands of images are often enough to reach high accuracy. The advantage is that the similarities between the models can be fully exploited. The disadvantage is that the model parameters do not converge easily.
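The benefit of starting from source-domain parameters can be seen even in a toy setting. The sketch below is an assumption-laden illustration rather than an image model: it fits a one-parameter linear model `y = w * x` by gradient descent, trains it on plentiful source data, then gives the target domain only two samples and five update steps. The data, learning rates, and the helper name `fit` are all hypothetical.

```python
def fit(xs, ys, w_init, lr, steps):
    """Gradient descent on mean squared error for the model y = w * x."""
    w = w_init
    n = len(xs)
    for _ in range(steps):
        grad = 2.0 * sum(x * (w * x - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad
    return w


# Source domain: plenty of data, true slope 2.0.
src_x = [1.0, 2.0, 3.0, 4.0, 5.0]
src_y = [2.0 * x for x in src_x]

# Target domain: a related task (true slope 2.2) with very little data.
tgt_x = [1.0, 2.0]
tgt_y = [2.2 * x for x in tgt_x]

# Train on the source domain until convergence.
w_source = fit(src_x, src_y, w_init=0.0, lr=0.01, steps=200)

# Same small budget on the target domain, with and without transfer.
w_finetuned = fit(tgt_x, tgt_y, w_init=w_source, lr=0.05, steps=5)
w_scratch = fit(tgt_x, tgt_y, w_init=0.0, lr=0.05, steps=5)

print(w_source, w_finetuned, w_scratch)
```

With the same five steps, the fine-tuned parameter ends up far closer to the target slope of 2.2 than the one trained from scratch, because the source model already starts near a good solution. Deep networks follow the same pattern at a much larger scale, typically by reusing pretrained weights and updating some or all layers on the new data.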
Note: This post draws on the strengths of many sources.
Publisher: Full-stack programmer. When reposting, please cite the source: https://javaforall.cn/127862.html
Original link: https://javaforall.cn