当前位置:网站首页>Text matching - [naacl 2022] GPL
Text matching - [naacl 2022] GPL
2022-06-30 14:48:00 【User 1621453】
Address of thesis :https://arxiv.org/abs/2112.07577
《 Text matching ——【EMNLP 2021】TSDAE》 One of the disadvantages of adaptive pre training in is the high computational cost , Because you must first run the pre training on the corpus , Then supervised learning is performed on the labeled training data set . Tagged training data sets can be very large .
GPL( Unsupervised domain adaptive generation of pseudo tags for dense retrieval ) Overcome the above problems : It can be applied to fine tune the model . therefore , You can use one of the pre training models and adapt it to a specific domain :
The longer you train , The better your model is . stay V100-GPU The upper training model is about 1 God .GPL It can be combined with adaptive pre training , To further improve performance .
GPL Work in three stages :
- query Generate : For a given text in our domain , We use T5 The model generates possible for a given text query. for example , When your text is “Python is a high-level general-purpose programming language” when , The model may generate something like “What is Python” In this way query. chinese T5 Doc2Query Pre training model address :https://huggingface.co/doc2query/msmarco-chinese-mt5-base-v1
- Negative example mining : Next , For build query “What is Python”, We mine negative examples from the corpus passage, I.e query Similar but not relevant to the user passage. Such negative examples passage May be “Java is a high-level, class-based, object-oriented programming language.”.. We use dense retrieval for this kind of mining , That is, we use one of the existing text embedding models to retrieve the given query Correlation passage.
- Pseudo label : In the negative example mining step , We have retrieved information related to query Actually relevant passage( Such as “What is Python” Another definition of ). To overcome this problem , We use Cross-Encoder For all (query、passage) Rate .
Training : Once we have triples (generated query, positive passage, mined negative passage) And for (query, positive) 、 (query, negative) The score of Cross-Encoder, We can start using MarginMSELoss Training text embedding model :
The pseudo marking step is very important , With the previous method QGen(《 Text matching ——【NeurIPS 2021】BEIR》) comparison , It improves performance ,QGen take passages Deemed positive (1) Or negative (0). As we can see in the figure below , For build query (“what is futures conrtact”), Negative example mining steps retrieval and generation query Partially or highly related passages. Use MarginMSELoss and Cross-Encoder, We can identify these passages And teach the text embedding model that these paragraphs are also relevant to a given query .
The following table provides an overview GPL And adaptive pre training (MLM and TSDAE) Comparison . As mentioned earlier ,GPL It can be combined with adaptive pre training :
边栏推荐
- LIS error: this configuration section cannot be used in this path
- Using docker to manage MySQL services under Windows
- [extensive reading of papers] attributes guided facial image completion
- The JSON data returned from the control layer to JS has a "\" translator. How to remove it
- Add attributes to multimode
- Problem: wechat developer tool visitor mode cannot use this function
- 高清机械原理 · 机械设计经典动图
- [extensive reading of papers] a delicious recipe analysis framework for exploring multi modal recipes with variable attributes
- Learn about data kinship JSON format design from sqlflow JSON format
- [geek challenge 2019] PHP problem solving record
猜你喜欢

Svn password forgetting solution

Fastcgi CGI shallow understanding

ThinkPHP v3.2 comment annotation injection write shell

CCF date calculation (Full Score code + skill summary) February 2, 2015

Summary of use of laravel DCAT admin

Lihongyi machine learning 2020 homework summary

CCF sequence segmentation (Full Score code + problem solving idea) 201509 -1

JS to realize simple lottery function

PS dynamic drawing

Shangpinhui knowledge points of large e-commerce projects
随机推荐
2021-05-12
PHP excel export function encapsulation (based on phpexcel class)
Uniapp upload image method
浅析卧式加工中心上不规则台阶孔存在问题
Color classification of sorting
[extensive reading of papers] a delicious recipe analysis framework for exploring multi modal recipes with variable attributes
ThinkPHP show method parameter controllable command execution
CCF command line options (Full Score code + problem solving ideas + skill summary) March 3, 2014
PHP 2D array change key name
Lost connection to the flow server (0 retries remaining): |Out of retries, exiting! Error reporting solution (flow)
XSS challenge (6-10) more detailed answers
Svn password forgetting solution
Double pointer letter matching
机械工程师面试的几个问题,你能答上来几个?
PS dynamic drawing
[extensive reading of papers] multi modal sarcasm detection and human classification in code mixed conversations
Upgrade composer self update
Binary rotation array (1)
An error is reported when installing dataspherestudio doc: invalid default value for 'update_ time‘
立式加工中心调试的步骤