当前位置:网站首页>Text matching - [naacl 2022] GPL
Text matching - [naacl 2022] GPL
2022-06-30 14:48:00 【User 1621453】
Address of thesis :https://arxiv.org/abs/2112.07577
《 Text matching ——【EMNLP 2021】TSDAE》 One of the disadvantages of adaptive pre training in is the high computational cost , Because you must first run the pre training on the corpus , Then supervised learning is performed on the labeled training data set . Tagged training data sets can be very large .
GPL( Unsupervised domain adaptive generation of pseudo tags for dense retrieval ) Overcome the above problems : It can be applied to fine tune the model . therefore , You can use one of the pre training models and adapt it to a specific domain :
The longer you train , The better your model is . stay V100-GPU The upper training model is about 1 God .GPL It can be combined with adaptive pre training , To further improve performance .
GPL Work in three stages :
- query Generate : For a given text in our domain , We use T5 The model generates possible for a given text query. for example , When your text is “Python is a high-level general-purpose programming language” when , The model may generate something like “What is Python” In this way query. chinese T5 Doc2Query Pre training model address :https://huggingface.co/doc2query/msmarco-chinese-mt5-base-v1
- Negative example mining : Next , For build query “What is Python”, We mine negative examples from the corpus passage, I.e query Similar but not relevant to the user passage. Such negative examples passage May be “Java is a high-level, class-based, object-oriented programming language.”.. We use dense retrieval for this kind of mining , That is, we use one of the existing text embedding models to retrieve the given query Correlation passage.
- Pseudo label : In the negative example mining step , We have retrieved information related to query Actually relevant passage( Such as “What is Python” Another definition of ). To overcome this problem , We use Cross-Encoder For all (query、passage) Rate .
Training : Once we have triples (generated query, positive passage, mined negative passage) And for (query, positive) 、 (query, negative) The score of Cross-Encoder, We can start using MarginMSELoss Training text embedding model :
The pseudo marking step is very important , With the previous method QGen(《 Text matching ——【NeurIPS 2021】BEIR》) comparison , It improves performance ,QGen take passages Deemed positive (1) Or negative (0). As we can see in the figure below , For build query (“what is futures conrtact”), Negative example mining steps retrieval and generation query Partially or highly related passages. Use MarginMSELoss and Cross-Encoder, We can identify these passages And teach the text embedding model that these paragraphs are also relevant to a given query .
The following table provides an overview GPL And adaptive pre training (MLM and TSDAE) Comparison . As mentioned earlier ,GPL It can be combined with adaptive pre training :
边栏推荐
- 左旋梯形螺纹的编程
- DefCamp Capture the Flag (D-CTF) 2021-22 web
- PS tip: the video frame to Layer command cannot be completed because dynamiclink is not available
- 文本匹配——【NAACL 2021】AugSBERT
- jsPlumb. Deleteeveryconnection is not a function & jsplumb clear canvas jsplumb delete all nodes and all connections
- 立式加工中心调试的步骤
- Finding the median of two arrays by dichotomy
- MV3 04_ Introducing Manifest V3
- Laravel upload error
- Basic learning notes of C language
猜你喜欢

val_ Loss decreases first and then increases or does not decrease but only increases

XSS challenge (6-10) more detailed answers

LIS error: this configuration section cannot be used in this path

PS tip: the video frame to Layer command cannot be completed because dynamiclink is not available

How does hbuilder display in columns?

ctfshow nodejs

Att & CK red team evaluation field (I)
![[extensive reading of papers] multimodal attribute extraction](/img/ec/546c107ac0d31deded7ca94fdf0e2d.jpg)
[extensive reading of papers] multimodal attribute extraction

ES6 notes

After the MySQL service on the local computer is started and stopped, some services will automatically stop when they are not used by other services or programs
随机推荐
Zend studio how to import an existing project
Knowledge learned from the water resources institute project
PHP recursive multi-level classification, infinite classification
Three ways and differences of defining functions in JS
JS to realize simple lottery function
How to use Alibaba Vector Icon
CCF Z-scan (full mark code + problem solving ideas) 201412-2
Laravel8 custom log directory, rename
jsPlumb. Deleteeveryconnection is not a function & jsplumb clear canvas jsplumb delete all nodes and all connections
Fastcgi CGI shallow understanding
Is it troublesome for CITIC futures to open an account? Is it safe? How much is the handling charge for opening an account for futures? Can you offer a discount
day02
Using docker to manage MySQL services under Windows
Pseudocode writing specification
How to realize selective screen recording for EV screen recording
Att & CK red team evaluation field (I)
Laravel artist command error
Greedy two-dimensional array sorting
CCF drawing (full mark code + problem solving ideas + skill summary) February 2, 2014
For loop and promise to solve the problem of concurrent callback