当前位置:网站首页>Why can't Bert completely kill the BM25??
Why can't Bert completely kill the BM25??
2022-06-28 14:00:00 【Zhiyuan community】
In recent years , Compared with the traditional retrieval model , Large scale pre training transformers The introduction of structure has significantly improved various tasks . This promotion has special model settings on different data sets , However, it is still difficult to fully understand why and how these models can work better .
The ancients said : Enemy and know yourself , Only in this way can a hundred battles be won . Now the NN The model is not yet a confidant , How to carry out the next upgrade iteration ? Today, let's take a look at the information retrieval task , be based on Bert Compared with the traditional cross encoder BM25 What are the similarities and differences of sorting algorithms ?

Thesis title :
How Different are Pre-trained Transformers for Text Ranking?
Thesis link :
https://arxiv.org/abs/2204.07233
And the traditional word - based approach ( Such as BM25 or Query-Likelihood) comparison , Neural information retrieval has recently experienced impressive performance improvements .
Because of BERT Such models have a large number of parameters , So it can deal with sentence structures with long-range dependence and complexity .
When will BERT When applied to sorting , It can be query and doc Build deep interaction between , This allows complex patterns of association to be revealed , Not just simple term matching .
up to now ,BERT The huge performance gain achieved by the cross encoder has not been well explained .
We are right. BERT What kind of features is the model based on to calculate the matching principle of sentence relevance, and the ranking results of the model are consistent with BM25 And other traditional sparse sorting algorithms .
BERT adopt query and doc The interaction between terms directly captures correlation signals , This paper deals with BERT Cross coder (Cross-Encode, Hereinafter referred to as" CE) And BM25 Do some research on the relationship between the sorting algorithm of .
First of all, the following questions are raised :
RQ1: CE and BM25 What is the difference ?
RQ1.2: CE Whether the BM25 The same results retrieved are better sorted ?
RQ1.3: CE Can be recalled better BM25 Missing results ?
secondly , Quantify separately Exactly match and Soft matching Contribution to the overall effect , Because they constitute the most direct comparison between traditional sparse retrieval and neural retrieval matching paradigm . More specifically , The following questions need to be clarified :
RQ2: CE Whether it can reflect term perfect match ?
RQ3: CE Can find “ Impossible to be relevant ” The result of ?
边栏推荐
- Native JS implements drag and drop of page elements
- 外贸SEO 站长工具
- 增额终身寿险有哪些产品可以买呢?
- Embedded design and development project - liquid level detection and alarm system
- 由两个栈组成的队列
- 3. Overall UI architecture of the project
- 真香啊!最全的 Pycharm 常用快捷键大全!
- Unit test ci/cd
- Multi dimensional monitoring: the data base of intelligent monitoring
- How to set auto format after saving code in vscade
猜你喜欢
随机推荐
yii2编写swoole的websocket服务
其他国产手机未能填补华为的空缺,苹果在高端手机市场已无对手
To be the Italian Islander? Liuqiangdong cashed out 6.6 billion yuan in two months and made a one-time 560million "emergency transfer" to buy the European maritime Palace
欧拉恒等式:数学史上的真正完美公式!
Pytorch model
First knowledge of exception
Kubernetes 深入理解kubernetes(一)
Cat dog queue
抢做意大利岛主?刘强东两月套现66亿 疑一次性5.6亿“紧急转账”急购欧洲海上皇宫
Design artificial intelligence products: technical possibility, user acceptability and commercial feasibility
Thread life cycle and its methods
开源社邀您参加OpenInfra Days China 2022,议题征集进行中~
线程终止的 4 种方式
Jerry's wif interferes with Bluetooth [chapter]
Idea global search shortcut settings
(original) [Maui] realize "floating action button" step by step
How to open an account of Huatai Securities? How to handle the account opening is the safest
Kubernetes 深入理解Kubernetes(二) 声明组织对象
2021计算机三级数据库大题总结
PC博物馆-熟悉又陌生的懵懂年代









