Why can't BERT completely kill BM25?
2022-06-28 14:00:00 【Zhiyuan community】
In recent years, the introduction of large-scale pre-trained transformers has significantly improved performance on a variety of tasks compared with traditional retrieval models. These gains depend on specific model settings for each dataset, however, and it remains hard to fully understand why and how these models work better.
As the old saying goes: know your enemy and know yourself, and you will win every battle. If we do not yet truly understand today's neural network models, how can we plan the next iteration? Today, let's look at the information retrieval task and ask: what are the similarities and differences between a BERT-based cross-encoder and the traditional BM25 ranking algorithm?

Paper title:
How Different are Pre-trained Transformers for Text Ranking?
Paper link:
https://arxiv.org/abs/2204.07233
Compared with traditional term-based approaches (such as BM25 or Query-Likelihood), neural information retrieval has recently seen impressive performance gains.
Because models such as BERT have a large number of parameters, they can handle long-range dependencies and complex sentence structures.
When BERT is applied to ranking, it can build deep interactions between the query and the document, which lets it uncover complex relevance patterns rather than relying on simple term matching alone.
So far, the large performance gains achieved by BERT cross-encoders have not been well explained.
We know little about which features the BERT model relies on when it computes relevance, or about how far its rankings agree with those of BM25 and other traditional sparse ranking algorithms.
BERT captures relevance signals directly through interactions between query and document terms. This paper studies the relationship between the BERT cross-encoder (Cross-Encoder, hereinafter CE) and the BM25 ranking algorithm.
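To make "simple term matching" concrete, here is a minimal sketch of BM25 scoring. It is an illustration only, not code from the paper: the function name `bm25_scores`, the toy documents, the whitespace tokenization, the parameter defaults `k1=1.2` and `b=0.75`, and the smoothed IDF variant are all assumptions chosen for this example.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document against a tokenized query with BM25."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents that contain each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue  # a query term absent from the document adds nothing
            # Smoothed IDF (assumed variant); always positive.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturates via k1; b controls length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores

# Toy corpus (hypothetical), tokenized by whitespace.
docs = [
    "bert is a pre-trained transformer".split(),
    "bm25 ranks documents by term matching".split(),
    "cooking pasta at home".split(),
]
scores = bm25_scores("term matching".split(), docs)
```

Only the second document shares terms with the query, so it is the only one with a non-zero score. A document that expresses the same idea in different words would score zero, which is exactly the gap that soft matching in a cross-encoder is meant to close.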
First, the following questions are raised:
RQ1: How do the CE and BM25 rankings differ?
RQ1.2: Does CE rank the same results retrieved by BM25 better?
RQ1.3: Is CE better at finding results that BM25 missed?
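One way to approach questions like these is to look at where the documents in CE's top positions sat in the BM25 ranking. The sketch below is a hypothetical illustration, not the paper's evaluation code: the helper names `rank_of` and `compare_rankings`, the document ids, and the cutoffs are all assumed for the example.

```python
def rank_of(ranking):
    """Map each document id to its 1-based position in a ranked list."""
    return {doc_id: pos for pos, doc_id in enumerate(ranking, start=1)}

def compare_rankings(bm25_ranking, ce_ranking, top_k=3, bm25_cutoff=3):
    """Report where CE's top-k documents were ranked by BM25."""
    bm25_rank = rank_of(bm25_ranking)
    ce_top = ce_ranking[:top_k]
    # For each CE top-k document, the position BM25 gave it (None if absent).
    origin = {d: bm25_rank.get(d) for d in ce_top}
    # Documents CE promotes into its top-k from below the BM25 cutoff.
    promoted = [
        d for d in ce_top
        if bm25_rank.get(d) is None or bm25_rank[d] > bm25_cutoff
    ]
    return origin, promoted

# Hypothetical rankings of the same five documents.
bm25_ranking = ["d1", "d2", "d3", "d4", "d5"]
ce_ranking = ["d2", "d5", "d1", "d3", "d4"]
origin, promoted = compare_rankings(bm25_ranking, ce_ranking)
```

Here `origin` shows that CE's top three came from BM25 positions 2, 5, and 1, and `promoted` contains only `d5`, the document CE pulled up from outside the BM25 top three.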
Second, the contributions of exact matching and soft matching to overall effectiveness are quantified separately, because they form the most direct contrast between the matching paradigms of traditional sparse retrieval and neural retrieval. More specifically, the following questions need to be answered:
RQ2: Does CE make use of exact term matching?
RQ3: Can CE find "impossible" relevant results?
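A possible way to separate exact from soft matching is to build two variants of each document: one that keeps only the terms shared with the query, and one that hides them. The sketch below is an assumption about how such a probe could look and should not be read as the paper's procedure; the function name `split_by_query_terms`, the `[MASK]` placeholder, and the whitespace tokenization are all hypothetical.

```python
def split_by_query_terms(query_tokens, doc_tokens, mask="[MASK]"):
    """Build two document variants for probing exact vs. soft matching."""
    query_set = {t.lower() for t in query_tokens}
    # Variant 1: keep only tokens that literally occur in the query.
    only_query = [t if t.lower() in query_set else mask for t in doc_tokens]
    # Variant 2: hide the query tokens and keep everything else.
    drop_query = [mask if t.lower() in query_set else t for t in doc_tokens]
    return only_query, drop_query

# Hypothetical query and document.
query = "bm25 ranking".split()
doc = "BM25 is a sparse ranking function".split()
only_query, drop_query = split_by_query_terms(query, doc)
```

Scoring both variants with a ranker gives a rough signal: if relevance survives in `only_query`, exact matches carry it; if it survives in `drop_query`, the model is relying on soft matches with the remaining context.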