当前位置:网站首页>超越PaLM!北大硕士提出DiVeRSe,全面刷新NLP推理排行榜
超越PaLM!北大硕士提出DiVeRSe,全面刷新NLP推理排行榜
2022-07-05 14:48:00 【智源社区】
最近,来自北大和微软的研究人员基于自洽的新方法DiVeRSe,包含三个主要的创新点,进一步提升了模型的推理能力。

论文链接:https://arxiv.org/abs/2206.02336
代码链接:https://github.com/microsoft/DiVeRSe
第一,受到自洽方式「想法不同,答案相同」的启发,即从语言模型中采样不同的推理路径,DiVeRSe在多样性上更进一步,按照「条条大路通罗马」的理念,使用多个prompt生成答案,能够生成更完整、互补的答案。
第二,在生成推理路径时,语言模型中并不存在一种机制来纠正先前步骤中的错误,可能会导致最终预测结果的混乱。DiVeRSe借鉴verifier的思想,对每个推理路径的正确性进行验证来引导投票机制。也就是说,并非所有的推理机制都是相等重要的或都是好的。
第三,由于答案是基于多个步骤的推理而产生的,当一个路径生成一个正确的答案时,可以认为所有的步骤都对最终的正确性做出了贡献。然而,当生成一个错误的答案时,这并不意味着所有的步骤都是错误的或对错误有贡献。
边栏推荐
- 【jvm】运算指令
- Mysql---- function
- Matrix chain multiplication dynamic programming example
- 通过npm 或者 yarn安装依赖时 报错 出现乱码解决方式
- CPU design practice - Chapter 4 practical task 2 using blocking technology to solve conflicts caused by related problems
- Under the crisis of enterprise development, is digital transformation the future savior of enterprises
- GPS原始坐标转百度地图坐标(纯C代码)
- PyTorch二分类时BCELoss,CrossEntropyLoss,Sigmoid等的选择和使用
- 729. My schedule I: "simulation" & "line segment tree (dynamic open point) &" block + bit operation (bucket Division) "
- Share 20 strange JS expressions and see how many correct answers you can get
猜你喜欢

两个BI开发,3000多张报表?如何做的到?

面试突击62:group by 有哪些注意事项?

Photoshop插件-动作相关概念-ActionList-ActionDescriptor-ActionList-动作执行加载调用删除-PS插件开发

Select sort and bubble sort

【华为机试真题详解】字符统计及重排

黑马程序员-软件测试-10阶段2-linux和数据库-44-57为什么学习数据库,数据库分类关系型数据库的说明Navicat操作数据的说明,Navicat操作数据库连接说明,Navicat的基本使用,
![[12 classic written questions of array and advanced pointer] these questions meet all your illusions about array and pointer, come on!](/img/d2/c0a19c85b2011ecd07c9944d996c4d.png)
[12 classic written questions of array and advanced pointer] these questions meet all your illusions about array and pointer, come on!

leetcode:881. lifeboat

Talking about how dataset and dataloader call when loading data__ getitem__ () function

Penetration testing methodology
随机推荐
Mongdb learning notes
Visual task scheduling & drag and drop | scalph data integration based on Apache seatunnel
I collect multiple Oracle tables at the same time. After collecting for a while, I will report that Oracle's OGA memory is exceeded. Have you encountered it?
Is the securities account given by the head teacher of qiniu school safe? Can I open an account?
qt creater断点调试程序详解
Section - left closed right open
选择排序和冒泡排序
危机重重下的企业发展,数字化转型到底是不是企业未来救星
Install and configure Jenkins
TS所有dom元素的类型声明
Reconnaissance des caractères easycr
Crud de MySQL
mysql8.0JSON_ Instructions for using contains
easyOCR 字符识别
你童年的快乐,都是被它承包了
Super wow fast row, you are worth learning!
Total amount analysis accounting method and potential method - allocation analysis
Shanghai under layoffs
Topology visual drawing engine
漫画:优秀的程序员具备哪些属性?