当前位置:网站首页>超越PaLM!北大硕士提出DiVeRSe,全面刷新NLP推理排行榜
超越PaLM!北大硕士提出DiVeRSe,全面刷新NLP推理排行榜
2022-07-05 14:48:00 【智源社区】
最近,来自北大和微软的研究人员基于自洽的新方法DiVeRSe,包含三个主要的创新点,进一步提升了模型的推理能力。

论文链接:https://arxiv.org/abs/2206.02336
代码链接:https://github.com/microsoft/DiVeRSe
第一,受到自洽方式「想法不同,答案相同」的启发,即从语言模型中采样不同的推理路径,DiVeRSe在多样性上更进一步,按照「条条大路通罗马」的理念,使用多个prompt生成答案,能够生成更完整、互补的答案。
第二,在生成推理路径时,语言模型中并不存在一种机制来纠正先前步骤中的错误,可能会导致最终预测结果的混乱。DiVeRSe借鉴verifier的思想,对每个推理路径的正确性进行验证来引导投票机制。也就是说,并非所有的推理机制都是相等重要的或都是好的。
第三,由于答案是基于多个步骤的推理而产生的,当一个路径生成一个正确的答案时,可以认为所有的步骤都对最终的正确性做出了贡献。然而,当生成一个错误的答案时,这并不意味着所有的步骤都是错误的或对错误有贡献。
边栏推荐
- Section - left closed right open
- Stm32+bh1750 photosensitive sensor obtains light intensity
- Structure - C language
- dynamic programming
- FR练习题目---综合题
- 申请代码签名证书时如何选择合适的证书品牌?
- 729. 我的日程安排表 I :「模拟」&「线段树(动态开点)」&「分块 + 位运算(分桶)」
- Differences between IPv6 and IPv4 three departments including the office of network information technology promote IPv6 scale deployment
- 【jvm】运算指令
- 【NVMe2.0b 14-9】NVMe SR-IOV
猜你喜欢

Crud de MySQL

Visual task scheduling & drag and drop | scalph data integration based on Apache seatunnel

黑马程序员-软件测试-10阶段2-linux和数据库-44-57为什么学习数据库,数据库分类关系型数据库的说明Navicat操作数据的说明,Navicat操作数据库连接说明,Navicat的基本使用,

有一个强大又好看的,赛过Typora,阿里开发的语雀编辑器

Crud of MySQL
![[summary of leetcode weekly competition] the 81st fortnight competition of leetcode (6.25)](/img/d7/f49bca8da2ce286c18508325985990.png)
[summary of leetcode weekly competition] the 81st fortnight competition of leetcode (6.25)

Security analysis of Web Architecture

计算中间件 Apache Linkis参数解读

如何将电脑复制的内容粘贴进MobaXterm?如何复制粘贴

729. My schedule I: "simulation" & "line segment tree (dynamic open point) &" block + bit operation (bucket Division) "
随机推荐
【华为机试真题详解】欢乐的周末
Topology可视化绘图引擎
Security analysis of Web Architecture
手写promise与async await
FR练习题目---综合题
Cartoon: what are the attributes of a good programmer?
Want to ask the big guy, is there any synchronization from Tencent cloud Mysql to other places? Binlog saved by Tencent cloud MySQL on cos
市值蒸发超百亿美元,“全球IoT云平台第一股”赴港求生
社区团购撤城“后遗症”
1330:【例8.3】最少步数
可视化任务编排&拖拉拽 | Scaleph 基于 Apache SeaTunnel的数据集成
MySQL之CRUD
There is a powerful and good-looking language bird editor, which is better than typora and developed by Alibaba
STM32+BH1750光敏传感器获取光照强度
Super wow fast row, you are worth learning!
Fr exercise topic --- comprehensive question
12 MySQL interview questions that you must chew through to enter Alibaba
Differences between IPv6 and IPv4 three departments including the office of network information technology promote IPv6 scale deployment
be careful! Software supply chain security challenges continue to escalate
MySQL之CRUD