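Surpassing PaLM! Peking University Master's Student Proposes DiVeRSe, Setting New Records Across NLP Reasoning Leaderboards
2022-07-05 14:59:00 [BAAI Community (智源社區)]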
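Recently, researchers from Peking University and Microsoft proposed DiVeRSe, a new method built on self-consistency. It introduces three main innovations that further improve a model's reasoning ability.
Paper: https://arxiv.org/abs/2206.02336
Code: https://github.com/microsoft/DiVeRSe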
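First, DiVeRSe draws on the self-consistency idea of "different thoughts, same answer," that is, sampling different reasoning paths from the language model. It pushes diversity a step further: following the notion that "all roads lead to Rome," it uses multiple prompts to generate answers, which yields more complete and complementary answers.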
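To make the multi-prompt idea concrete, here is a minimal sketch of building several few-shot prompts for the same question, each from a randomly drawn subset of exemplars. The function name `build_prompts`, its parameters, the `Q:`/`A:` layout, and the toy exemplars are all assumptions made for illustration; they are not taken from the DiVeRSe code base, and the call to an actual language model is left out.

```python
import random

def build_prompts(exemplars, question, num_prompts=3, shots=2, seed=0):
    """Build several few-shot prompts for one question.

    Each prompt uses a randomly drawn subset of `shots` exemplars, so the
    model sees the question in different contexts. (Hypothetical helper.)
    """
    rng = random.Random(seed)
    prompts = []
    for _ in range(num_prompts):
        chosen = rng.sample(exemplars, shots)
        body = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in chosen)
        prompts.append(f"{body}\n\nQ: {question}\nA:")
    return prompts

# Toy exemplars with worked reasoning (illustrative only).
exemplars = [
    ("What is 2 + 3?", "2 + 3 = 5. The answer is 5."),
    ("What is 4 * 6?", "4 * 6 = 24. The answer is 24."),
    ("What is 10 - 7?", "10 - 7 = 3. The answer is 3."),
    ("What is 9 / 3?", "9 / 3 = 3. The answer is 3."),
]

prompts = build_prompts(exemplars, "What is 7 + 8?")
```

Each prompt would then be sent to the language model, with several reasoning paths sampled per prompt, so that diversity comes from both the prompts and the sampling.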
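Second, when generating a reasoning path, the language model has no mechanism for correcting errors made in earlier steps, which can throw off the final prediction. DiVeRSe borrows the idea of a verifier: it checks the correctness of each reasoning path and uses the result to guide the voting mechanism. In other words, not all reasoning paths are equally important or equally good.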
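The difference between plain majority voting and verifier-guided voting can be sketched as follows. This is only an illustration under stated assumptions: the `(answer, score)` pairs are made-up values standing in for the output of a trained verifier, and the function names `majority_vote` and `weighted_vote` are hypothetical, not the project's API.

```python
from collections import defaultdict

def majority_vote(paths):
    """Pick the answer reached by the largest number of reasoning paths."""
    counts = defaultdict(int)
    for answer, _score in paths:
        counts[answer] += 1
    return max(counts, key=counts.get)

def weighted_vote(paths):
    """Pick the answer with the largest total verifier score."""
    totals = defaultdict(float)
    for answer, score in paths:
        totals[answer] += score
    return max(totals, key=totals.get)

# Each entry: (final answer of a sampled path, assumed verifier score in [0, 1]).
paths = [("18", 0.9), ("18", 0.8), ("20", 0.2), ("20", 0.3), ("20", 0.1)]

print(majority_vote(paths))  # "20": three paths agree, regardless of quality
print(weighted_vote(paths))  # "18": its two paths carry a higher total score
```

In this toy case the majority answer comes from three low-scoring paths, while the weighted vote favors the two paths the verifier trusts more.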
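Third, because an answer is produced through multiple reasoning steps, when a path yields a correct answer, all of its steps can be regarded as contributing to that correctness. When a path yields a wrong answer, however, that does not mean every step was wrong or contributed to the error.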