当前位置:网站首页>超越PaLM!北大硕士提出DiVeRSe,全面刷新NLP推理排行榜
超越PaLM!北大硕士提出DiVeRSe,全面刷新NLP推理排行榜
2022-07-05 14:48:00 【智源社区】
最近,来自北大和微软的研究人员基于自洽的新方法DiVeRSe,包含三个主要的创新点,进一步提升了模型的推理能力。

论文链接:https://arxiv.org/abs/2206.02336
代码链接:https://github.com/microsoft/DiVeRSe
第一,受到自洽方式「想法不同,答案相同」的启发,即从语言模型中采样不同的推理路径,DiVeRSe在多样性上更进一步,按照「条条大路通罗马」的理念,使用多个prompt生成答案,能够生成更完整、互补的答案。
第二,在生成推理路径时,语言模型中并不存在一种机制来纠正先前步骤中的错误,可能会导致最终预测结果的混乱。DiVeRSe借鉴verifier的思想,对每个推理路径的正确性进行验证来引导投票机制。也就是说,并非所有的推理机制都是相等重要的或都是好的。
第三,由于答案是基于多个步骤的推理而产生的,当一个路径生成一个正确的答案时,可以认为所有的步骤都对最终的正确性做出了贡献。然而,当生成一个错误的答案时,这并不意味着所有的步骤都是错误的或对错误有贡献。
边栏推荐
- webRTC SDP mslabel lable
- Cartoon: what are the attributes of a good programmer?
- Interview shock 62: what are the precautions for group by?
- 机器学习框架简述
- Dark horse programmer - software testing -10 stage 2-linux and database -44-57 why learn database, description of database classification relational database, description of Navicat operation data, de
- Two Bi development, more than 3000 reports? How to do it?
- Shanghai under layoffs
- JMeter performance test: serveragent resource monitoring
- Handwriting promise and async await
- CPU设计相关笔记
猜你喜欢

Talking about how dataset and dataloader call when loading data__ getitem__ () function

Security analysis of Web Architecture

机器学习笔记 - 灰狼优化

浅谈Dataset和Dataloader在加载数据时如何调用到__getitem__()函数

Two Bi development, more than 3000 reports? How to do it?

一键更改多个文件名字

安装配置Jenkins

Machine learning notes - gray wolf optimization

Interview shock 62: what are the precautions for group by?

Niuke: intercepting missiles
随机推荐
Under the crisis of enterprise development, is digital transformation the future savior of enterprises
选择排序和冒泡排序
What about SSL certificate errors? Solutions to common SSL certificate errors in browsers
Fr exercise topic --- comprehensive question
Pointer operation - C language
基于TI DRV10970驱动直流无刷电机
开挖财上的证券账户可以吗?安全吗?
JS bright blind your eyes date selector
Niuke: intercepting missiles
CODING DevSecOps 助力金融企业跑出数字加速度
Crud de MySQL
Install and configure Jenkins
[JVM] operation instruction
CPU design practice - Chapter 4 practical task 2 using blocking technology to solve conflicts caused by related problems
裁员下的上海
30岁汇源,要换新主人了
Want to ask the big guy, is there any synchronization from Tencent cloud Mysql to other places? Binlog saved by Tencent cloud MySQL on cos
MySQL之CRUD
机器学习框架简述
Longest common subsequence dynamic programming