Surpassing PaLM! A Peking University master's student proposes DiVeRSe, refreshing NLP reasoning leaderboards across the board
2022-07-05 15:00:00 【Zhiyuan community】
Recently, researchers from Peking University and Microsoft proposed DiVeRSe, a new method built on self-consistency. It introduces three main innovations that further improve the model's reasoning ability.

Paper: https://arxiv.org/abs/2206.02336
Code: https://github.com/microsoft/DiVeRSe
First, DiVeRSe is inspired by the self-consistency idea of "different lines of thought, same answer", that is, sampling different reasoning paths from the language model. DiVeRSe goes further on diversity: following the idea that "all roads lead to Rome", it uses multiple prompts to generate answers, which yields more complete and complementary answers.
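As a rough illustration of the "multiple prompts" idea, the sketch below builds several few-shot prompts for the same question, each from a different random subset of exemplars. The function name `build_prompts`, its parameters, the `Q:`/`A:` layout, and the exemplar pool are all assumptions made for this example; they are not taken from the paper or the DiVeRSe repository.

```python
import random

def build_prompts(exemplars, question, num_prompts=3, shots=2, seed=0):
    """Build several few-shot prompts for one question.

    Each prompt uses a different random subset of (question, answer)
    exemplars, so the model is pushed toward different reasoning paths.
    All names and the prompt layout here are illustrative assumptions.
    """
    rng = random.Random(seed)
    prompts = []
    for _ in range(num_prompts):
        chosen = rng.sample(exemplars, shots)  # sample without replacement
        demo = "\n\n".join(f"Q: {q}\nA: {a}" for q, a in chosen)
        prompts.append(f"{demo}\n\nQ: {question}\nA:")
    return prompts

# Hypothetical exemplar pool: (question, worked answer) pairs.
pool = [
    ("What is 2 + 3?", "2 + 3 = 5. The answer is 5."),
    ("What is 4 * 6?", "4 * 6 = 24. The answer is 24."),
    ("What is 10 - 7?", "10 - 7 = 3. The answer is 3."),
    ("What is 9 / 3?", "9 / 3 = 3. The answer is 3."),
]
prompts = build_prompts(pool, "What is 8 + 5?")
```

Each prompt would then be sent to the language model, with several reasoning paths sampled per prompt; that step is omitted here because it depends on the model API in use.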
Second, when generating a reasoning path, the language model has no mechanism for correcting errors made in earlier steps, which can corrupt the final prediction. DiVeRSe borrows the idea of a verifier: it checks the correctness of each reasoning path and uses the result to guide the voting mechanism. In other words, not all reasoning paths are equally important or equally good.
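A minimal sketch of verifier-guided voting, contrasted with plain majority voting, might look like the following. The scores are made-up numbers standing in for a verifier's output, and the function names are hypothetical; this is one simple way to weight votes, not necessarily the exact scheme used in the paper.

```python
from collections import Counter, defaultdict

def majority_vote(paths):
    """Pick the answer that appears in the most reasoning paths."""
    counts = Counter(answer for answer, _ in paths)
    return counts.most_common(1)[0][0]

def weighted_vote(paths):
    """Pick the answer with the largest total verifier score.

    `paths` is a list of (answer, score) pairs, where `score` is
    assumed to be a verifier's estimate that the path is correct.
    """
    totals = defaultdict(float)
    for answer, score in paths:
        totals[answer] += score
    return max(totals, key=totals.get)

# Hypothetical sampled paths: (final answer, verifier score).
paths = [("18", 0.9), ("20", 0.2), ("20", 0.3), ("18", 0.8), ("20", 0.1)]

plain = majority_vote(paths)     # "20" wins on raw count (3 of 5 paths)
guided = weighted_vote(paths)    # "18" wins on total score (1.7 vs 0.6)
```

In this toy case the two schemes disagree: three low-confidence paths outvote two high-confidence ones under majority voting, while the weighted vote favors the answer the verifier trusts more.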
Third, because the answer is produced by multi-step reasoning, when a path yields a correct answer, all of its steps can be considered to have contributed to that correctness. However, when a path yields a wrong answer, this does not mean that every step is wrong or that every step contributed to the error.