当前位置:网站首页>Surpass palm! Peking University Master proposed diverse to comprehensively refresh the NLP reasoning ranking
Surpass palm! Peking University Master proposed diverse to comprehensively refresh the NLP reasoning ranking
2022-07-05 15:00:00 【Zhiyuan community】
lately , Researchers from Peking University and Microsoft based on self consistent new methods DiVeRSe, There are three main innovations , It further improves the reasoning ability of the model .
Thesis link :https://arxiv.org/abs/2206.02336
Code link :https://github.com/microsoft/DiVeRSe
First of all , Be self consistent 「 Different ideas , The answer is the same 」 Inspired by the , That is, sampling different reasoning paths from the language model ,DiVeRSe Go further in diversity , according to 「 All roads lead to Rome 」 Idea , The use of multiple prompt Generate answers , It can generate more complete 、 Complementary answers .
second , When generating reasoning paths , There is no mechanism in the language model to correct errors in previous steps , It may cause confusion in the final prediction results .DiVeRSe reference verifier Thought , Verify the correctness of each reasoning path to guide the voting mechanism . in other words , Not all reasoning mechanisms are equally important or good .
Third , Because the answer is based on multi-step reasoning , When a path generates a correct answer , It can be considered that all the steps have contributed to the final correctness . However , When a wrong answer is generated , This does not mean that all steps are wrong or contribute to the error .
边栏推荐
- FR练习题目---简单题
- 漫画:优秀的程序员具备哪些属性?
- 【華為機試真題詳解】歡樂的周末
- What are the domestic formal futures company platforms in 2022? How about founder metaphase? Is it safe and reliable?
- Interview shock 62: what are the precautions for group by?
- Easyocr character recognition
- 30岁汇源,要换新主人了
- Stm32+bh1750 photosensitive sensor obtains light intensity
- Handwriting promise and async await
- 社区团购撤城“后遗症”
猜你喜欢
Microframe technology won the "cloud tripod Award" at the global Cloud Computing Conference!
Photoshop plug-in action related concepts actionlist actiondescriptor actionlist action execution load call delete PS plug-in development
【数组和进阶指针经典笔试题12道】这些题,满足你对数组和指针的所有幻想,come on !
当代人的水焦虑:好水究竟在哪里?
超级哇塞的快排,你值得学会!
Thymeleaf uses background custom tool classes to process text
There is a powerful and good-looking language bird editor, which is better than typora and developed by Alibaba
CPU design related notes
FR练习题目---简单题
用 Go 跑的更快:使用 Golang 为机器学习服务
随机推荐
Crud de MySQL
Cartoon: programmers don't repair computers!
Mongdb learning notes
【jvm】运算指令
FR练习题目---简单题
useMemo,memo,useRef等相关hooks详解
webRTC SDP mslabel lable
危机重重下的企业发展,数字化转型到底是不是企业未来救星
12 MySQL interview questions that you must chew through to enter Alibaba
How can I quickly check whether there is an error after FreeSurfer runs Recon all—— Core command tail redirection
【leetcode周赛总结】LeetCode第 81 场双周赛(6.25)
How to choose the appropriate certificate brand when applying for code signing certificate?
机器学习框架简述
在Pytorch中使用Tensorboard可视化训练过程
[JVM] operation instruction
百亿按摩仪蓝海,难出巨头
qt creater断点调试程序详解
Under the crisis of enterprise development, is digital transformation the future savior of enterprises
Run faster with go: use golang to serve machine learning
两个BI开发,3000多张报表?如何做的到?