当前位置:网站首页>Surpass palm! Peking University Master proposed diverse to comprehensively refresh the NLP reasoning ranking
Surpass palm! Peking University Master proposed diverse to comprehensively refresh the NLP reasoning ranking
2022-07-05 15:00:00 【Zhiyuan community】
lately , Researchers from Peking University and Microsoft based on self consistent new methods DiVeRSe, There are three main innovations , It further improves the reasoning ability of the model .
Thesis link :https://arxiv.org/abs/2206.02336
Code link :https://github.com/microsoft/DiVeRSe
First of all , Be self consistent 「 Different ideas , The answer is the same 」 Inspired by the , That is, sampling different reasoning paths from the language model ,DiVeRSe Go further in diversity , according to 「 All roads lead to Rome 」 Idea , The use of multiple prompt Generate answers , It can generate more complete 、 Complementary answers .
second , When generating reasoning paths , There is no mechanism in the language model to correct errors in previous steps , It may cause confusion in the final prediction results .DiVeRSe reference verifier Thought , Verify the correctness of each reasoning path to guide the voting mechanism . in other words , Not all reasoning mechanisms are equally important or good .
Third , Because the answer is based on multi-step reasoning , When a path generates a correct answer , It can be considered that all the steps have contributed to the final correctness . However , When a wrong answer is generated , This does not mean that all steps are wrong or contribute to the error .
边栏推荐
- Brief introduction of machine learning framework
- [JVM] operation instruction
- 市值蒸发超百亿美元,“全球IoT云平台第一股”赴港求生
- Shanghai under layoffs
- 计算中间件 Apache Linkis参数解读
- Fr exercise topic --- comprehensive question
- useMemo,memo,useRef等相关hooks详解
- Ecotone technology has passed ISO27001 and iso21434 safety management system certification
- 超级哇塞的快排,你值得学会!
- IPv6与IPv4的区别 网信办等三部推进IPv6规模部署
猜你喜欢
729. 我的日程安排表 I :「模拟」&「线段树(动态开点)」&「分块 + 位运算(分桶)」
Interview shock 62: what are the precautions for group by?
Implement a blog system -- using template engine technology
Topology visual drawing engine
PyTorch二分类时BCELoss,CrossEntropyLoss,Sigmoid等的选择和使用
Coding devsecops helps financial enterprises run out of digital acceleration
30岁汇源,要换新主人了
面试突击62:group by 有哪些注意事项?
Ctfshow web entry explosion
【NVMe2.0b 14-9】NVMe SR-IOV
随机推荐
What are the domestic formal futures company platforms in 2022? How about founder metaphase? Is it safe and reliable?
Thymeleaf uses background custom tool classes to process text
qt creater断点调试程序详解
Is it OK to open the securities account on the excavation finance? Is it safe?
当代人的水焦虑:好水究竟在哪里?
【招聘岗位】软件工程师(全栈)- 公共安全方向
Ctfshow web entry explosion
Change multiple file names with one click
Does maxcompute have SQL that can query the current storage capacity (KB) of the table?
Behind the ultra clear image quality of NBA Live Broadcast: an in-depth interpretation of Alibaba cloud video cloud "narrowband HD 2.0" technology
PyTorch二分类时BCELoss,CrossEntropyLoss,Sigmoid等的选择和使用
CPU design practice - Chapter 4 practical task 2 using blocking technology to solve conflicts caused by related problems
市值蒸发超百亿美元,“全球IoT云平台第一股”赴港求生
Long list optimized virtual scrolling
美团优选管理层变动:老将刘薇调岗,前阿里高管加盟
JS bright blind your eyes date selector
easyOCR 字符识别
两个BI开发,3000多张报表?如何做的到?
Leetcode: Shortest Word Distance II
Brief introduction of machine learning framework