当前位置:网站首页>谷歌新论文-Minerva:用语言模型解决定量推理问题
谷歌新论文-Minerva:用语言模型解决定量推理问题
2022-07-01 10:47:00 【智源社区】
定量推理是语言模型仍然远远低于人类水平的一个领域。解决数学和科学问题需要综合技能,包括用自然语言和数学符号正确解析问题、回忆相关公式和常数,以及生成涉及数值计算和符号操作的逐步解决方案。由于这些挑战,人们通常认为,使用机器学习解决定量推理问题将需要模型架构和训练技术方面的重大进步,允许模型访问外部工具,如 Python 解释器,或者可能需要更深刻的范式转变。
在“使用语言模型解决定量推理问题”(即将在 arXiv 上发布)中,我们介绍了 Minerva,一种能够使用逐步推理解决数学和科学问题的语言模型。我们表明,通过专注于收集与定量推理问题相关的训练数据、大规模训练模型以及采用一流的推理技术,我们在各种困难的定量推理任务上取得了显着的性能提升。 Minerva 通过生成包括数值计算和符号操作的解决方案来解决此类问题,而无需依赖计算器等外部工具。该模型结合使用自然语言和数学符号来解析和回答数学问题。 Minerva 结合了多种技术,包括小样本提示、思维链或暂存器提示以及多数投票,以在 STEM 推理任务上实现最先进的性能。

边栏推荐
- Matplotlib data visualization Foundation
- A new round of popularity of digital collections opens
- Is it safe to buy funds on the access letter?
- Today in history: the semiconductor war in the late 1990s; Von Neumann published the first draft; CBS acquires CNET
- How to solve the problem of SQL?
- CRC 校验
- NC | intestinal cells and lactic acid bacteria work together to prevent Candida infection
- [encounter Django] - (II) database configuration
- Ask everyone in the group about the fact that the logminer scheme of flick Oracle CDC has been used to run stably in production
- Guys, how to export iceberg data to MySQL? What tools are there? Neither sqoop nor dataX
猜你喜欢

机器学习之线性回归详解

LeetCode.515. 在每个树行中找最大值___逐一BFS+DFS+按层BFS

个人商城二开逍遥B2C商城系统源码-可商用版/拼团拼购优惠折扣秒杀源码

Kotlin coprocessor scheduling switch threads it's time to unravel the truth

Matplotlib数据可视化基础

关于#SQL#的问题,如何解决?

数字藏品新一轮热度开启

The list of winners of the digital collection of "century master" was announced
![[matytype] insert MathType inter line and intra line formulas in CSDN blog](/img/ff/871a3f06f898ed107a2a974d2c7bc4.png)
[matytype] insert MathType inter line and intra line formulas in CSDN blog

A new round of popularity of digital collections opens
随机推荐
选择在中金证券上炒股开户可以吗?安全吗?
Lack of comparator, operational amplifier to save the field! (the op amp is recorded as a comparator circuit)
Crawler (2) - requests (1) | deep parsing of requests module
JD and Tencent renewed the three-year strategic cooperation agreement; The starting salary rose to 260000 yuan! Samsung sk of South Korea competes for salary increase to retain semiconductor talents;
Design and practice of new generation cloud native database
Prism journal navigation button usability exploration record
Valgrind usage of memory leak locating tool
.NET 5.0+ 无需依赖第三方 原生实现定时任务
使用强大的DBPack处理分布式事务(PHP使用教程)
Does anyone know why? The table structure is the source table MySQL CDC that has just been directly copied
The exclusive collection of China lunar exploration project is limited to sale!
云上“视界” 创新无限 | 2022阿里云直播峰会正式上线
I'd like to know where I can open an account in Guangzhou? Is it safe to open an account online now?
Handling distributed transactions with powerful dbpack (PHP tutorial)
Have you learned the necessary global exception handler for the project
Submission lottery - light application server essay solicitation activity (may) award announcement
数据库的增删改查问题
flutter path_provider: ^2.0.10可以获取临时目录
106. construct binary tree from middle order and post order traversal sequence
基于Matlab的开环Buck降压斩波电路Simulink仿真电路模型搭建