Paper by Richard Sutton, the father of reinforcement learning: pursuing a general model of the intelligent decision maker
2022-06-24 13:16:00 【Zhiyuan community】
The idea of this paper is to sharpen and deepen this premise by proposing a view of the decision maker that is substantive and widely held across psychology, artificial intelligence, economics, control theory, and neuroscience, which I call the general model of the intelligent agent. The general model does not include anything specific to any organism, world, or application domain. It does include the aspects of the decision maker's interaction with its world (there must be input and output, and a goal) and the internal components of the decision maker (for perception, decision making, internal evaluation, and a world model). I identify these aspects and components, note that they are given different names in different disciplines but refer essentially to the same ideas, and discuss the challenges and benefits of devising a neutral terminology that can be used across disciplines. It is time to recognize and build on the convergence of multiple disciplines on a substantive general model of the intelligent agent.
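To make the four internal components named in the abstract concrete, here is a minimal sketch in Python. It is not taken from the paper: the class name `Agent`, its method names, the tabular representation, and the one-step temporal-difference update are all assumptions chosen only for illustration.

```python
import random
from collections import defaultdict


class Agent:
    """Hypothetical decision maker with four internal components.

    All names here are illustrative assumptions, not the paper's notation.
    """

    def __init__(self, actions, step_size=0.1, discount=0.9, seed=0):
        self.actions = list(actions)
        self.step_size = step_size
        self.discount = discount
        self.rng = random.Random(seed)
        # Internal evaluation: state -> estimated long-term reward.
        self.value = defaultdict(float)
        # World model: (state, action) -> (reward, next_state) last seen.
        self.model = {}

    def perceive(self, observation):
        # Perception: this sketch simply treats the observation as the state.
        return observation

    def decide(self, state):
        # Decision making: prefer the action whose modelled outcome scores
        # highest; actions never tried score 0.0. Ties are broken at random.
        def score(action):
            if (state, action) in self.model:
                reward, next_state = self.model[(state, action)]
                return reward + self.discount * self.value[next_state]
            return 0.0

        best = max(score(a) for a in self.actions)
        return self.rng.choice([a for a in self.actions if score(a) == best])

    def learn(self, state, action, reward, next_state):
        # Update the world model with the observed transition.
        self.model[(state, action)] = (reward, next_state)
        # Update the evaluation with a one-step temporal-difference target
        # (an assumed learning rule, used here only as a stand-in).
        target = reward + self.discount * self.value[next_state]
        self.value[state] += self.step_size * (target - self.value[state])


agent = Agent(actions=[0, 1])
state = agent.perceive("s0")
agent.learn(state, 1, 1.0, "s1")  # action 1 was rewarded
agent.learn(state, 0, 0.0, "s0")  # action 0 was not
print(agent.decide(state))
```

The input (observation and reward), output (action), and goal (accumulating reward) correspond to the interaction aspects in the abstract; everything else about the world is left unspecified, which is the sense in which the model is general.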