当前位置:网站首页>Richard Sutton, the father of reinforcement learning, paper: pursuing a general model for intelligent decision makers
Richard Sutton, the father of reinforcement learning, paper: pursuing a general model for intelligent decision makers
2022-06-24 13:16:00 【Zhiyuan community】
The idea of this paper is to strengthen and deepen this premise by putting forward views on decision makers , This view in Psychology 、 Artificial intelligence 、 economics 、 Control theory and neuroscience have substantial and extensive applications , I call it a generic model for intelligent agents . The generic model does not include anything specific to any organism 、 Anything in the world or in the field of application . The generic model does include all aspects of the decision-maker's interaction with his world ( There must be input and output , And a goal ) And the internal components of the decision maker ( For perception 、 Decision making 、 Internal assessment and world model ). I identified these aspects and components , Note that they are given different names in different disciplines , But essentially it means the same idea , The challenges and benefits of designing a neutral term that can be used across disciplines are discussed . It is time to recognize and establish the integration of multiple different disciplines on the substantive general model of intelligent agents .边栏推荐
- What if the WordPress website forgets its password
- WPF从零到1教程详解,适合新手上路
- 我真傻,招了一堆只会“谷歌”的程序员!
- Redis' contribution in the field of microservices
- Who said that "programmers are useless without computers? The big brother around me disagrees! It's true
- 初中级开发如何有效减少自身的工作量?
- Mlife forum | microbiome and data mining
- Continuous testing | key to efficient testing in Devops Era
- On the value foam of digital copyright works from the controversial nature of "Meng Hua Lu"
- 一文理解OpenStack网络
猜你喜欢

mLife Forum | 微生物组和数据挖掘

系统测试主要步骤

线程同步的基石AbstractQueuedSynchronizer详解

go Cobra命令行工具入门

一文理解OpenStack网络

Parse NC format file and GRB format file dependent package edu ucar. API learning of netcdfall

敏捷之道 | 敏捷开发真的过时了么?

go Cobra命令行工具入门

解析nc格式文件,GRB格式文件的依赖包edu.ucar.netcdfAll的api 学习

Getting started with the lvgl Library - colors and images
随机推荐
Babbitt | metauniverse daily must read: 618 scores have been announced. How much contribution has the digital collection made behind this satisfactory answer
Use terminal to activate CONDA service in pypharm (the ultimate method is definitely OK)
Creation and use of unified links in Huawei applinking
我開導一個朋友的一些話以及我個人對《六祖壇經》的一點感悟
About the hacked database
LVGL库入门教程 - 颜色和图像
How to make secruecrt more productive
Configuration (enable_*) parameter related to execution plan in PG
一文讲透植物内生菌研究怎么做 | 微生物专题
Metamask项目方给Solidity程序员的16个安全建议
Sinomeni vine was selected as the "typical solution for digital technology integration and innovative application in 2021" of the network security center of the Ministry of industry and information te
Five minutes to develop your own code generator
1、贪吃蛇游戏设计
[day ui] affix component learning
C语言中常量的定义和使用
The data value reported by DTU cannot be filled into Tencent cloud database through Tencent cloud rule engine
申请MIMIC数据库失败怎么办?从失败到成功的经验分享给你~
Concept + formula (excluding parameter estimation)
Are you still working hard to select *? Then put away these skills
Post processing - deep camera deformation effects