当前位置:网站首页>Richard Sutton, the father of reinforcement learning, paper: pursuing a general model for intelligent decision makers
Richard Sutton, the father of reinforcement learning, paper: pursuing a general model for intelligent decision makers
2022-06-24 13:16:00 【Zhiyuan community】
The idea of this paper is to strengthen and deepen this premise by putting forward views on decision makers , This view in Psychology 、 Artificial intelligence 、 economics 、 Control theory and neuroscience have substantial and extensive applications , I call it a generic model for intelligent agents . The generic model does not include anything specific to any organism 、 Anything in the world or in the field of application . The generic model does include all aspects of the decision-maker's interaction with his world ( There must be input and output , And a goal ) And the internal components of the decision maker ( For perception 、 Decision making 、 Internal assessment and world model ). I identified these aspects and components , Note that they are given different names in different disciplines , But essentially it means the same idea , The challenges and benefits of designing a neutral term that can be used across disciplines are discussed . It is time to recognize and establish the integration of multiple different disciplines on the substantive general model of intelligent agents .边栏推荐
- Configuration (enable_*) parameter related to execution plan in PG
- It's settled! Bank retail credit risk control just does it!
- "Interesting" is the competitiveness of the new era
- Cohere、OpenAI、AI21联合发布部署模型的最佳实践准则
- 一文讲透植物内生菌研究怎么做 | 微生物专题
- [live broadcast of celebrities] elastic observability workshop
- Kubernetes集群部署
- Opengauss kernel: simple query execution
- 初中级开发如何有效减少自身的工作量?
- 强化学习之父Richard Sutton论文:追寻智能决策者的通用模型
猜你喜欢

Detailed explanation of abstractqueuedsynchronizer, the cornerstone of thread synchronization

"Interesting" is the competitiveness of the new era

Yolov6: the fast and accurate target detection framework is open source

手把手教你用AirtestIDE无线连接手机!

WPF从零到1教程详解,适合新手上路

"I, an idiot, have recruited a bunch of programmers who can only" Google "

Getting started with the lvgl Library - colors and images

我真傻,招了一堆只会“谷歌”的程序员!

解析nc格式文件,GRB格式文件的依赖包edu.ucar.netcdfAll的api 学习

MySQL foreign key impact
随机推荐
YOLOv6:又快又准的目标检测框架开源啦
实现领域驱动设计 - 使用ABP框架 - 创建实体
Kubernetes cluster deployment
nifi从入门到实战(保姆级教程)——环境篇
Nifi from introduction to practice (nanny level tutorial) - environment
Ask a question about SQL view
“我这个白痴,招到了一堆只会“谷歌”的程序员!”
1. Snake game design
短信服務sms
敏捷之道 | 敏捷开发真的过时了么?
How long will it take to open a mobile account? Is online account opening safe?
CVPR 2022 | interprétation de certains documents de l'équipe technique de meituan
Use txvideoeditor to add watermark and export video card at 99%? No successful failed callback?
Concept + formula (excluding parameter estimation)
DTU上报的数据值无法通过腾讯云规则引擎填入腾讯云数据库中
Yolov6: the fast and accurate target detection framework is open source
Common special characters in JS and TS
Interesting erasure code
手机开户后多久才能通过?在线开户安全么?
实现领域驱动设计 - 使用ABP框架 - 更新操作实体