当前位置:网站首页>Richard Sutton, the father of reinforcement learning, paper: pursuing a general model for intelligent decision makers
Richard Sutton, the father of reinforcement learning, paper: pursuing a general model for intelligent decision makers
2022-06-24 13:16:00 【Zhiyuan community】
The idea of this paper is to strengthen and deepen this premise by putting forward views on decision makers , This view in Psychology 、 Artificial intelligence 、 economics 、 Control theory and neuroscience have substantial and extensive applications , I call it a generic model for intelligent agents . The generic model does not include anything specific to any organism 、 Anything in the world or in the field of application . The generic model does include all aspects of the decision-maker's interaction with his world ( There must be input and output , And a goal ) And the internal components of the decision maker ( For perception 、 Decision making 、 Internal assessment and world model ). I identified these aspects and components , Note that they are given different names in different disciplines , But essentially it means the same idea , The challenges and benefits of designing a neutral term that can be used across disciplines are discussed . It is time to recognize and establish the integration of multiple different disciplines on the substantive general model of intelligent agents .边栏推荐
- what the fuck! I'm flattered. He actually wrote down the answers to the redis interview questions that big companies often ask!
- The pod is evicted due to insufficient disk space of tke node
- Boss direct employment IPO: both the end and the beginning
- Parti,谷歌的自回归文生图模型
- [database] final review (planning Edition)
- Mlife forum | microbiome and data mining
- Attack popular science: DDoS
- 105. 简易聊天室8:使用 Socket 传递图片
- YOLOv6:又快又准的目标检测框架开源啦
- Design and implementation of high performance go log library zap
猜你喜欢

一文讲透植物内生菌研究怎么做 | 微生物专题

openGauss内核:简单查询的执行

线程同步的基石AbstractQueuedSynchronizer详解

How stupid of me to hire a bunch of programmers who can only "Google"!

关于被黑数据库那些事

面试官:MySQL 数据库查询慢,除了索引问题还可能是什么原因?

CVPR 2022 | 美团技术团队精选论文解读

"I, an idiot, have recruited a bunch of programmers who can only" Google "

nifi从入门到实战(保姆级教程)——环境篇

Parse NC format file and GRB format file dependent package edu ucar. API learning of netcdfall
随机推荐
C语言中常量的定义和使用
The text to voice function is available online. You can experience the services of professional broadcasters. We sincerely invite you to try it out
MySQL master-slave replication
Implement Domain Driven Design - use ABP framework - update operational entities
LVGL库入门教程 - 颜色和图像
Another prize! Tencent Youtu won the leading scientific and technological achievement award of the 2021 digital Expo
How to solve the problem that MBR does not support partitions over 2T, and lossless transfer to GPT
Optimization of MP4 file missing seconds caused by TS files when downloading videos from easydss video platform
Post processing - deep camera deformation effects
mLife Forum | 微生物组和数据挖掘
生成 4维 的 气压温度的 nc文件,之后进行代码读取(提供代码)
Use txvideoeditor to add watermark and export video card at 99%? No successful failed callback?
Attack Science: DDoS (Part 2)
Who said that "programmers are useless without computers? The big brother around me disagrees! It's true
Are you still working hard to select *? Then put away these skills
Kubernetes集群部署
How to make secruecrt more productive
[log service CLS] Tencent cloud log service CLS accesses CDN
The agile way? Is agile development really out of date?
WPF from zero to 1 tutorial details, suitable for novices on the road