当前位置:网站首页>Central South University | through exploration and understanding: find interpretable features with deep reinforcement learning
Central South University | through exploration and understanding: find interpretable features with deep reinforcement learning
2022-07-03 16:28:00 【Zhiyuan community】
【 title 】Understanding via Exploration: Discovery of Interpretable Features With Deep Reinforcement Learning
【 The author team 】Jiawen Wei, Zhifeng Qiu, Fangyuan Wang, Wenwei Lin, Ning Gui, Weihua Gui
【 Date of publication 】2022.6.28
【 Thesis link 】https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9810174
【 Recommended reasons 】 Understanding the environment through interaction has become one of the most important intellectual activities for human beings to master unknown systems . as everyone knows , Deep reinforcement learning (DRL) In many applications, effective control is achieved through human like exploration and utilization . However , Deep neural network (DNN) The opacity of often hides the key information related to control , This is essential for understanding the target system . This paper first proposes a new online feature selection framework , That is, attention feature selection based on two worlds (D-AFS) , To identify the contribution of input to the whole control process . With most DRL The world used in is different ,D-AFS It has both the real world and the distorted virtual world . Newly introduced attention based assessment (AR) The module realizes the dynamic mapping from the real world to the virtual world . The existing DRL The algorithm needs only a little modification , You can learn in the dual world . Through analysis DRL Response in two worlds ,D-AFS It can quantitatively identify the importance of each feature to control .
边栏推荐
- [redis foundation] understand redis persistence mechanism together (rdb+aof graphic explanation)
- 用通达信炒股开户安全吗?
- PHP CI (CodeIgniter) log level setting
- 高等数学(第七版)同济大学 习题2-1 个人解答
- PHP二级域名session共享方案
- Cocos Creator 2.x 自动打包(构建 + 编译)
- Nine ways to define methods in scala- Nine ways to define a method in Scala?
- 程序猿如何快速成长
- 面试之 top k问题
- Slam learning notes - build a complete gazebo multi machine simulation slam from scratch (III)
猜你喜欢

Interviewer: how does the JVM allocate and recycle off heap memory

Détails du contrôle de la congestion TCP | 3. Espace de conception

TCP擁塞控制詳解 | 3. 設計空間

跟我学企业级flutter项目:简化框架demo参考

0214-27100 a day with little fluctuation

一台服务器最大并发 tcp 连接数多少?65535?

How can technology managers quickly improve leadership?

Low level version of drawing interface (explain each step in detail)

斑马识别成狗,AI犯错的原因被斯坦福找到了

Q2 encryption market investment and financing report in 2022: gamefi becomes an investment keyword
随机推荐
14 topics for performance interviews between superiors and subordinates (4)
无心剑中译泰戈尔《漂鸟集(1~10)》
跟我学企业级flutter项目:简化框架demo参考
AcWing 第58 场周赛
June to - -------
8个酷炫可视化图表,快速写出老板爱看的可视化分析报告
Qt插件之自定义插件构建和使用
Rk3399 platform development series explanation (WiFi) 5.54. What is WiFi wireless LAN
Expression of request header in different countries and languages
Visual SLAM algorithms: a survey from 2010 to 2016
Top k questions of interview
手机注册股票开户安全吗 开户需要钱吗
[proteus simulation] 74hc595+74ls154 drive display 16x16 dot matrix
1287. Elements that appear more than 25% in an ordered array
[web security] - [SQL injection] - error detection injection
From "zero sum game" to "positive sum game", PAAS triggered the third wave of cloud computing
PyTorch 1.12发布,正式支持苹果M1芯片GPU加速,修复众多Bug
NFT新的契机,多媒体NFT聚合平台OKALEIDO即将上线
EditText request focus - EditText request focus
Golang 装饰器模式以及在NSQ中的使用