当前位置:网站首页>Central South University | through exploration and understanding: find interpretable features with deep reinforcement learning
Central South University | through exploration and understanding: find interpretable features with deep reinforcement learning
2022-07-03 16:28:00 【Zhiyuan community】
【 title 】Understanding via Exploration: Discovery of Interpretable Features With Deep Reinforcement Learning
【 The author team 】Jiawen Wei, Zhifeng Qiu, Fangyuan Wang, Wenwei Lin, Ning Gui, Weihua Gui
【 Date of publication 】2022.6.28
【 Thesis link 】https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9810174
【 Recommended reasons 】 Understanding the environment through interaction has become one of the most important intellectual activities for human beings to master unknown systems . as everyone knows , Deep reinforcement learning (DRL) In many applications, effective control is achieved through human like exploration and utilization . However , Deep neural network (DNN) The opacity of often hides the key information related to control , This is essential for understanding the target system . This paper first proposes a new online feature selection framework , That is, attention feature selection based on two worlds (D-AFS) , To identify the contribution of input to the whole control process . With most DRL The world used in is different ,D-AFS It has both the real world and the distorted virtual world . Newly introduced attention based assessment (AR) The module realizes the dynamic mapping from the real world to the virtual world . The existing DRL The algorithm needs only a little modification , You can learn in the dual world . Through analysis DRL Response in two worlds ,D-AFS It can quantitatively identify the importance of each feature to control .
边栏推荐
- 8 cool visual charts to quickly write the visual analysis report that the boss likes to see
- Hibernate的缓存机制/会话级缓存机制
- PHP中register_globals参数设置
- Chinese translation of Tagore's floating birds (1~10)
- [solved] access denied for user 'root' @ 'localhost' (using password: yes)
- 【LeetCode】94. Middle order traversal of binary tree
- Extraction of the same pointcut
- Pointcut expression
- Rk3399 platform development series explanation (WiFi) 5.54. What is WiFi wireless LAN
- 14 topics for performance interviews between superiors and subordinates (4)
猜你喜欢
[combinatorics] non descending path problem (outline of non descending path problem | basic model of non descending path problem | non descending path problem expansion model 1 non origin starting poi
[solved] access denied for user 'root' @ 'localhost' (using password: yes)
There are several APIs of airtest and poco that are easy to use wrong in "super". See if you have encountered them
Netease UI automation test exploration: airtest+poco
Unreal_ Datatable implements ID self increment and sets rowname
线程池执行定时任务
Data driving of appium framework for mobile terminal automated testing
QT串口ui设计和解决显示中文乱码
Two sides of the evening: tell me about the bloom filter and cuckoo filter? Application scenario? I'm confused..
Mb10m-asemi rectifier bridge mb10m
随机推荐
NFT new opportunity, multimedia NFT aggregation platform okaleido will be launched soon
NSQ source code installation and operation process
面试之 top k问题
"Remake Apple product UI with Android" (3) - elegant statistical chart
[combinatorics] summary of combinatorial identities (eleven combinatorial identities | proof methods of combinatorial identities | summation methods)*
Svn usage specification
用通达信炒股开户安全吗?
From the 18th line to the first line, the new story of the network security industry
Register in PHP_ Globals parameter settings
中南大学|通过探索理解: 发现具有深度强化学习的可解释特征
Mysql 将逗号隔开的属性字段数据由列转行
Multithread 02 thread join
Record windows10 installation tensorflow-gpu2.4.0
Client does not support authentication protocol requested by server; consider upgrading MySQL client
Low level version of drawing interface (explain each step in detail)
于文文、胡夏等明星带你玩转派对 皮皮APP点燃你的夏日
8 tips for effective performance evaluation
Uploads labs range (with source code analysis) (under update)
Mongodb installation and basic operation
消息队列消息丢失和消息重复发送的处理策略