当前位置:网站首页>Central South University | through exploration and understanding: find interpretable features with deep reinforcement learning
Central South University | through exploration and understanding: find interpretable features with deep reinforcement learning
2022-07-03 16:28:00 【Zhiyuan community】
【 title 】Understanding via Exploration: Discovery of Interpretable Features With Deep Reinforcement Learning
【 The author team 】Jiawen Wei, Zhifeng Qiu, Fangyuan Wang, Wenwei Lin, Ning Gui, Weihua Gui
【 Date of publication 】2022.6.28
【 Thesis link 】https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=9810174
【 Recommended reasons 】 Understanding the environment through interaction has become one of the most important intellectual activities for human beings to master unknown systems . as everyone knows , Deep reinforcement learning (DRL) In many applications, effective control is achieved through human like exploration and utilization . However , Deep neural network (DNN) The opacity of often hides the key information related to control , This is essential for understanding the target system . This paper first proposes a new online feature selection framework , That is, attention feature selection based on two worlds (D-AFS) , To identify the contribution of input to the whole control process . With most DRL The world used in is different ,D-AFS It has both the real world and the distorted virtual world . Newly introduced attention based assessment (AR) The module realizes the dynamic mapping from the real world to the virtual world . The existing DRL The algorithm needs only a little modification , You can learn in the dual world . Through analysis DRL Response in two worlds ,D-AFS It can quantitatively identify the importance of each feature to control .
边栏推荐
- Develop team OKR in the way of "crowdfunding"
- Golang 装饰器模式以及在NSQ中的使用
- Cocos Creator 2. X automatic packaging (build + compile)
- Hibernate的缓存机制/会话级缓存机制
- EditText request focus - EditText request focus
- Cocos Creator 2.x 自动打包(构建 + 编译)
- Rk3399 platform development series explanation (WiFi) 5.54. What is WiFi wireless LAN
- Famous blackmail software stops operation and releases decryption keys. Most hospital IOT devices have security vulnerabilities | global network security hotspot on February 14
- Page dynamics [2]keyframes
- 记一次jar包冲突解决过程
猜你喜欢
Explore Cassandra's decentralized distributed architecture
Interviewer: how does the JVM allocate and recycle off heap memory
Slam learning notes - build a complete gazebo multi machine simulation slam from scratch (4)
拼夕夕二面:说说布隆过滤器与布谷鸟过滤器?应用场景?我懵了。。
NFT new opportunity, multimedia NFT aggregation platform okaleido will be launched soon
Deep understanding of grouping sets statements in SQL
初试scikit-learn库
[proteus simulation] 74hc595+74ls154 drive display 16x16 dot matrix
Getting started with Message Oriented Middleware
8个酷炫可视化图表,快速写出老板爱看的可视化分析报告
随机推荐
Is it safe to open a stock account by mobile registration? Does it need money to open an account
[proteus simulation] 74hc595+74ls154 drive display 16x16 dot matrix
8 tips for effective performance evaluation
Thinking about telecommuting under the background of normalization of epidemic | community essay solicitation
Develop team OKR in the way of "crowdfunding"
[combinatorics] combinatorial identities (sum of variable terms 3 combinatorial identities | sum of variable terms 4 combinatorial identities | binomial theorem + derivation to prove combinatorial ide
Mixlab编辑团队招募队友啦~~
Détails du contrôle de la congestion TCP | 3. Espace de conception
"The NTP socket is in use, exiting" appears when ntpdate synchronizes the time
PHP secondary domain name session sharing scheme
MongoDB 的安装和基本操作
Multithread 02 thread join
How to initialize views when loading through storyboards- How is view initialized when loaded via a storyboard?
[combinatorics] combinatorial identity (sum of variable upper terms 1 combinatorial identity | summary of three combinatorial identity proof methods | proof of sum of variable upper terms 1 combinator
[redis foundation] understand redis master-slave architecture, sentinel mode and cluster together (Demo detailed explanation)
PHP中register_globals参数设置
"Remake Apple product UI with Android" (3) - elegant statistical chart
Register in PHP_ Globals parameter settings
Slam learning notes - build a complete gazebo multi machine simulation slam from scratch (4)
Slam learning notes - build a complete gazebo multi machine simulation slam from scratch (I)