OR Talk No.19 | Dr. Tian Yuandong of Facebook: Black-box optimization with latent actions based on Monte Carlo tree search
2020-11-08 11:21:00 【osc_4eht81t7】
Talk outline
Topic: "Black-box optimization with latent actions based on Monte Carlo tree search"
Speaker: Dr. Tian Yuandong
Time: Saturday, November 7, 2020, 10:00 a.m. Beijing time
Venue: the "Operational research OR A strategy" Bilibili live studio
Link: live.bilibili.com/21459168
Introduction
Recently, Dr. Tian Yuandong of Facebook AI Lab, together with Wang Linnan of Brown University and his advisor Rodrigo Fonseca, published a paper on black-box optimization (arXiv:2007.00708). It proposes a new black-box optimization method called La-MCTS (Latent Action Monte Carlo Tree Search). A latent action (La) here is a split made at the current node of the search tree: the node's region of the search space is divided into a good subspace (the left child) and a bad subspace (the right child).
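The idea of a latent action can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not the paper's implementation: samples are split at the median function value, and a nearest-centroid rule stands in for the learned boundary. The names `split_good_bad`, `make_latent_action`, and `shifted_sphere` are hypothetical.

```python
def shifted_sphere(x):
    """Toy objective to minimize; its optimum is at (3, 3)."""
    return sum((xi - 3.0) ** 2 for xi in x)

def split_good_bad(samples):
    """Split (x, f(x)) pairs at the median value; lower values are better."""
    ordered = sorted(samples, key=lambda s: s[1])
    half = len(ordered) // 2
    return ordered[:half], ordered[half:]

def centroid(samples):
    """Mean of the sample locations."""
    dim = len(samples[0][0])
    return [sum(x[i] for x, _ in samples) / len(samples) for i in range(dim)]

def make_latent_action(samples):
    """Return a predicate: True means 'go to the left (good) child'.

    Assumption: a nearest-centroid rule replaces the learned boundary.
    """
    good, bad = split_good_bad(samples)
    c_good, c_bad = centroid(good), centroid(bad)

    def goes_left(x):
        d_good = sum((a - b) ** 2 for a, b in zip(x, c_good))
        d_bad = sum((a - b) ** 2 for a, b in zip(x, c_bad))
        return d_good <= d_bad

    return goes_left

# Evaluate the objective on an 11 x 11 grid over [-5, 5]^2.
points = [[float(i), float(j)] for i in range(-5, 6) for j in range(-5, 6)]
samples = [(p, shifted_sphere(p)) for p in points]
goes_left = make_latent_action(samples)
```

Applying such a split recursively to the samples that fall in each child gives a tree whose leaves are progressively smaller regions of the search space.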
Traditional Monte Carlo tree search is given a state space S, an action space A, and a state transition function (S × A → S'). It uses the rewards obtained from past actions to guide the search, looking for the action sequence that yields the largest reward. Black-box optimization, which starts from a good starting point and searches for the optimal solution, can be modeled in the same way.
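As a rough illustration of how past rewards guide the search, the sketch below uses one common form of the UCB1 score to choose which child of a node to visit next. The talk announcement does not name a selection rule, so UCB1 is an assumption here, as are the names `ucb1` and `select_child` and the exploration constant `c`.

```python
import math

def ucb1(total_reward, visits, parent_visits, c=1.4):
    """Average reward plus an exploration bonus that shrinks with visits."""
    if visits == 0:
        return float("inf")  # always try unvisited children first
    mean = total_reward / visits
    return mean + c * math.sqrt(math.log(parent_visits) / visits)

def select_child(children):
    """Return the index of the child with the highest UCB1 score."""
    parent_visits = sum(ch["visits"] for ch in children)
    scores = [ucb1(ch["reward"], ch["visits"], parent_visits) for ch in children]
    return scores.index(max(scores))

children = [
    {"reward": 9.0, "visits": 10},
    {"reward": 1.0, "visits": 1},
    {"reward": 0.0, "visits": 0},
]
```

The bonus term makes rarely visited children attractive even when their average reward is modest, which balances exploring new branches against exploiting known good ones.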
There is, however, a key difference from traditional reinforcement learning: in black-box optimization the action space can be specified arbitrarily, as long as it helps the search for the optimal solution. LaMCTS takes advantage of this and improves search efficiency by automatically learning the structure of the action space.
LaMCTS is a meta-algorithm: it partitions the space with nonlinear functions and can be layered on top of any existing black-box optimization algorithm, such as Bayesian Optimization (BO). It confines the modeling of the high-dimensional Gaussian process to a relatively small region, so that the optimal solution within a leaf node's subregion is found more quickly. In practice, black-box optimization is often used when function evaluations are expensive and derivative information is unavailable, for example when the function value is the average efficiency of a complex system after a day of operation, or the result of a very costly experiment. Reducing the number of samples needed to find the optimal solution can therefore greatly reduce cost.
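The meta-algorithm idea, running an inner optimizer only inside a selected subregion, can be sketched as follows. This is a toy under stated assumptions: plain random search stands in for Bayesian Optimization, the region is a hand-written predicate rather than a learned partition, and `sample_in_region` and `optimize_in_region` are hypothetical names.

```python
import random

def sample_in_region(bounds, in_region, rng, max_tries=10_000):
    """Rejection sampling: draw uniformly in the box until the region accepts."""
    for _ in range(max_tries):
        x = [rng.uniform(lo, hi) for lo, hi in bounds]
        if in_region(x):
            return x
    raise RuntimeError("region too small for rejection sampling")

def optimize_in_region(f, bounds, in_region, budget, seed=0):
    """Minimize f using only points inside the region.

    Assumption: random search replaces the inner optimizer (e.g. BO).
    """
    rng = random.Random(seed)
    best_x, best_y = None, float("inf")
    for _ in range(budget):
        x = sample_in_region(bounds, in_region, rng)
        y = f(x)  # each call models one expensive evaluation
        if y < best_y:
            best_x, best_y = x, y
    return best_x, best_y

def shifted_sphere(x):
    """Toy objective to minimize; its optimum is at (3, 3)."""
    return sum((xi - 3.0) ** 2 for xi in x)

def upper_right(x):
    """Hand-written stand-in for a promising leaf region."""
    return x[0] > 0 and x[1] > 0

bounds = [(-5.0, 5.0), (-5.0, 5.0)]
best_x, best_y = optimize_in_region(shifted_sphere, bounds, upper_right, budget=50)
```

Because every evaluation is spent inside the promising region, the same budget covers it far more densely than sampling the whole box would.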
LaMCTS has been accepted at NeurIPS 2020. The source code of the algorithm has been published on GitHub.
(https://github.com/facebookresearch/LaMCTS)
In this live broadcast, Dr. Tian will explain the background and content of the paper in detail.
About the speaker
Dr. Tian Yuandong is a researcher and manager at Facebook AI Research. His research covers deep reinforcement learning, multi-agent learning and their applications in games, and the theoretical analysis of deep learning models. He was the research and engineering lead and first author of the open-source Go projects DarkForest and ELF OpenGo. From 2013 to 2014 he worked as a software engineer on Google's self-driving car team. He received his bachelor's and master's degrees from Shanghai Jiao Tong University in 2005 and 2008, and his doctorate from the Robotics Institute at Carnegie Mellon University in 2013. He received a Marr Prize Honorable Mention at the 2013 International Conference on Computer Vision (ICCV).
