当前位置:网站首页>机器人强化学习——第一人称 VS 第三人称
机器人强化学习——第一人称 VS 第三人称
2022-06-29 04:44:00 【千羽QY】
看17年的机器人强化学习论文,采用模仿学习时,有的论文采用第一人称演示数据,有的论文采用第三人称演示数据,初看第一人称和第三人称,以为是描述相机的位置。
在 THIRD-PERSON IMITATION LEARNING 这篇论文中提到了解释:
第一人称:the agent is provided with a sequence of states and a specification of the actions that it should have taken.
第三人称:they observe other humans perform tasks, infer the task, and accomplish the same task themselves.
通俗地讲:第一人称是看自己演示,演示轨迹中的观测和action与测试时一样;第三人称是看别人演示,演示轨迹中的观测和action需要转换到第一人称。
边栏推荐
- Technical specifications of Tektronix tds3054b oscilloscope
- BERT和ViT简介
- Software architecture experiment summary
- Webapck system foundation
- JVM内存调优方式
- Open source demo| you draw and I guess -- make your life more interesting
- Direct derivation of Bessel function with MATLAB
- 20年秦皇岛D - Exam Results(二分+思维,附易错数据)
- Blue Bridge Cup ruler method
- What if modstart forgets the background user or password?
猜你喜欢
![[hackthebox] dancing (SMB)](/img/bb/7bf81004b9cee80ae49bb0c0c2b810.png)
[hackthebox] dancing (SMB)

直播预约|AWS Data Everywhere 系列活动

ROS URDF model is parsed into KDL tree
![[Verilog quick start of Niuke network question brushing series] ~ asynchronous reset Series T trigger](/img/e3/cf40fb0131ddeb26bc5beeca03d183.png)
[Verilog quick start of Niuke network question brushing series] ~ asynchronous reset Series T trigger

How to display all MySQL databases
![[structural mechanics] the reason why the influence line under joint load is different from that under direct load](/img/a6/fce0bb29cc5c84bc0ef20501617e06.png)
[structural mechanics] the reason why the influence line under joint load is different from that under direct load

It is said on the Internet that a student from Guangdong has been admitted to Peking University for three times and earned a total of 2million yuan in three years

How to solve startup failure due to insufficient MySQL memory

Live broadcast appointment AWS data everywhere series activities

笔记本访问台式机的共享磁盘
随机推荐
Common optimization items
【代码随想录-哈希表】T15、三数之和-双指针+排序
Redis cache penetration, cache breakdown, cache avalanche
Direct derivation of Bessel function with MATLAB
从零到一,教你搭建「以文搜图」搜索服务(一)
Daily practice - February 15, 2022
Gocd is good, but talk about Jenkins
See how I do it step by step (I)
What are the basic usage methods of MySQL
February 14 institutional dragon and tiger list and operation of well-known hot money
Memo pattern
Research Report on the overall scale, major manufacturers, major regions, products and applications of power battery laser welding machines in the global market in 2022
Collection of common terms used in satellite navigation
IDENTITY
Command pattern
Five thousand years of China
ROS URDF model is parsed into KDL tree
[hackthebox] dancing (SMB)
Installation and configuration of interrealsense d435i camera driver
How to use the select statement of MySQL