Robot reinforcement learning -- first person vs third person
2022-06-29 04:48:00 【Qianyu QY】
While reading 17 robot reinforcement learning papers, I noticed that the ones using imitation learning differ in how they present demonstration data: some use the first person, others the third person. My understanding is that the two terms describe the position of the camera.
The paper THIRD-PERSON IMITATION LEARNING explains the distinction:
First person: the agent is provided with a sequence of states and a specification of the actions that it should have taken.
Third person: they observe other humans perform tasks, infer the task, and accomplish the same task themselves.
Informally: in the first person, the agent watches its own demonstration, so the observations and actions in the demonstration trajectory are the same as those at test time. In the third person, the agent watches someone else demonstrate, so the observations and actions in the demonstration trajectory must be converted to the first person.
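The distinction can be sketched as a data layout. This is only a minimal illustration, not code from any of the papers: the names `Step`, `Demonstration`, and `behavior_cloning_pairs` are hypothetical, and treating third-person data as carrying no directly usable actions is an assumption drawn from the informal description above.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple


@dataclass
class Step:
    # What the camera recorded at this time step (hypothetical feature vector).
    observation: Sequence[float]
    # The agent's own action; None when only another demonstrator was observed.
    action: Optional[Sequence[float]]


@dataclass
class Demonstration:
    # Hypothetical labels: "first_person" or "third_person".
    viewpoint: str
    steps: List[Step]


def behavior_cloning_pairs(
    demo: Demonstration,
) -> List[Tuple[Sequence[float], Sequence[float]]]:
    """Return (observation, action) pairs that can be imitated directly.

    Only first-person demonstrations qualify, because their observations and
    actions already match what the agent sees and does at test time.
    """
    if demo.viewpoint != "first_person":
        raise ValueError(
            "third-person demonstrations must first be converted to the "
            "agent's own observations and actions"
        )
    return [(step.observation, step.action) for step in demo.steps]
```

A first-person demonstration passes straight through, while a third-person one is rejected until some conversion step (not shown here) maps it into the agent's own viewpoint.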