当前位置:网站首页>VPT Model Video Explanation
VPT Model Video Explanation
2022-06-27 10:54:00 【Zhiyuan community】
Minecraft is one of the harder challenges any RL agent could face. Episodes are long, and the world is procedurally generated, complex, and huge. Further, the action space is a keyboard and a mouse, which has to be operated only given the game's video input. OpenAI tackles this challenge using Video PreTraining, leveraging a small set of contractor data in order to pseudo-label a giant corpus of scraped footage of gameplay. The pre-trained model is highly capable in basic game mechanics and can be fine-tuned much better than a blank slate model. This is the first Minecraft agent that achieves the elusive goal of crafting a diamond pickaxe all by itself.
OUTLINE:
0:00 - Intro
3:50 - How to spend money most effectively?
8:20 - Getting a large dataset with labels
14:40 - Model architecture
19:20 - Experimental results and fine-tuning
25:40 - Reinforcement Learning to the Diamond Pickaxe
30:00 - Final comments and hardware
Blog: https://openai.com/blog/vpt/
Paper: https://arxiv.org/abs/2206.11795
Code & Model weights: https://github.com/openai/Video-Pre-Training
边栏推荐
- Installation manuelle de MySQL par UBUNTU
- 隐私计算FATE-离线预测
- Deep learning in finance in cross sectional sectional predictions for random forests
- R language plot visualization: plot to visualize the two-dimensional histogram contour map, add numerical labels on the contour lines, customize the label font color, and set the mouse hover display e
- Basic violin plot in R with plot
- Uniform Asymptotics by Alexei
- Flutter wechat sharing
- 居家办公竟比去公司上班还累? | 社区征文
- torchvision. models._ utils. Intermediatelayergetter tutorial
- In the three-tier architecture, at which layer is the database design implemented, not at the data storage layer?
猜你喜欢

LVI Sam summary

【TcaplusDB知识库】TcaplusDB Tmonitor模块架构介绍

用户认证技术

21:第三章:开发通行证服务:4:进一步完善【发送短信,接口】;(在【发送短信,接口】中,调用阿里云短信服务和redis服务;一种设计思想:BaseController;)

【TcaplusDB知识库】Tmonitor后台一键安装介绍(二)

What is the experience of telecommuting in a foreign company| Community essay solicitation

以后发现漏洞,禁止告诉中国!

Glide缓存机制

如何在 Methodot 中部署 JupyterLab?
![[tcapulusdb knowledge base] Introduction to tmonitor background one click installation (I)](/img/0a/3eae294b335c120c4aabd05e4230c3.png)
[tcapulusdb knowledge base] Introduction to tmonitor background one click installation (I)
随机推荐
NVME2.0协议——新特性
mysql数据库汉字模糊查询出现异常
Go zero micro Service Practice Series (VII. How to optimize such a high demand)
Leetcode 729. 我的日程安排表 I(牛逼,已解决)
浅析基于边缘计算的移动AR实现(中)
JS all network request modes
C语言学习-Day_04
Memory compression for win10
[so official interview] Why do developers using rust love it so much
用户认证技术
Mail system (based on SMTP protocol and POP3 protocol -c language implementation)
User authentication technology
[tcapulusdb knowledge base] Introduction to tmonitor stand-alone installation guidelines (II)
[methodot topic] what kind of low code platform is more suitable for developers?
2021 CSP J2入门组 CSP-S2提高组 第2轮 视频与题解
C語言學習-Day_04
Metadata of database
Dimitt's law
Experiment notes - Convert Carmen (.Log.Clf) file to rosbag
基于swiftadmin极速后台开发框架,我制作了菜鸟教程[专业版]