当前位置:网站首页>VPT Model Video Explanation
VPT Model Video Explanation
2022-06-27 10:54:00 【Zhiyuan community】
Minecraft is one of the harder challenges any RL agent could face. Episodes are long, and the world is procedurally generated, complex, and huge. Further, the action space is a keyboard and a mouse, which has to be operated only given the game's video input. OpenAI tackles this challenge using Video PreTraining, leveraging a small set of contractor data in order to pseudo-label a giant corpus of scraped footage of gameplay. The pre-trained model is highly capable in basic game mechanics and can be fine-tuned much better than a blank slate model. This is the first Minecraft agent that achieves the elusive goal of crafting a diamond pickaxe all by itself.
OUTLINE:
0:00 - Intro
3:50 - How to spend money most effectively?
8:20 - Getting a large dataset with labels
14:40 - Model architecture
19:20 - Experimental results and fine-tuning
25:40 - Reinforcement Learning to the Diamond Pickaxe
30:00 - Final comments and hardware
Blog: https://openai.com/blog/vpt/
Paper: https://arxiv.org/abs/2206.11795
Code & Model weights: https://github.com/openai/Video-Pre-Training
边栏推荐
- mysql数据库汉字模糊查询出现异常
- mongodb跨主机数据库拷贝以及常用命令
- Test how students participate in codereview
- Support system of softswitch call center system
- Design and Simulation of direct torque control system for induction motor (motion control matlab/simulink)
- .NET6接入Skywalking链路追踪完整流程
- Institute of Microbiology, Chinese Academy of Sciences recruited 20 young PI, with a resettlement fee of 2million yuan and a start-up fund of 10million yuan (long-term effective)
- Oracle连接MySQL报错IM002
- flutter 微信分享
- C language learning day_ 05
猜你喜欢

When does the mobile phone video roll off?
![leetcode:968. Monitor the binary tree [tree DP, maintain the three states of each node's subtree, it is very difficult to think of the right as a learning, analogous to the house raiding 3]](/img/70/3954b0871cc31d24ae016eb99d871e.png)
leetcode:968. Monitor the binary tree [tree DP, maintain the three states of each node's subtree, it is very difficult to think of the right as a learning, analogous to the house raiding 3]

【TcaplusDB知识库】TcaplusDB Tmonitor模块架构介绍

红包雨: Redis 和 Lua 的奇妙邂逅

Metadata of database
![[tcapulusdb knowledge base] Introduction to tmonitor stand-alone installation guidelines (II)](/img/6d/8b1ac734cd95fb29e576aa3eee1b33.png)
[tcapulusdb knowledge base] Introduction to tmonitor stand-alone installation guidelines (II)

C any() and aii() methods

“全班29人24人成功读研”冲上热搜!剩下的5个人去哪了?

Audiotrack and audiolinker

Explain the imaging principle of various optical instruments in detail
随机推荐
闭包的常见问题
If you find any loopholes later, don't tell China!
【TcaplusDB知识库】TcaplusDB机器初始化和上架介绍
浅析基于边缘计算的移动AR实现(中)
数据库之元数据
lvi-sam 总结
微软云 (Microsoft Cloud) 技术概述
【TcaplusDB知识库】TcaplusDB机型管理介绍
三层架构中,数据库的设计在哪一层实现,不是在数据存储层吗?
Deep learning in finance in cross sectional sectional predictions for random forests
Explain the imaging principle of various optical instruments in detail
Support system of softswitch call center system
[hcie-rs review mind map] - STP
torch. utils. data. Randomsampler and torch utils. data. Differences between sequentialsampler
以后发现漏洞,禁止告诉中国!
What is the experience of telecommuting in a foreign company| Community essay solicitation
Openpyxl table reading instance
Installation manuelle de MySQL par UBUNTU
Leetcode 729. 我的日程安排表 I(提供一种思路)
What basic functions are required for live e-commerce application development? What is the future development prospect?