Paper manuals become 3D animations in seconds: the latest research from Stanford's Jiajun Wu, accepted to ECCV 2022
2022-07-31 14:17:00 【QbitAl】
By Yi Ge, reporting from Aofeisi
QbitAI (量子位) | WeChat official account: QbitAI
Any friends out there with a poor sense of space, who never know where to start when handed a Lego instruction manual?
This time, Lego instructions can move!
Jiajun Wu, an alumnus of Tsinghua University's Yao Class and an assistant professor at Stanford, led a team that developed a technique for turning paper instruction manuals into 3D animations. The paper has been accepted to ECCV 2022, a top computer vision conference.
After seeing the demo, one netizen exclaimed: this is a great help to Lego lovers of all ages!
3D animated instructions
Although Lego manuals are written by professional designers, it has to be said that for people with weaker spatial imagination, a 3D animation is far more appealing.
The conversion looks easy, but there are two technical difficulties behind it.
The first is how to project the 2D images printed on paper into a 3D animation.
The team's approach is to decompose the task into a sequence of short steps that can be executed smoothly and efficiently. A model translates the images in the manual into a plan that a machine can interpret, which simplifies the learning task.
As the accompanying figure shows, turning panel (a) into panel (c) requires extracting the position of each part from the manual image in order to build the finished product.
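To make the idea of a machine-interpretable plan concrete, here is a minimal sketch of what such a plan could look like as a data structure. It is an illustration only: the names `PlacedPart`, `Step`, and `execute_plan`, the part identifiers, and the grid-coordinate convention are all hypothetical and are not taken from the paper or its code.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class PlacedPart:
    """One part with its pose. All fields are hypothetical conventions."""
    part_id: str                    # made-up part name, e.g. "brick_2x4"
    position: Tuple[int, int, int]  # grid coordinates (x, y, z), z pointing up
    rotation_deg: int               # rotation around the vertical axis


@dataclass
class Step:
    """One manual step: the new parts added on top of what is already built."""
    new_parts: List[PlacedPart]


def execute_plan(steps: List[Step]) -> List[PlacedPart]:
    """Replay the steps in order and return every part in the final model."""
    built: List[PlacedPart] = []
    for step in steps:
        built.extend(step.new_parts)
    return built


# A two-step toy plan: a base brick, then two parts placed on top of it.
plan = [
    Step([PlacedPart("brick_2x4", (0, 0, 0), 0)]),
    Step([
        PlacedPart("brick_2x2", (0, 0, 1), 90),
        PlacedPart("plate_1x2", (2, 0, 1), 0),
    ]),
]
model = execute_plan(plan)
```

Once every step is reduced to a list of parts with poses, rendering an animation amounts to replaying the steps one after another, which is why extracting each part's position from the image is the crux of the problem.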
The second challenge is that Lego bricks come in so many shapes.
Many basic parts look much alike, but Lego also has plenty of flexible and complex parts, like the guitar head in the picture. Moreover, the different combinations these parts can form make machine interpretation much harder: every building step produces a new, previously unseen image.
To address both challenges, the team proposed a new learning-based framework: the Manual-to-Executable-Plan Network (MEPNet).
Its core idea is to combine neural-network-based 2D keypoint detection with a 2D-3D matching algorithm in order to make high-precision predictions for unseen 3D objects.
MEPNet runs in two stages. The first stage takes as input the 3D models of the base shape and of the new parts, together with a 2D image of the target shape, and predicts for each part a set of 2D keypoints, a rotation angle, and a mask.
In the second stage, the 2D keypoints predicted in the first stage are back-projected into 3D by searching for possible connections between the base shape and the new parts.
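The second stage can be illustrated with a toy example. A single 2D keypoint does not determine a 3D position, because every point along the viewing ray projects to the same pixel. Restricting the search to places where the new part could actually connect to the base shape removes that ambiguity. The sketch below assumes a made-up isometric-style projection and a plain nearest-candidate search; neither is the camera model or the matching algorithm used by MEPNet, and the function names are hypothetical.

```python
import math
from typing import List, Tuple

Point3 = Tuple[float, float, float]
Point2 = Tuple[float, float]


def project(p: Point3) -> Point2:
    """Toy isometric-style projection (an assumption, not the paper's camera).

    Points that differ by a multiple of (1, 1, 1) land on the same pixel,
    which is exactly the depth ambiguity back-projection has to resolve.
    """
    x, y, z = p
    return (x - y, (x + y) / 2.0 - z)


def back_project(keypoint: Point2, candidates: List[Point3]) -> Point3:
    """Pick the candidate connection point whose projection is closest
    to the predicted 2D keypoint."""
    return min(candidates, key=lambda c: math.dist(project(c), keypoint))


# Candidate connection points: the four studs on top of a 2x2 base brick.
studs: List[Point3] = [
    (0.0, 0.0, 1.0),
    (1.0, 0.0, 1.0),
    (0.0, 1.0, 1.0),
    (1.0, 1.0, 1.0),
]

# A slightly noisy 2D keypoint, as a stage-one network might predict it.
predicted = (0.9, -0.4)
position = back_project(predicted, studs)
```

Here the stud at `(1.0, 0.0, 1.0)` projects to `(1.0, -0.5)`, which is the closest match to the noisy prediction, so the new part snaps to that stud even though the keypoint alone carries no depth information.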
Notably, the method does not require any ground-truth images during training.
In addition, MEPNet outperforms other existing methods on the datasets tested. Compared with end-to-end learning approaches, MEPNet retains the efficiency of learning-based models while generalizing better to unseen 3D objects.
Most notably, MEPNet can be trained on synthetic data alone and then applied to real-life scenarios.
All code and data are now open source, and interested readers may want to take a look.
About the authors
The paper comes from Jiajun Wu's team at Stanford University. The authors also include Ruocheng Wang and Yunzhi Zhang, Jiayuan Mao of MIT, and Chin-Yi Cheng of Autodesk AI Lab.
Jiajun Wu is currently an assistant professor at Stanford University, affiliated with the Stanford Vision and Learning Lab (SVL) and the Stanford Artificial Intelligence Laboratory (SAIL). He received his PhD from MIT and his bachelor's degree from Tsinghua University's Yao Class, and was once hailed as one of Tsinghua University's "academic gods".
The first author, Ruocheng Wang, is a master's student in computer science at Stanford and a student of Jiajun Wu. He received his bachelor's degree in computer science from Zhejiang University and also spent a period working with Professor Adnan Darwiche at the University of California, Los Angeles.
One More Thing
Although the paper uses Lego as its example throughout, the authors also note that the technique can in fact be applied to other kinds of assembly instructions.
Many netizens who have "struggled for ages to assemble furniture" are calling for an IKEA version:
Amid the cheers, however, some netizens voiced a different opinion:
I don't know whether this is a pleasant surprise or whether it ruins the fun of playing with Lego for me.
What do you think? Do you like building Lego by following the instructions, or do you prefer to build freely on your own?
Reference links:
[1]https://cs.stanford.edu/~rcwang/projects/lego_manual/
[2]https://twitter.com/_akhaliq/status/1552118469214314496
[3]https://arxiv.org/abs/2207.12572
[4]https://jiajunwu.com/