当前位置:网站首页>CVPR 2022 | Virtual Correspondence: Humans as a Cue for Extreme-View Geometry
CVPR 2022 | Virtual Correspondence: Humans as a Cue for Extreme-View Geometry
2022-07-01 10:53:00 【Zhiyuan community】

Thesis link :http://people.csail.mit.edu/weichium/virtual-correspondence/top.pdf
3D Reconstruction is a very classic problem in graphics , do 3D In the process of reconstruction , Often use multi view geometry , That is, for the same scene ( object ), Observe from different perspectives , Then according to the observation The common part , utilize parallax Carry out three-dimensional information estimation . An important factor affecting the reconstruction effect is the need for different pictures The common part More , Otherwise, the reconstruction effect will be very poor .


however , For the above two figures , Although their shooting angles are almost different 180°, And there are differences in time , However, it is easy to judge from human observation that these two diagrams represent almost the same scene . Why is that ? Because people will also consider the characters in the picture Posture 、 Appearance ( size )、 identity And so on , Unlike multi view geometry, which only uses feature point information .
Based on this , This paper presents a method , Semantic matching can be performed on two graphs with few common perspectives , This method is based on analyzing the corresponding relationship of people in the scene , And can restore the camera pose in each picture .
The author first raises a question : Is it necessary to restore the pose of the camera with the corresponding three-dimensional points on different pictures ? The answer is No , The traditional polar geometry requires that the observed points are the intersections of the polar lines , Here Only the polar lines passing through the points intersect, that is, the two points are considered to be corresponding ( This article is called Virtual Correspondence, abbreviation VC).

however , To judge whether two polar lines intersect , Generally, you need to know the pose of the camera , This is a dead circle , That is, now I want to use the polar line that intersects two points to restore the camera pose , However, the determination of the disjoint of polar lines depends on the camera pose .
therefore , The author's idea here is , Using prior knowledge —— people , Because the human model has strong prior knowledge , There has been a lot of work to restore the human posture model according to a single picture , And with mannequins , It's easy to judge that the polar thread passes through two people “ Point of intersection ” Where it will appear , Thus, the points on different perspectives can be matched ( For example, the ray passing through the chest can quickly find the corresponding point on the back ).
meanwhile , Although here VC It is different from the traditional point by point matching , But it's easy to put classic SfM(Structure from Motion) Method to modify , Used to restore the camera pose .
Selling points of this article ( contribution ):
- Put forward VC, Based on the traditional epipolar geometry , Better applicability .
- A human model is proposed to estimate VC Methods , And it can be compared with the existing 3D frame ( Such as SfM) Good compatibility , It has a wide range of applicable scenarios .
- This method can be combined with some downstream tasks ( Such as multi view geometric reconstruction , Any perspective generation )

The above figure is the flow chart of the algorithm , First, recover human's from the picture 3D Model , Then randomly emit rays , Record all the collision points between it and the model ( Such as the lower abdomen on the front and the back on the back ), Then find their corresponding pixels in the two images , So I found VC.
With VC after , The next step is to estimate the camera pose through these corresponding relationships . Practice and SfM similar , Just replace the traditional matching feature points with VC. meanwhile , because VC Point comparison depends on the accuracy of human shape estimation , So there will be some noise, The method is to reduce the influence of error by optimizing the re projection error as a whole ( I only know some basic knowledge in this part , Interested readers can study deeply by themselves ).
边栏推荐
- Does anyone know why? The table structure is the source table MySQL CDC that has just been directly copied
- venv: venv 的目录结构
- 【MPC】②quadprog求解正定、半正定、负定二次规划
- CCNP Part XII BGP (IV)
- Simulink simulation circuit model of open loop buck buck buck chopper circuit based on MATLAB
- Ask everyone in the group about the fact that the logminer scheme of flick Oracle CDC has been used to run stably in production
- prism journal导航按钮的可用性探索记录
- 106. construct binary tree from middle order and post order traversal sequence
- Recommend a JSON visualization tool artifact!
- 数字藏品新一轮热度开启
猜你喜欢

数字藏品平台搭建需要注意哪些法律风险及资质?

谷歌新论文-Minerva:用语言模型解决定量推理问题

PHP有哪些优势和劣势

The Lantern Festival is held on the fifteenth day of the first month, and the Lantern Festival begins to celebrate the reunion

使用强大的DBPack处理分布式事务(PHP使用教程)

In the new database era, don't just learn Oracle and MySQL

CodeBlocks 左侧项目栏消失,workspace 自动保存项目,Default workspace,打开上次的workspace,工作区(图文教程,已解决)
![[matytype] insert MathType inter line and intra line formulas in CSDN blog](/img/ff/871a3f06f898ed107a2a974d2c7bc4.png)
[matytype] insert MathType inter line and intra line formulas in CSDN blog

LeetCode.515. 在每个树行中找最大值___逐一BFS+DFS+按层BFS
![[MPC] ② quadprog solves positive definite, semi positive definite and negative definite quadratic programming](/img/85/56b12fd664726e4776cab69ca91d57.png)
[MPC] ② quadprog solves positive definite, semi positive definite and negative definite quadratic programming
随机推荐
数据库实验报告(一)
CRC check
NC | 肠道细胞和乳酸菌共同作用来防止念珠菌感染
Mobile hard drive reads but does not display drive letter
数据库的增删改查问题
The exclusive collection of China lunar exploration project is limited to sale!
Sqlachemy common operations
大佬们 有没有搞过sink分流写入clickhouse 或者其他数据库的操作。
Zero foundation software testing must see, 10 years of testing old bird's conscience suggestions (a total of 15)
prism journal导航按钮的可用性探索记录
CVPR 2022 | Virtual Correspondence: Humans as a Cue for Extreme-View Geometry
Personal mall two open Xiaoyao B2C mall system source code - Commercial Version / group shopping discount seckill source code
Graduation season · advanced technology er
[MPC] ② quadprog solves positive definite, semi positive definite and negative definite quadratic programming
数字藏品平台搭建需要注意哪些法律风险及资质?
The list of winners of the digital collection of "century master" was announced
Matplotlib data visualization Foundation
Matplotlib数据可视化基础
LeetCode. 515. Find the maximum value in each tree row___ BFS + DFS + BFS by layer
Venv: directory structure of venv