当前位置:网站首页>Three dimensional reconstruction of deep learning
Three dimensional reconstruction of deep learning
2022-07-03 15:40:00 【Name of algorithm】
be based on MVS Foundation of 3D reconstruction
Three dimensional information representation
It is generally divided into depth map / Parallax map 、 Point cloud 、 grid . They are all expressions 3D A way of information , We will choose different ways to express according to different actual application scenarios . For example, do some background sequencing 、 Face effects can only use depth maps ; And if we want to rebuild a large scene , Such as museums , It needs to be displayed for everyone to browse , You can use grids to represent ; When positioning , We just need to use some clouds . But if we want to make point clouds or grids , Must use depth map , This step must be experienced . Only with depth map can we get point cloud or three-dimensional grid .
- Depth map / Parallax map
- Depth map : The distance from each point in the scene to the camera ;
- Parallax map : The position deviation of pixels imaged under two cameras in the same scene dis
- Relationship between them :depth=bf/dis
- It is a common representation of three-dimensional information

In the diagram above ,Ol and Or It's two cameras , We generally call it binocular camera , The distance between them is called the baseline (Baseline). A point in space P, The distance from it to the baseline Z It's called depth . The two red lines in the above figure are different imaging of two cameras .p Point and p' The difference is P stay Ol and Or Points in camera imaging . parallax d Equal to the column coordinates of the same point pair in the left view minus the column coordinates in the right view , yes Pixel unit
![]()

The above picture is taken by binocular camera , The parallax of the electric rearview mirror is 80-35=45.
In stereovision , The parallax concept is used in baseline corrected image pairs . That is to say, the two cameras are right , parallel , They all shoot objects forward , Only then can the parallax map be used . Generally speaking, we use depth maps , Parallax map is easier to shoot , Then convert it into a depth map . Parallax is measured in pixels , The unit of depth is mm (mm), Change the formula to depth=bf/dis, here b Is the baseline distance of the binocular camera , This is known ,f Represents the normalized focal length , That is, in the internal parameter fx, This is also known ,dis Is the parallax value .
- Three dimensional point cloud
- A 3D point cloud is a data set of points in a coordinate system
- Contains a wealth of information , Including three-dimensional coordinates XYZ, Color RGB Etc
3D point cloud is actually data , It can be viewed intuitively without showing to human beings .
- Three dimensional grid
- It is composed of polygons composed of adjacent point clouds of objects .
- Usually consists of triangles 、 Quadrilateral or other simple convex polygons .

As can be seen from the above figure , 3D mesh is a form of point cloud , It is generally without color information .
- Texture map model
- 3D mesh model with color information
- All color information is stored on a texture map , When displaying, the high-resolution color model is rendered according to the texture coordinates of each mesh and the corresponding texture map .

Image coordinate system 、 Camera coordinate system 、 World coordinate system
边栏推荐
- Enable multi-threaded download of chrome and edge browsers
- Reading notes of "micro service design" (Part 2)
- String functions that you need to know
- Microservices - load balancing ribbon
- Microservices Seata distributed transactions
- Jvm-04-runtime data area heap, method area
- [combinatorial mathematics] binomial theorem and combinatorial identity (binomial theorem | three combinatorial identities | recursive formula 1 | recursive formula 2 | recursive formula 3 Pascal / Ya
- Detailed pointer advanced 1
- Halcon and WinForm study section 2
- 《微服务设计》读书笔记(下)
猜你喜欢

详解指针进阶1

Halcon与Winform学习第二节

qt使用QZxing生成二维码

Introduction series of software reverse cracking (1) - common configurations and function windows of xdbg32/64

The markdown file obtains the pictures of the network and stores them locally and modifies the URL
![[系统安全] 四十三.Powershell恶意代码检测系列 (5)抽象语法树自动提取万字详解](/img/cd/00954b9c592c253d42e6a3b8298999.jpg)
[系统安全] 四十三.Powershell恶意代码检测系列 (5)抽象语法树自动提取万字详解

Reentrantlock usage and source code analysis

The state does not change after the assignment of El switch

详解指针进阶2

Jvm-04-runtime data area heap, method area
随机推荐
Go语言自学系列 | golang switch语句
Matplotlib drawing label cannot display Chinese problems
Custom annotation
求字符串函数和长度不受限制的字符串函数的详解
Dataframe returns the whole row according to the value
Jvm-06-execution engine
Kubernetes帶你從頭到尾捋一遍
App移动端测试【4】apk的操纵
CString getbuffer and releasebuffer instructions
Download and install common programs using AUR
Concurrency-02-visibility, atomicity, orderliness, volatile, CAS, atomic class, unsafe
分布式事务(Seata) 四大模式详解
WinDbg分析dump文件
Calibre LVL
阿飞的期望
视觉上位系统设计开发(halcon-winform)-2.全局变量设计
Visual host system design and development (Halcon WinForm)
Halcon and WinForm study section 2
Subclass hides the function with the same name of the parent class
[combinatorics] combinatorial identities (recursive combinatorial identities | sum of variable terms | simple combinatorial identities and | sum of variable terms | staggered sums of combinatorial ide