当前位置:网站首页>Three dimensional reconstruction of deep learning
Three dimensional reconstruction of deep learning
2022-07-03 15:40:00 【Name of algorithm】
be based on MVS Foundation of 3D reconstruction
Three dimensional information representation
It is generally divided into depth map / Parallax map 、 Point cloud 、 grid . They are all expressions 3D A way of information , We will choose different ways to express according to different actual application scenarios . For example, do some background sequencing 、 Face effects can only use depth maps ; And if we want to rebuild a large scene , Such as museums , It needs to be displayed for everyone to browse , You can use grids to represent ; When positioning , We just need to use some clouds . But if we want to make point clouds or grids , Must use depth map , This step must be experienced . Only with depth map can we get point cloud or three-dimensional grid .
- Depth map / Parallax map
- Depth map : The distance from each point in the scene to the camera ;
- Parallax map : The position deviation of pixels imaged under two cameras in the same scene dis
- Relationship between them :depth=bf/dis
- It is a common representation of three-dimensional information

In the diagram above ,Ol and Or It's two cameras , We generally call it binocular camera , The distance between them is called the baseline (Baseline). A point in space P, The distance from it to the baseline Z It's called depth . The two red lines in the above figure are different imaging of two cameras .p Point and p' The difference is P stay Ol and Or Points in camera imaging . parallax d Equal to the column coordinates of the same point pair in the left view minus the column coordinates in the right view , yes Pixel unit
![]()

The above picture is taken by binocular camera , The parallax of the electric rearview mirror is 80-35=45.
In stereovision , The parallax concept is used in baseline corrected image pairs . That is to say, the two cameras are right , parallel , They all shoot objects forward , Only then can the parallax map be used . Generally speaking, we use depth maps , Parallax map is easier to shoot , Then convert it into a depth map . Parallax is measured in pixels , The unit of depth is mm (mm), Change the formula to depth=bf/dis, here b Is the baseline distance of the binocular camera , This is known ,f Represents the normalized focal length , That is, in the internal parameter fx, This is also known ,dis Is the parallax value .
- Three dimensional point cloud
- A 3D point cloud is a data set of points in a coordinate system
- Contains a wealth of information , Including three-dimensional coordinates XYZ, Color RGB Etc
3D point cloud is actually data , It can be viewed intuitively without showing to human beings .
- Three dimensional grid
- It is composed of polygons composed of adjacent point clouds of objects .
- Usually consists of triangles 、 Quadrilateral or other simple convex polygons .

As can be seen from the above figure , 3D mesh is a form of point cloud , It is generally without color information .
- Texture map model
- 3D mesh model with color information
- All color information is stored on a texture map , When displaying, the high-resolution color model is rendered according to the texture coordinates of each mesh and the corresponding texture map .

Image coordinate system 、 Camera coordinate system 、 World coordinate system
边栏推荐
- Srs4.0+obs studio+vlc3 (environment construction and basic use demonstration)
- Kubernetes - yaml file interpretation
- The state does not change after the assignment of El switch
- 《微服务设计》读书笔记(下)
- Go语言自学系列 | golang中的if else语句
- 详解指针进阶1
- Automatic generation of client code from flask server code -- Introduction to flask native stubs Library
- Baidu AI Cloud helps Shizuishan upgrade the smart health care model of "Internet + elderly care services"
- Microservice - fuse hystrix
- leetcode_ Power of Four
猜你喜欢

详解指针进阶2

Dataframe returns the whole row according to the value

Visual host system design and development (Halcon WinForm)

Halcon and WinForm study section 2

Kubernetes帶你從頭到尾捋一遍

C语言刷题~Leetcode与牛客网简单题

How to use annotations such as @notnull to verify and handle global exceptions

String functions that you need to know

详解指针进阶1

Jvm-04-runtime data area heap, method area
随机推荐
请做好3年内随时失业的准备?
The difference between mutually exclusive objects and critical areas
Concurrency-02-visibility, atomicity, orderliness, volatile, CAS, atomic class, unsafe
Calibre LVL
do{}while()的妙用
Unityshader - materialcapture material capture effect (Emerald axe)
Idea does not specify an output path for the module
Dataframe returns the whole row according to the value
Jvm-05-object, direct memory, string constant pool
Wechat payment -jsapi: code implementation (payment asynchronous callback, Chinese parameter solution)
App mobile terminal test [5] file writing and reading
阿飞的期望
Detailed explanation of string function and string function with unlimited length
String functions that you need to know
leetcode_ Power of Four
Halcon与Winform学习第一节
Summary of concurrent full knowledge points
Kubernetes帶你從頭到尾捋一遍
Microservices Seata distributed transactions
App移动端测试【4】apk的操纵