Three-dimensional reconstruction with deep learning
2022-07-03 15:40:00 【Name of algorithm】
Foundations of MVS-based 3D reconstruction
Representations of 3D information
3D information is generally represented as a depth map / disparity (parallax) map, a point cloud, or a mesh. All of these are ways of expressing 3D information, and which one we choose depends on the application. For example, background blur and face effects only need a depth map. If we want to reconstruct a large scene, such as a museum, and display it for people to browse, a mesh is appropriate. For localization, a point cloud is enough. However, building a point cloud or a mesh always starts from a depth map: this step cannot be skipped. Only with a depth map can we obtain a point cloud or a 3D mesh.
- Depth map / disparity map
- Depth map: the distance from each point in the scene to the camera.
- Disparity (parallax) map: the offset dis, in pixels, between the positions at which the same scene point is imaged by the two cameras.
- Relationship between them: depth = bf/dis
- Both are common representations of 3D information.
In the diagram above, Ol and Or are two cameras, which together are generally called a binocular (stereo) camera. The distance between them is called the baseline. For a point P in space, its distance Z to the baseline is called the depth. The two red lines in the figure are the image planes of the two cameras. The points p and p' are the images of P in cameras Ol and Or respectively. The disparity d equals the column coordinate of the point in the left view minus the column coordinate of the matching point in the right view, and is measured in pixels.
The picture above was taken with a binocular camera; the disparity of the rearview mirror is 80 - 35 = 45 pixels (column 80 in the left view minus column 35 in the right view).
In stereo vision, disparity is defined on rectified image pairs. That is, the two cameras must be upright and parallel, both facing forward toward the scene; only then can a disparity map be used. In practice we work with depth maps, but a disparity map is easier to obtain, so it is computed first and then converted into a depth map. Disparity is measured in pixels and depth in millimetres (mm). The conversion formula is depth = bf/dis, where b is the baseline length of the binocular camera (known), f is the focal length expressed in pixels, i.e. fx in the camera intrinsics (also known), and dis is the disparity value.
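As a minimal sketch of the conversion depth = bf/dis, the function below turns a small disparity map into a depth map. The function name, the baseline of 120 mm and the focal length of 700 px are hypothetical values chosen for illustration, and marking pixels with no valid disparity as None is an assumption, not something the article specifies.

```python
def disparity_to_depth(disparity, baseline_mm, fx_px):
    """Convert a disparity map (pixels) to a depth map (mm): depth = b * f / dis."""
    depth = []
    for row in disparity:
        depth_row = []
        for dis in row:
            # A disparity of zero (or less) means no valid match was found;
            # the depth is undefined there, so it is stored as None (assumption).
            depth_row.append(baseline_mm * fx_px / dis if dis > 0 else None)
        depth.append(depth_row)
    return depth


# Hypothetical rig: 120 mm baseline, fx = 700 px.
disparity = [[45.0, 0.0],
             [70.0, 35.0]]
depth = disparity_to_depth(disparity, baseline_mm=120.0, fx_px=700.0)
```

Because depth is inversely proportional to disparity, halving the disparity (70 to 35 pixels) doubles the depth.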
- 3D point cloud
- A 3D point cloud is a set of data points in a coordinate system.
- It carries rich information, including 3D coordinates XYZ, color RGB, and so on.
A 3D point cloud is essentially raw data; it is not primarily intended for direct display to people.
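The step from a depth map to a colored point cloud can be sketched as follows. This assumes a standard pinhole camera model with intrinsics fx, fy, cx, cy (only fx is mentioned in the text; the others are introduced here for illustration) and treats depth as the Z coordinate in the camera frame. The function name and the sample values are hypothetical.

```python
def depth_to_point_cloud(depth, colors, fx, fy, cx, cy):
    """Back-project a depth map into a list of (X, Y, Z, R, G, B) points.

    depth[v][u] is the Z value of pixel (u, v); colors[v][u] is its (R, G, B).
    """
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z is None or z <= 0:
                continue  # skip pixels without a valid depth
            # Pinhole model: u = fx * X / Z + cx, v = fy * Y / Z + cy.
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z, *colors[v][u]))
    return points


# Hypothetical 2x2 depth map (mm) with one invalid pixel, and matching colors.
depth_map = [[1000.0, None],
             [2000.0, 500.0]]
color_map = [[(255, 0, 0), (0, 0, 0)],
             [(0, 255, 0), (0, 0, 255)]]
cloud = depth_to_point_cloud(depth_map, color_map,
                             fx=500.0, fy=500.0, cx=0.0, cy=0.0)
```

Each resulting point carries both its XYZ coordinates and its RGB color, matching the description of a point cloud above.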
- 3D mesh
- It consists of polygons formed by connecting neighboring points of the object's point cloud.
- It is usually made up of triangles, quadrilaterals or other simple convex polygons.
As can be seen from the figure above, a 3D mesh is built on top of the point cloud, and it generally carries no color information.
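One common way to store such a mesh is an indexed face set: a list of vertices plus a list of faces that refer to vertices by index. The sketch below is illustrative only; the variable names and the unit-square example are assumptions, and it handles triangles only.

```python
import math

# Vertices are 3D points; each face lists the indices of its three corners.
vertices = [(0.0, 0.0, 0.0),
            (1.0, 0.0, 0.0),
            (1.0, 1.0, 0.0),
            (0.0, 1.0, 0.0)]
faces = [(0, 1, 2),   # two triangles that together
         (0, 2, 3)]   # cover the unit square


def triangle_area(a, b, c):
    """Area of triangle abc: half the length of the cross product (b-a) x (c-a)."""
    ab = [b[i] - a[i] for i in range(3)]
    ac = [c[i] - a[i] for i in range(3)]
    cross = (ab[1] * ac[2] - ab[2] * ac[1],
             ab[2] * ac[0] - ab[0] * ac[2],
             ab[0] * ac[1] - ab[1] * ac[0])
    return 0.5 * math.sqrt(sum(component ** 2 for component in cross))


# Sum the areas of all faces to get the surface area of the mesh.
total_area = sum(triangle_area(*(vertices[i] for i in face)) for face in faces)
```

Sharing vertices between faces by index keeps the connectivity explicit, which is what distinguishes a mesh from a bare point cloud.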
- Textured model
- A 3D mesh model with color information.
- All color information is stored in a texture map. At display time, the high-resolution colored model is rendered from the texture coordinates of each mesh face and the corresponding texture map.
Image coordinate system, camera coordinate system, world coordinate system
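The texture lookup described above can be sketched as a nearest-neighbor sample. This is a simplified illustration, not how a real renderer works: it assumes texture coordinates (u, v) in [0, 1] with v = 0 at the top row of the texture (conventions differ between formats), and the function name and the 2x2 texture are hypothetical.

```python
def sample_texture(texture, u, v):
    """Return the texel nearest to texture coordinates (u, v), both in [0, 1]."""
    height = len(texture)
    width = len(texture[0])
    # Scale to pixel indices and clamp so that u = 1.0 or v = 1.0 stays in range.
    px = min(int(u * width), width - 1)
    py = min(int(v * height), height - 1)
    return texture[py][px]


# Hypothetical 2x2 RGB texture map.
texture = [[(255, 0, 0), (0, 255, 0)],
           [(0, 0, 255), (255, 255, 255)]]

# Each vertex of a triangle stores a texture coordinate instead of a color.
triangle_uv = [(0.0, 0.0), (0.75, 0.25), (1.0, 1.0)]
vertex_colors = [sample_texture(texture, u, v) for u, v in triangle_uv]
```

Storing colors in a separate image lets the color resolution be much higher than the number of mesh vertices.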