A brief analysis of Ref-NeRF
2022-07-03 20:42:00 【Zhangchuncheng】
NeRF is a popular neural network for 3D rendering. Taking the recent Ref-NeRF as an opportunity, this article tries to explain the close relationship between the normals of a 3D surface and rendering.
Research background
Rendering method
We simplify the rendering process to the following task: given a photograph of an object, find the color of one point in that picture.
data:image/s3,"s3://crabby-images/bba5c/bba5c8651db57ef07a55c2d72b1bfee8da819655" alt="Snipaste_2022-07-03_18-27-41.png"
The traditional method first needs a three-dimensional model of the object, builds a three-dimensional scene around it, and then computes the color value of each region.
The rendering can be done by building a surface-patch (mesh) model, or by volume integration.
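Rendering by volume integration can be sketched with the usual discrete approximation: samples along one camera ray are alpha-composited front to back. This is a minimal sketch, not the article's or the paper's code. The function name `composite_ray` and its arguments (per-sample density, color, and segment length) are assumptions chosen for illustration.

```python
import numpy as np

def composite_ray(densities, colors, deltas):
    """Alpha-composite the samples of one ray, ordered front to back.

    densities: (N,) non-negative volume density at each sample
    colors:    (N, 3) RGB color at each sample
    deltas:    (N,) length of the ray segment each sample covers
    """
    densities = np.asarray(densities, dtype=float)
    colors = np.asarray(colors, dtype=float)
    deltas = np.asarray(deltas, dtype=float)

    # Opacity of each segment.
    alphas = 1.0 - np.exp(-densities * deltas)
    # Transmittance: fraction of light that reaches sample i unblocked.
    trans = np.concatenate([[1.0], np.cumprod(1.0 - alphas)[:-1]])
    weights = trans * alphas
    return weights @ colors, weights

# Usage: an empty (red) sample in front of a dense (green) one.
rgb, weights = composite_ray([0.0, 100.0], [[1, 0, 0], [0, 1, 0]], [1.0, 1.0])
```

The empty sample contributes nothing, so the pixel takes the color of the dense sample behind it.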
data:image/s3,"s3://crabby-images/c4a35/c4a3561ca5e524b39a9f4a1ca54befb16f71083e" alt="Snipaste_2022-07-03_18-28-02.png"
data:image/s3,"s3://crabby-images/510b7/510b7509e74bf4db0839b519f40baebbf5ff131f" alt="Snipaste_2022-07-03_18-28-10.png"
Here we consider two basic things: the object and the observer.
The model of the object is physical and does not change with the viewpoint. The observer looks at the object model from various positions, which can be simplified to a viewing direction.
NeRF grasps exactly these two key points and abstracts the rendering of a 3D object into a function. The difference from traditional methods is that it no longer relies on explicit modeling of the 3D object.
Neural Radiance Fields & multilayer perceptrons
Neural Radiance Fields (NeRF) is a popular view synthesis technique that represents a scene as a continuous volumetric function, parameterized by multilayer perceptrons (MLPs) that provide the volume density and view-dependent emitted radiance at each location.
From the perspective of the neural network, the rendering process can be written as
$$\mathbf{c} = F_\Theta(\mathbf{x}, \mathbf{d})$$
where $\Theta$ denotes the model parameters, $\mathbf{x}$ the location being rendered, and $\mathbf{d}$ the viewing direction. Training consists of photographing the same object from multiple viewpoints and optimizing $\Theta$ so that the rendered colors match those photographs.
The surface information of the object can therefore be regarded as encoded in the network parameters, and a view from a new direction can be computed from this same set of parameters.
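To make the idea of "rendering as a function" concrete, here is a toy sketch of a radiance field: a tiny, untrained network that maps a location and a viewing direction to a density and a color. Everything in it (the layer sizes, the `theta` dictionary, the name `radiance_field`) is a hypothetical stand-in, not the architecture of NeRF or Ref-NeRF. It only illustrates that density depends on position alone, while color depends on both inputs.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical, untrained parameters standing in for the learned weights.
theta = {
    "W1": rng.normal(size=(3, 16)),         # position -> features
    "W_tau": rng.normal(size=(16, 1)),      # features -> density
    "W_rgb": rng.normal(size=(16 + 3, 3)),  # features + direction -> color
}

def radiance_field(x, d, theta):
    """Map location x (3,) and viewing direction d (3,) to (density, rgb)."""
    x = np.asarray(x, dtype=float)
    d = np.asarray(d, dtype=float)
    h = np.tanh(x @ theta["W1"])                       # uses position only
    tau = np.log1p(np.exp(h @ theta["W_tau"]))[0]      # softplus keeps density >= 0
    logits = np.concatenate([h, d]) @ theta["W_rgb"]   # uses position and direction
    rgb = 1.0 / (1.0 + np.exp(-logits))                # sigmoid keeps color in [0, 1]
    return tau, rgb

# Usage: the same point seen from two different directions.
x = [0.1, 0.2, 0.3]
tau_a, rgb_a = radiance_field(x, [0.0, 0.0, 1.0], theta)
tau_b, rgb_b = radiance_field(x, [1.0, 0.0, 0.0], theta)
```

The density is identical for both calls because it never sees the direction. Only the color is view dependent.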
The problem with NeRF
They often fail to accurately capture and reproduce the appearance of glossy surfaces
data:image/s3,"s3://crabby-images/76368/76368895649c38b1c786828890fc2c3376efe7d5" alt="Snipaste_2022-07-03_18-28-17.png"
In other words, NeRF handles glossy surfaces poorly. The reason is that NeRF's treatment of surface appearance relies on interpolation.
data:image/s3,"s3://crabby-images/a7e4a/a7e4a3de40fbe379c22e3a534c43f53e67bb924d" alt="Snipaste_2022-07-03_18-28-25.png"
Why? Because the function the network represents is continuous.
Therefore, without a special mechanism, when the inputs (location and viewing direction) change gradually, the output color must also change gradually. Macroscopically this shows up as an "interpolation" effect. In the rendering results, the colors have no sharp boundaries and the image looks blurry.
The new method
The main idea
The new method learns the three-dimensional structure of the object's surface and no longer relies on interpolation when rendering color.
data:image/s3,"s3://crabby-images/e7910/e79100549389b10b77cc29a76138db823bd170eb" alt="Snipaste_2022-07-03_18-28-35.png"
The core of the three-dimensional structure here is the direction of the local normal .
How important are normals?
An example illustrates the point. If you render an object in the traditional way, you need its surface normals in addition to its 3D surface structure and texture-map colors.
data:image/s3,"s3://crabby-images/37ed1/37ed101b80ade9689d7e93fef725b21af80975d6" alt="Snipaste_2022-07-03_18-28-41.png"
If you change the direction of its normals, you get a completely different rendering result.
data:image/s3,"s3://crabby-images/76124/761245a05c9421beaef757795a67a4584cba1d34" alt="Snipaste_2022-07-03_18-28-51.png"
The reason for this phenomenon can be explained by the following figure. Taking ambient light into account, the color of a point on an object can be simplified into two components:
$$\mathbf{c} = \mathbf{c}_d + \mathbf{c}_s$$
where $\mathbf{c}_d$ is the diffuse color of the material, which depends on the properties of the object itself, and $\mathbf{c}_s$ is the color contributed by ambient light.
data:image/s3,"s3://crabby-images/9aef9/9aef9ba087efee6cc3735308af4ab02bd7fc4f7f" alt="Snipaste_2022-07-03_18-28-25.png"
Because ambient light is reflected by the object, its contribution is necessarily determined jointly by the illumination angle and the surface normal.
While learning the model parameters, Ref-NeRF adds constraints so that it simultaneously learns how these normals couple with the incident light.
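The coupling between viewing direction and surface normal can be illustrated with the mirror-reflection formula $\hat{\omega}_r = 2(\hat{\omega}_o \cdot \hat{\mathbf{n}})\hat{\mathbf{n}} - \hat{\omega}_o$, where $\hat{\omega}_o$ points from the surface toward the camera. The sketch below is only a geometric illustration under that convention, and the function name `reflect` is an assumption.

```python
import numpy as np

def reflect(view_out, normal):
    """Reflect the outgoing view direction about the surface normal.

    view_out: (3,) direction from the surface point toward the camera
    normal:   (3,) surface normal; both inputs are normalized here
    """
    w_o = np.asarray(view_out, dtype=float)
    n = np.asarray(normal, dtype=float)
    w_o = w_o / np.linalg.norm(w_o)
    n = n / np.linalg.norm(n)
    return 2.0 * np.dot(w_o, n) * n - w_o

# Usage: same camera, two different normals at the same surface point.
view = [1.0, 0.0, 1.0]
r_up = reflect(view, [0.0, 0.0, 1.0])    # normal pointing up
r_side = reflect(view, [1.0, 0.0, 0.0])  # normal pointing sideways
```

With the camera fixed, changing only the normal sends the reflected ray toward a different part of the environment, which is why a wrong normal produces a visibly different rendering.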
Network structure
data:image/s3,"s3://crabby-images/b859f/b859fbbbbd9d64c9598200162f3f4c81eae54ca8" alt="Snipaste_2022-07-03_18-28-59.png"
In the original NeRF method:
input $\mathbf{x}$ is the location; input $\mathbf{d}$ is the viewing direction; output $\tau$ is the material density; output $\mathbf{c}$ is the color value; the intermediate variable $\mathbf{b}$ is a bottleneck vector, somewhat like a skip connection in ResNet.
The intermediate variables added by Ref-NeRF are:
$\mathbf{c}_s$ and $\mathbf{c}_d$: the color produced by light and the object's own color; $\mathbf{s}$: the weighting of the light color; $\rho$: the local surface roughness; $\hat{\mathbf{n}}$: the local normal.
Why are so many quantities introduced besides normals? The reason is easy to understand.
$\mathbf{c}_d$, $\mathbf{c}_s$, $\mathbf{s}$: control the values and relative proportion of the external light and the object's own color. $\rho$: once the surface normal is taken into account, the local color is strongly affected by the normal. In practice, however, the surface of an object is never perfectly smooth, so the actual result can differ greatly from the ideal one. In this network, the rougher the surface, the more strongly the reflected color is smoothed. The smoothing is implemented by fitting a von Mises-Fisher (vMF) distribution.
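How the diffuse color, the light color, and its weighting might combine into one pixel color can be sketched as follows. This is a simplified illustration, not the paper's exact formula: it assumes the combination is diffuse plus weighted specular, and a plain clip stands in for any tone-mapping step. The names `combine_color`, `c_d`, `s`, and `c_s` are chosen to mirror the quantities described above.

```python
import numpy as np

def combine_color(c_d, s, c_s):
    """Diffuse color plus weighted specular color, clipped to [0, 1].

    c_d: (3,) the object's own (diffuse) color
    s:   (3,) per-channel weighting of the light color
    c_s: (3,) color produced by reflected light
    """
    c_d, s, c_s = (np.asarray(v, dtype=float) for v in (c_d, s, c_s))
    return np.clip(c_d + s * c_s, 0.0, 1.0)

# Usage: repaint the object while keeping the same reflection.
tint = [0.5, 0.5, 0.5]
light = [1.0, 1.0, 1.0]
before = combine_color([0.2, 0.1, 0.0], tint, light)
after = combine_color([0.0, 0.1, 0.2], tint, light)
```

Because the two terms are separate, changing `c_d` leaves the specular contribution `s * c_s` untouched, which is the kind of separation that makes editing the diffuse color possible.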
We introduce a technique, which we call an Integrated Directional Encoding (IDE), that enables the directional MLP to efficiently represent the function of outgoing radiance for materials with any continuously-valued roughness.
data:image/s3,"s3://crabby-images/42e31/42e311504a417a47e57a1d6f98d96a61682b7599" alt="Snipaste_2022-07-03_18-29-06.png"
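The effect of roughness on a directional encoding can be sketched numerically. To the best of my understanding, the paper averages spherical-harmonic features over a vMF distribution with concentration $\kappa = 1/\rho$, and each degree-$\ell$ feature is then scaled by approximately $\exp(-\ell(\ell+1)/(2\kappa))$. Treat that formula as an assumption to verify against the paper. The function name `ide_attenuation` and the chosen degrees are illustrative only.

```python
import numpy as np

def ide_attenuation(degrees, roughness):
    """Approximate per-degree scale factor exp(-l(l+1)/(2*kappa)), kappa = 1/roughness.

    Written in terms of roughness so that roughness = 0 (a perfect mirror)
    needs no division and leaves every degree unattenuated.
    """
    degrees = np.asarray(degrees, dtype=float)
    return np.exp(-0.5 * degrees * (degrees + 1.0) * roughness)

# Usage: the same encoding degrees for a mirror-like and a rough surface.
degrees = [1, 2, 4, 8]
mirror = ide_attenuation(degrees, 0.0)
rough = ide_attenuation(degrees, 0.1)
```

For a rough surface the high-degree (high-frequency) components are suppressed most, so the directional MLP only sees a blurred version of the reflection direction. That is the "smoothing" described above.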
Results
Comparison with NeRF
data:image/s3,"s3://crabby-images/5e0ef/5e0efe0535d2b2f9dc907479542bb7a7e42bc1f3" alt="Snipaste_2022-07-03_18-29-17.png"
data:image/s3,"s3://crabby-images/b0fa5/b0fa5640349cc085e948985db23e7e691e250680" alt="Snipaste_2022-07-03_18-29-24.png"
As shown, Ref-NeRF accurately estimates the surface normals of spherical and cylindrical structures, as well as which part of the appearance is caused by ambient light. This gives the model the ability to recognize and learn specular reflection.
Scene editing
Finally, because the model learns not only the surface information of the object but also the ambient light, we can change these two factors to "edit" 3D objects and scenes.
data:image/s3,"s3://crabby-images/c3eb9/c3eb9f5a75568690446f5ce79a99f314916bb56a" alt="Snipaste_2022-07-03_18-29-30.png"
We can edit the diffuse color of the car without affecting the specular reflections of its glossy paint
data:image/s3,"s3://crabby-images/1a7f9/1a7f96336ce3d2dac67e58e303cdb656b395880d" alt="Snipaste_2022-07-03_18-29-38.png"
We can plausibly modify the roughness of the car and material balls by manipulating the κ values used in the IDE