当前位置:网站首页>[2021]IBRNet: Learning Multi-View Image-Based Rendering Qianqian
[2021]IBRNet: Learning Multi-View Image-Based Rendering Qianqian
2022-07-05 06:17:00 【Dark blue blue blue】
NeRF A big problem is that it can only represent one scene , Therefore, this article proposes a framework for learning multiple scenarios at the same time , And it can be extended to scenes that have not been studied .
This article and NeRF The biggest difference is that the input data does not only have a target perspective , There are also corresponding multi view pictures of the same scene , Therefore, theoretically, it can be directly applied to new scenarios end-to-end .
Model flow :
1. Input the multi view pictures of the same scene into the network ( There is no limit to the number of ), Then use a U-Net To extract each picture (source view) Characteristics of , Features include image color , Camera parameters , Image representation ( Here it can be understood as NeRF Emitting light to the radiation field in , Then save the corresponding light parameters and image features ).
2. Then input the features of each picture in parallel transformer, Used to predict a common color and density . The reason for the common color and density is that the features input from multiple perspectives are the features of the same point in different perspectives by default , So the result is to predict our target perspective (target view) Results at this point .
3. Render the result by volume rendering , Then optimize the network through the reconstruction loss of pixels
4. Another scene , repeat 1~3
remarks : If you keep training with the same scene , In theory, the effect will be better , That's what the paper mentioned finetune The situation of .
Personal understanding : In essence, the model here learns “ How to interpolate ”, Instead of building a radiation field , Therefore, it may not perform well in sparse situations or complex scenes
边栏推荐
- Operator priority, one catch, no doubt
- Is it impossible for lamda to wake up?
- In depth analysis of for (VaR I = 0; I < 5; i++) {settimeout (() => console.log (I), 1000)}
- 阿里新成员「瓴羊」正式亮相,由阿里副总裁朋新宇带队,集结多个核心部门技术团队
- [rust notes] 17 concurrent (Part 2)
- Règlement sur la sécurité des réseaux dans les écoles professionnelles secondaires du concours de compétences des écoles professionnelles de la province de Guizhou en 2022
- Sum of three terms (construction)
- Leetcode-6110: number of incremental paths in the grid graph
- MIT-6874-Deep Learning in the Life Sciences Week 7
- [rust notes] 14 set (Part 1)
猜你喜欢
leetcode-6111:螺旋矩阵 IV
Navicat连接Oracle数据库报错ORA-28547或ORA-03135
Real time clock (RTC)
SQLMAP使用教程(一)
Chapter 6 relational database theory
1.14 - 流水线
Doing SQL performance optimization is really eye-catching
Spark中groupByKey() 和 reduceByKey() 和combineByKey()
Appium automation test foundation - Summary of appium test environment construction
Sqlmap tutorial (1)
随机推荐
Groupbykey() and reducebykey() and combinebykey() in spark
SQLMAP使用教程(二)实战技巧一
Spark中groupByKey() 和 reduceByKey() 和combineByKey()
The sum of the unique elements of the daily question
【Rust 笔记】14-集合(上)
QQ computer version cancels escape character input expression
1040 Longest Symmetric String
SPI details
[leetcode] day95 effective Sudoku & matrix zeroing
Leetcode heap correlation
[rust notes] 17 concurrent (Part 2)
Network security skills competition in Secondary Vocational Schools -- a tutorial article on middleware penetration testing in Guangxi regional competition
One question per day 1020 Number of enclaves
剑指 Offer II 058:日程表
Erreur de connexion Navicat à la base de données Oracle Ora - 28547 ou Ora - 03135
MySQL advanced part 2: SQL optimization
leetcode-6108:解密消息
MySQL advanced part 1: stored procedures and functions
Leetcode-3: Longest substring without repeated characters
MySQL advanced part 2: MySQL architecture