当前位置:网站首页>[2021]IBRNet: Learning Multi-View Image-Based Rendering Qianqian
[2021]IBRNet: Learning Multi-View Image-Based Rendering Qianqian
2022-07-05 06:17:00 【Dark blue blue blue】
NeRF A big problem is that it can only represent one scene , Therefore, this article proposes a framework for learning multiple scenarios at the same time , And it can be extended to scenes that have not been studied .
This article and NeRF The biggest difference is that the input data does not only have a target perspective , There are also corresponding multi view pictures of the same scene , Therefore, theoretically, it can be directly applied to new scenarios end-to-end .
Model flow :
1. Input the multi view pictures of the same scene into the network ( There is no limit to the number of ), Then use a U-Net To extract each picture (source view) Characteristics of , Features include image color , Camera parameters , Image representation ( Here it can be understood as NeRF Emitting light to the radiation field in , Then save the corresponding light parameters and image features ).
2. Then input the features of each picture in parallel transformer, Used to predict a common color and density . The reason for the common color and density is that the features input from multiple perspectives are the features of the same point in different perspectives by default , So the result is to predict our target perspective (target view) Results at this point .
3. Render the result by volume rendering , Then optimize the network through the reconstruction loss of pixels
4. Another scene , repeat 1~3
remarks : If you keep training with the same scene , In theory, the effect will be better , That's what the paper mentioned finetune The situation of .
Personal understanding : In essence, the model here learns “ How to interpolate ”, Instead of building a radiation field , Therefore, it may not perform well in sparse situations or complex scenes
边栏推荐
- Leetcode heap correlation
- Daily question 1342 Number of operations to change the number to 0
- Shutter web hardware keyboard monitoring
- Open source storage is so popular, why do we insist on self-development?
- 11-gorm-v2-03-basic query
- liunx启动redis
- Daily question 2006 Number of pairs whose absolute value of difference is k
- MatrixDB v4.5.0 重磅发布,全新推出 MARS2 存储引擎!
- Règlement sur la sécurité des réseaux dans les écoles professionnelles secondaires du concours de compétences des écoles professionnelles de la province de Guizhou en 2022
- [leetcode] day94 reshape matrix
猜你喜欢
Navicat连接Oracle数据库报错ORA-28547或ORA-03135
什么是套接字?Socket基本介绍
Appium基础 — 使用Appium的第一个Demo
MySQL advanced part 1: index
Matrixdb V4.5.0 was launched with a new mars2 storage engine!
MySQL advanced part 2: SQL optimization
QQ电脑版取消转义符输入表情
leetcode-6111:螺旋矩阵 IV
Sqlmap tutorial (II) practical skills I
MySQL advanced part 2: MySQL architecture
随机推荐
Basic explanation of typescript
LVS简介【暂未完成(半成品)】
leetcode-6108:解密消息
1.13 - RISC/CISC
927. 三等分 模拟
MIT-6874-Deep Learning in the Life Sciences Week 7
对for(var i = 0;i < 5;i++) {setTimeout(() => console.log(i),1000)}的深入分析
MySQL advanced part 2: the use of indexes
11-gorm-v2-02-create data
SQLMAP使用教程(一)
Leetcode-6109: number of people who know secrets
Arduino 控制的 RGB LED 无限镜
【Rust 笔记】13-迭代器(下)
【Rust 笔记】17-并发(上)
[rust notes] 13 iterator (Part 2)
One question per day 1020 Number of enclaves
927. Trisection simulation
MIT-6874-Deep Learning in the Life Sciences Week 7
【Rust 笔记】14-集合(上)
New title of module a of "PanYun Cup" secondary vocational network security skills competition