Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis Implementation

Last update: Dec 27, 2022

Related tags

Overview

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis Implementation

This project attempted to implement the paper Putting NeRF on a Diet (DietNeRF) in JAX/Flax. DietNeRF is designed for rendering quality novel views in few-shot learning scheme, a task that vanilla NeRF (Neural Radiance Field) struggles. To achieve this, the author coins Semantic Consistency Loss to supervise DietNeRF by prior knowledge from CLIP Vision Transformer. Such supervision enables DietNeRF to learn 3D scene reconstruction with CLIP's prior knowledge on 2D views.

Besides this repo, you can check our write-up and demo here:

✍️ Write-up in Notion: more details of DietNeRF and our experiments
✨ Demo in Hugging Face Space: showcase our trained DietNeRFs by Streamlit

🤩 Demo

You can check out our demo in Hugging Face Space
Or you can set up our Streamlit demo locally (model checkpoints will be fetched automatically upon startup)

pip install -r requirements_demo.txt
streamlit run app.py

✨ Implementation

Our code is written in JAX/ Flax and mainly based upon jaxnerf from Google Research. The base code is highly optimized in GPU & TPU. For semantic consistency loss, we utilize pretrained CLIP Vision Transformer from transformers library.

To learn more about DietNeRF, our experiments and implementation, you are highly recommended to check out our very detailed Notion write-up!

🤗 Hugging Face Model Hub Repo

You can also find our project and our model checkpoints on our Hugging Face Model Hub Repository. The models checkpoints are located in models folder.

Our JAX/Flax implementation currently supports:

Platform	Single-Host GPU		Multi-Device TPU
Type	Single-Device	Multi-Device	Single-Host	Multi-Host
Training
Evaluation

💻 Installation

# Clone the repo
git clone https://github.com/codestella/putting-nerf-on-a-diet
# Create a conda environment, note you can use python 3.6-3.8 as
# one of the dependencies (TensorFlow) hasn't supported python 3.9 yet.
conda create --name jaxnerf python=3.6.12; conda activate jaxnerf
# Prepare pip
conda install pip; pip install --upgrade pip
# Install requirements
pip install -r requirements.txt
# [Optional] Install GPU and TPU support for Jax
# Remember to change cuda101 to your CUDA version, e.g. cuda110 for CUDA 11.0.
!pip install --upgrade jax "jax[cuda110]" -f https://storage.googleapis.com/jax-releases/jax_releases.html
# install flax and flax-transformer
pip install flax transformers[flax]

⚽ Dataset

Download the datasets from the NeRF official Google Drive. Please download the nerf_synthetic.zip and unzip them in the place you like. Let's assume they are placed under /tmp/jaxnerf/data/.

🤟 How to Train

Train in our prepared Colab notebook: Colab Pro is recommended, otherwise you may encounter out-of-memory
Train locally: set use_semantic_loss=true in your yaml configuration file to enable DietNeRF.

python -m train \
  --data_dir=/PATH/TO/YOUR/SCENE/DATA \ # (e.g. nerf_synthetic/lego)
  --train_dir=/PATH/TO/THE/PLACE/YOU/WANT/TO/SAVE/CHECKPOINTS \
  --config=configs/CONFIG_YOU_LIKE

💎 Experimental Results

❗ Rendered Rendering images by 8-shot learned DietNeRF

DietNeRF has a strong capacity to generalise on novel and challenging views with EXTREMELY SMALL TRAINING SAMPLES!

HOTDOG / DRUM / SHIP / CHAIR / LEGO / MIC

❗ Rendered GIF by occluded 14-shot learned NeRF and Diet-NeRF

We made artificial occlusion on the right side of image (Only picked left side training poses). The reconstruction quality can be compared with this experiment. DietNeRF shows better quality than Original NeRF when It is occluded.

Training poses

LEGO

Diet NeRF	NeRF

SHIP

Diet NeRF	NeRF

👨‍👧‍👦 Our Team

Teams	Members
Project Managing	Stella Yang To Watch Our Project Progress, Please Check Our Project Notion
NeRF Team	Stella Yang, Alex Lau, Seunghyun Lee, Hyunkyu Kim, Haswanth Aekula, JaeYoung Chung
CLIP Team	Seunghyun Lee, Sasikanth Kotti, Khalid Sifullah , Sunghyun Kim
Cloud TPU Team	Alex Lau, Aswin Pyakurel, JaeYoung Chung, Sunghyun Kim

*Special mention to our "night owl" contributors 🦉 : Seunghyun Lee, Alex Lau, Stella Yang, Haswanth Aekula

💞 Social Impact

Game Industry
Augmented Reality Industry
Virtual Reality Industry
Graphics Industry
Online shopping
Metaverse
Digital Twin
Mapping / SLAM

🌱 References

This project is based on “JAX-NeRF”.

@software{jaxnerf2020github,
  author = {Boyang Deng and Jonathan T. Barron and Pratul P. Srinivasan},
  title = {{JaxNeRF}: an efficient {JAX} implementation of {NeRF}},
  url = {https://github.com/google-research/google-research/tree/master/jaxnerf},
  version = {0.0},
  year = {2020},
}

This project is based on “Putting NeRF on a Diet”.

@misc{jain2021putting,
      title={Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis}, 
      author={Ajay Jain and Matthew Tancik and Pieter Abbeel},
      year={2021},
      eprint={2104.00677},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

🔑 License

Apache License 2.0

❤️ Special Thanks

Our Project is motivated by HuggingFace X GoogleAI (JAX) Community Week Event 2021.

We would like to take this chance to thank Hugging Face for organizing such an amazing open-source initiative, Suraj and Patrick for all the technical help. We learn a lot throughout this wonderful experience!

Finally, we would like to thank Common Computer AI for sponsoring our team access to V100 multi-GPUs server. Thank you so much for your support!

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis Implementation

Related tags

Overview

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis Implementation

🤩 Demo

✨ Implementation

🤗 Hugging Face Model Hub Repo

💻 Installation

⚽ Dataset

🤟 How to Train

💎 Experimental Results

❗ Rendered Rendering images by 8-shot learned DietNeRF

HOTDOG / DRUM / SHIP / CHAIR / LEGO / MIC

❗ Rendered GIF by occluded 14-shot learned NeRF and Diet-NeRF

Training poses

LEGO

SHIP

👨‍👧‍👦 Our Team

💞 Social Impact

🌱 References

🔑 License

❤️ Special Thanks

Owner

Face Recognition plus identification simply and fast | Python

PyTorch implementation of saliency map-aided GAN for Auto-demosaic+denosing

Data Augmentation Using Keras and Python

Starter kit for getting started in the Music Demixing Challenge.

Official PyTorch implementation of "Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image", ICCV 2019

McGill Physics Hackathon 2021: Reaction-Diffusion Models for the Generation of Biological Patterns

The Submission for SIMMC 2.0 Challenge 2021

An implementation for `Text2Event: Controllable Sequence-to-Structure Generation for End-to-end Event Extraction`

Creating Artificial Life with Reinforcement Learning

GLNet for Memory-Efficient Segmentation of Ultra-High Resolution Images

COLMAP - Structure-from-Motion and Multi-View Stereo

VQGAN+CLIP Colab Notebook with user-friendly interface.

Implementation of Kalman Filter in Python

[Preprint] "Chasing Sparsity in Vision Transformers: An End-to-End Exploration" by Tianlong Chen, Yu Cheng, Zhe Gan, Lu Yuan, Lei Zhang, Zhangyang Wang

This is an example of a reproducible modelling project

Learning Time-Critical Responses for Interactive Character Control

Automatic number plate recognition using tech: Yolo, OCR, Scene text detection, scene text recognation, flask, torch

Official Pytorch implementation of ICLR 2018 paper Deep Learning for Physical Processes: Integrating Prior Scientific Knowledge.

Automatic differentiation with weighted finite-state transducers.

The codes of paper 'Active-LATHE: An Active Learning Algorithm for Boosting the Error exponent for Learning Homogeneous Ising Trees'