Pathdreamer: A World Model for Indoor Navigation

Last update: Jan 04, 2023

Related tags

Deep Learning pathdreamer

Overview

Pathdreamer: A World Model for Indoor Navigation

This repository hosts the open source code for Pathdreamer, to be presented at ICCV 2021.

Paper | Project Webpage | Colab Demo

Setup instructions

Environment

Set up virtualenv, and install required libraries:

virtualenv venv
source venv/bin/activate
pip install -r requirements.txt

Add the Pathdreamer library to PYTHONPATH:

export PYTHONPATH=$PYTHONPATH:/home/path/to/pathdreamer_root/

Downloading Pretrained Checkpoints

We provide a pretrained checkpoint which can be acquired by running:

wget https://storage.googleapis.com/gresearch/pathdreamer/ckpt.tar -P data/
tar -xf data/ckpt.tar --directory data/

The results will be extracted to the data/ckpt directory. Two checkpoints are provided, one for the Stage 1 model (Structure Generator), and another for the Stage 2 model (Image Generator).

Colab Demo

Pathdreamer_Example_Colab.ipynb [click to launch in Google Colab] shows how to setup and run the pretrained Pathdreamer model for inference. It includes examples on synthesizing image sequences and continuous video sequences for arbitrary navigation trajectories.

Citation

If you find this work useful, please consider citing:

@inproceedings{koh2021pathdreamer,
  title={Pathdreamer: A World Model for Indoor Navigation},
  author={Koh, Jing Yu and Lee, Honglak and Yang, Yinfei and Baldridge, Jason and Anderson, Peter},
  journal={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  year={2021}
}

License

Pathdreamer is released under the Apache 2.0 license. The Matterport3D dataset is governed by the Matterport3D Terms of Use.

Disclaimer

Not an official Google product.

Pathdreamer: A World Model for Indoor Navigation

Related tags

Overview

Pathdreamer: A World Model for Indoor Navigation

Setup instructions

Environment

Downloading Pretrained Checkpoints

Colab Demo

Citation

License

Disclaimer

Owner

Google Research

Collective Multi-type Entity Alignment Between Knowledge Graphs (WWW'20)

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

GLM (General Language Model)

Videocaptioning.pytorch - A simple implementation of video captioning

the code used for the preprint Embedding-based Instance Segmentation of Microscopy Images.

RIM: Reliable Influence-based Active Learning on Graphs.

Implementation of "Debiasing Item-to-Item Recommendations With Small Annotated Datasets" (RecSys '20)

This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression"

Companion code for the paper Theoretical characterization of uncertainty in high-dimensional linear classification

Deep Learning Specialization by Andrew Ng, deeplearning.ai.

Collection of generative models in Pytorch version.

LSTM Neural Networks for Spectroscopic Studies of Type Ia Supernovae

Studying Python release adoptions by looking at PyPI downloads

Code for "Unsupervised Source Separation via Bayesian inference in the latent domain"

A modular active learning framework for Python

This game was designed to encourage young people not to gamble on lotteries, as the probablity of correctly guessing the number is infinitesimal!

Stereo Hybrid Event-Frame (SHEF) Cameras for 3D Perception, IROS 2021

Real-time VIBE: Frame by Frame Inference of VIBE (Video Inference for Human Body Pose and Shape Estimation)

Paaster is a secure by default end-to-end encrypted pastebin built with the objective of simplicity.

Machine learning and Deep learning models, deploy on telegram (the best social media)