《Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis》(2021)

Last update: Nov 27, 2022

Related tags

Overview

Image2Reverb

Image2Reverb is an end-to-end neural network that generates plausible audio impulse responses from single images of acoustic environments. Code for the paper Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis. The architecture is a conditional GAN with a ResNet50 (pre-trained on Places365 and fine-tuned) image encoder. It generates monoaural audio impulse responses (directly applicable to convolution applications) as magnitude spectrograms.

Dependencies

Model/Data:

PyTorch>=1.7.0
PyTorch Lightning
torchvision
torchaudio
librosa
PyRoomAcoustics
PIL

Eval/Preprocessing:

PySoundfile
SciPy
Scikit-Learn
python-acoustics
google-images-download
matplotlib

Usage

We will make a pre-trained model available soon!

Acknowledgments

We borrow and adapt code snippets from GANSynth (and this PyTorch re-implementation), additional snippets from this PGGAN implementation, and more.

Owner

Nikhil Singh

GitHub Repository

Individual Tree Crown classification on WorldView-2 Images using Autoencoder -- Group 9 Weak learners - Final Project (Machine Learning 2020 Course)

Created by Olga Sutyrina, Sarah Elemili, Abduragim Shtanchaev and Artur Bille Individual Tree Crown classification on WorldView-2 Images using Autoenc

2 Dec 08, 2022

《Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis》(2021)

Related tags

Overview

Image2Reverb

Dependencies

Usage

Acknowledgments

Owner

Nikhil Singh

The source code of CVPR17 'Generative Face Completion'.

StableSims is an open-source project aimed at simulating MakerDAO's Dai stablecoin system

Just Go with the Flow: Self-Supervised Scene Flow Estimation

Half Instance Normalization Network for Image Restoration

Neurolab is a simple and powerful Neural Network Library for Python

Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

Shape Matching of Real 3D Object Data to Synthetic 3D CADs (3DV project @ ETHZ)

A large-scale face dataset for face parsing, recognition, generation and editing.

Recovering Brain Structure Network Using Functional Connectivity

Eff video representation - Efficient video representation through neural fields

Towards Part-Based Understanding of RGB-D Scans

Multi-tool reverse engineering collaboration solution.

Si Adek Keras is software VR dangerous object detection.

Implementation for paper LadderNet: Multi-path networks based on U-Net for medical image segmentation

Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.

Use of Attention Gates in a Convolutional Neural Network / Medical Image Classification and Segmentation

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image

SimplEx - Explaining Latent Representations with a Corpus of Examples

Individual Tree Crown classification on WorldView-2 Images using Autoencoder -- Group 9 Weak learners - Final Project (Machine Learning 2020 Course)