Overview
This is the official repository for the paper "NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion".
NÜWA is a unified multimodal pre-trained model that can generate new visual data or manipulate existing visual data (i.e., images and videos) across 8 visual synthesis tasks (as shown above).