Styled text-to-drawing synthesis method. Featured at the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

Last update: Dec 23, 2022

Overview

StyleCLIPDraw

Peter Schaldenbrand, Zhixuan Liu, Jean Oh September 2021

To be featured in the 2021 NeurIPS Workshop on Machine Learning and Design

StyleCLIPDraw adds a style loss to the CLIPDraw (Frans et al. 2021) (code) text-to-drawing synthesis model to allow artistic control of the synthesized drawings in addition to control of the content via text. Whereas performing decoupled style transfer on a generated image only affects the texture, our proposed coupled approach is able to capture a style in both texture and shape, suggesting that the style of the drawing is coupled with the drawing process itself.

Checkout our code on Colab

Method

Unlike most other image generation models, CLIPDraw produces drawings consisting of a series of Bezier curves defined by a list of coordinates, a color, and an opacity. The drawing begins as randomized Bezier curves on a canvas and is optimized to fit the given style and text. The StyleCLIPDraw model architecture is shown above. The brush strokes are rendered into a raster image via differentiable model. There are two losses for StyleCLIPDraw that correspond to each input. The text input and the augmented raster drawing are fed the the CLIP model and the difference in embeddings are compared using cosine distance to compute a loss that encourages the drawing to fit the text input. The image is augmented to avoid finding shallow solutions to optimizing through the CLIP model. The raster image and the style image are fed through early layers of the VGG-16 model and the difference in extracted features form the loss that encourages the drawings to fit the style of the style image.

Styled text-to-drawing synthesis method. Featured at the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

Related tags

Overview

StyleCLIPDraw

Peter Schaldenbrand, Zhixuan Liu, Jean Oh September 2021

Method

Results

StyleCLIPDraw vs. CLIPDraw then Style Transfer

Owner

Peter Schaldenbrand

PyTorch code for 'Efficient Single Image Super-Resolution Using Dual Path Connections with Multiple Scale Learning'

Instance-level Image Retrieval using Reranking Transformers

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

Graph-total-spanning-trees - A Python script to get total number of Spanning Trees in a Graph

A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

MVGCN: a novel multi-view graph convolutional network (MVGCN) framework for link prediction in biomedical bipartite networks.

Train CNNs for the fruits360 data set in NTOU CS「Machine Vision」class.

TLDR: Twin Learning for Dimensionality Reduction

TorchMetrics is a collection of 25+ PyTorch metrics implementations and an easy-to-use API to create custom metrics.

Train neural network for semantic segmentation (deep lab V3) with pytorch in less then 50 lines of code

Keras Implementation of Neural Style Transfer from the paper "A Neural Algorithm of Artistic Style"

A pytorch implementation of Detectron. Both training from scratch and inferring directly from pretrained Detectron weights are available.

Videocaptioning.pytorch - A simple implementation of video captioning

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

Spherical CNNs

API for RL algorithm design & testing of BCA (Building Control Agent) HVAC on EnergyPlus building energy simulator by wrapping their EMS Python API

[CVPR'21] Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild

Illuminated3D This project participates in the Nasa Space Apps Challenge 2021.

an implementation of softmax splatting for differentiable forward warping using PyTorch

This repo is duplication of jwyang/faster-rcnn.pytorch