CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

Last update: Jan 07, 2023

Overview

CLIP (Contrastive Language–Image Pre-training)

Experiments (Evaluation)

Model	Dataset	Acc (%)
ViT-B/32 (Paper)	CIFAR100	65.1
ViT-B/32 (Our)	CIFAR100	61.71
ViT-B/32 (Paper	CIFAR10	91.3
ViT-B/32 (Our)	CIFAR10	88.8

Overview

Training

Work In Process

Usage

Evaluation

python evaluation.py --dataset CIFAR100 --cuda True

args
- dataset (str): CIFAR10, CIFAR100 (default: CIFAR100)
- num_workers (int): default: 0
- batch_size (int): default: 128
- cuda (bool): False
Training
- Prepare Data
  - Visual Genome Dataset link
  - Download (images, region descriptions)
- training
```
python main.py --base_dir ./ --cuda True
```

Reference

paper link
Author: Alec Radford, Jong Wook Kim, Chris Hallacy, Girish Sastry, Amanda Askell, Pamela Mishkin, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Jack Clark, Gretchen Krueger, Ilya Sutskever
OpenAI

Owner

Myeongjun Kim

Computer Vision Research using Deep Learning

GitHub Repository

Everything's Talkin': Pareidolia Face Reenactment (CVPR2021)

Everything's Talkin': Pareidolia Face Reenactment (CVPR2021) Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, and Ran He [Paper], [Video

71 Dec 21, 2022

Projecting interval uncertainty through the discrete Fourier transform

Projecting interval uncertainty through the discrete Fourier transform This repo

1 Mar 02, 2022

Tensorflow implementation of our method: "Triangle Graph Interest Network for Click-through Rate Prediction".

TGIN Tensorflow implementation of our method: "Triangle Graph Interest Network for Click-through Rate Prediction". Files in the folder dataset/ electr

21 Dec 21, 2022

A script depending on VASP output for calculating Fermi-Softness.

Fermi softness calculation for Vienna Ab initio Simulation Package (VASP) Update 1.1.0: Big update: Rewrote the code. Use Bader atomic division instea

11 Nov 08, 2022

Repo for the Video Person Clustering dataset, and code for the associated paper

Video Person Clustering Repo for the Video Person Clustering dataset, and code for the associated paper. This reporsitory contains the Video Person Cl

47 Nov 02, 2022

RobustVideoMatting and background composing in one model by using onnxruntime.

RVM_onnx_compose RobustVideoMatting and background composing in one model by using onnxruntime. Usage pip install -r requirements.txt python infer_cam

4 Apr 07, 2022

Configure SRX interfaces with Scrapli

Configure SRX interfaces with Scrapli Overview This example will show how to configure interfaces on Juniper's SRX firewalls. In addition to the Pytho

1 Jan 07, 2022

This is a repository of our model for weakly-supervised video dense anticipation.

Introduction This is a repository of our model for weakly-supervised video dense anticipation. More results on GTEA, Epic-Kitchens etc. will come soon

2 Apr 09, 2022

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

ActNN : Activation Compressed Training This is the official project repository for ActNN: Reducing Training Memory Footprint via 2-Bit Activation Comp

178 Jan 05, 2023

EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos.

EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos. In this project, we provide the basic code for fitt

2.2k Jan 05, 2023

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Human-Level Control through Deep Reinforcement Learning Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning. This imp

2.4k Dec 26, 2022

Machine learning Bot detection technique, based on United States election dataset

Machine learning Bot detection technique, based on United States election dataset (2020). Current github repo provides implementation described in pap

4 Nov 20, 2022

Object detection (YOLO) with pytorch, OpenCV and python

Real Time Object/Face Detection Using YOLO-v3 This project implements a real time object and face detection using YOLO algorithm. You only look once,

1 Aug 04, 2022

Models Supported: AlbUNet [18, 34, 50, 101, 152] (1D and 2D versions for Single and Multiclass Segmentation, Feature Extraction with supports for Deep Supervision and Guided Attention)

AlbUNet-1D-2D-Tensorflow-Keras This repository contains 1D and 2D Signal Segmentation Model Builder for AlbUNet and several of its variants developed

1 Nov 15, 2021

Trax — Deep Learning with Clear Code and Speed

Trax — Deep Learning with Clear Code and Speed Trax is an end-to-end library for deep learning that focuses on clear code and speed. It is actively us

7.3k Dec 26, 2022

Dataset for the Research2Clinics @ NeurIPS 2021 Paper: What Do You See in this Patient? Behavioral Testing of Clinical NLP Models

Behavioral Testing of Clinical NLP Models This repository contains code for testing the behavior of clinical prediction models based on patient letter

2 Sep 20, 2022

Semantic segmentation models, datasets and losses implemented in PyTorch.

Semantic Segmentation in PyTorch Semantic Segmentation in PyTorch Requirements Main Features Models Datasets Losses Learning rate schedulers Data augm

1.3k Jan 07, 2023

Source code for the plant extraction workflow introduced in the paper “Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision”

Plant extraction workflow Source code for the plant extraction workflow introduced in the paper "Agricultural Plant Cataloging and Establishment of a

0 Apr 22, 2022

PyTorch implementation of PP-LCNet

PP-LCNet-Pytorch Pre-Trained Models Google Drive p018 Accuracy Models Top1 Top5 PPLCNet_x0_25 0.5186 0.7565 PPLCNet_x0_35 0.5809 0.8083 PPLCNet_x0_5 0

24 Dec 12, 2022

Fastquant - Backtest and optimize your trading strategies with only 3 lines of code!

fastquant 🤓 Bringing backtesting to the mainstream fastquant allows you to easily backtest investment strategies with as few as 3 lines of python cod

1k Dec 29, 2022

CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)

Related tags

Overview

CLIP (Contrastive Language–Image Pre-training)

Experiments (Evaluation)

Overview

Training

Usage

Reference

Owner

Myeongjun Kim

Everything's Talkin': Pareidolia Face Reenactment (CVPR2021)

Projecting interval uncertainty through the discrete Fourier transform

Tensorflow implementation of our method: "Triangle Graph Interest Network for Click-through Rate Prediction".

A script depending on VASP output for calculating Fermi-Softness.

Repo for the Video Person Clustering dataset, and code for the associated paper

RobustVideoMatting and background composing in one model by using onnxruntime.

Configure SRX interfaces with Scrapli

This is a repository of our model for weakly-supervised video dense anticipation.

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos.

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

Machine learning Bot detection technique, based on United States election dataset

Object detection (YOLO) with pytorch, OpenCV and python

Models Supported: AlbUNet [18, 34, 50, 101, 152] (1D and 2D versions for Single and Multiclass Segmentation, Feature Extraction with supports for Deep Supervision and Guided Attention)

Trax — Deep Learning with Clear Code and Speed

Dataset for the Research2Clinics @ NeurIPS 2021 Paper: What Do You See in this Patient? Behavioral Testing of Clinical NLP Models

Semantic segmentation models, datasets and losses implemented in PyTorch.

Source code for the plant extraction workflow introduced in the paper “Agricultural Plant Cataloging and Establishment of a Data Framework from UAV-based Crop Images by Computer Vision”

PyTorch implementation of PP-LCNet

Fastquant - Backtest and optimize your trading strategies with only 3 lines of code!