The PyTorch implementation for "Video-Text Pre-training with Learned Regions"

Last update: Mar 20, 2022

Overview

Region_Learner

The PyTorch implementation for "Video-Text Pre-training with Learned Regions" (arXiv)

We are still cleaning up the code and preparing the pre-trained weights.

Preparation

Overall, this code is built on PyTorch with DistributedDataParallel (DDP).

Create a conda environment and install the required packages via sh install_env.sh
Create the folders the scripts expect:
1. mkdir data (you can symlink huge datasets to this folder)
2. mkdir results
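
The folder setup above can be sketched as a small re-runnable snippet. This is only an illustration, run from the repository root: DATASET_ROOT is a hypothetical variable introduced here (it is not defined anywhere in the repository), and the symlink step is skipped when it is unset.

```shell
# Create the two folders named in the steps above; -p makes this safe to re-run.
mkdir -p data results

# Assumption: DATASET_ROOT is a placeholder for wherever a large dataset
# already lives on disk. It is not a variable used by the repository scripts.
DATASET_ROOT="${DATASET_ROOT:-}"
if [ -n "$DATASET_ROOT" ] && [ -d "$DATASET_ROOT" ]; then
    # Link the dataset into data/ instead of copying it.
    ln -sfn "$DATASET_ROOT" "data/$(basename "$DATASET_ROOT")"
fi
```

For example, running it as DATASET_ROOT=/path/to/your/dataset sh setup_dirs.sh (setup_dirs.sh being a hypothetical file name for the snippet) would leave a link under data/ named after the last component of that path.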

Finetuning (on MSR-VTT)

Download data (see https://github.com/m-bain/frozen-in-time#-finetuning-benchmarks-msr-vtt)
Run sh finetune.sh

Pre-training

Download WebVid-2M (see https://github.com/m-bain/webvid)
Download CC-3M (see https://ai.google.com/research/ConceptualCaptions/download)
Run sh pre-training.sh

Pre-trained Weights

Coming soon.

Acknowledgements

This code is based on Frozen in Time.

Owner

Rui Yan

Computer Vision
