A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

Last update: Dec 26, 2022

Related tags

Overview

CLIP4CMR

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

The original data and pre-calculated CLIP features are available at here. The train.pkl and test.pkl include image pixel features and text id features, and the clip_train.pkl and clip_test.pkl include 1024-dimensional image and text features.

Owner

GitHub Repository

Software associated to AAAI paper "Planning with Biological Neurons and Synapses"

jBrain Software associated with the AAAI 2022 paper Francesco D'Amore, Daniel Mitropolsky, Pierluigi Crescenzi, Emanuele Natale, Christos H. Papadimit

1 Apr 10, 2022

An efficient and easy-to-use deep learning model compression framework

TinyNeuralNetwork 简体中文 TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework, which contains features like neura

441 Dec 25, 2022

Code accompanying the paper "ProxyFL: Decentralized Federated Learning through Proxy Model Sharing"

ProxyFL Code accompanying the paper "ProxyFL: Decentralized Federated Learning through Proxy Model Sharing" Authors: Shivam Kalra*, Junfeng Wen*, Jess

14 Dec 06, 2022

TensorFlow implementation of ENet, trained on the Cityscapes dataset.

segmentation TensorFlow implementation of ENet (https://arxiv.org/pdf/1606.02147.pdf) based on the official Torch implementation (https://github.com/e

248 Dec 16, 2022

[Official] Exploring Temporal Coherence for More General Video Face Forgery Detection(ICCV 2021)

Exploring Temporal Coherence for More General Video Face Forgery Detection(FTCN) Yinglin Zheng, Jianmin Bao, Dong Chen, Ming Zeng, Fang Wen Accepted b

57 Dec 28, 2022

IOT: Instance-wise Layer Reordering for Transformer Structures

Introduction This repository contains the code for Instance-wise Ordered Transformer (IOT), which is introduced in the ICLR2021 paper IOT: Instance-wi

19 Nov 15, 2022

SwinTrack: A Simple and Strong Baseline for Transformer Tracking

SwinTrack This is the official repo for SwinTrack. A Simple and Strong Baseline Prerequisites Environment conda (recommended) conda create -y -n SwinT

196 Jan 04, 2023

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

3ds Ghidra Scripts These are ghidra scripts to help with 3ds reverse engineering

7 May 23, 2022

Code for Learning to Segment The Tail (LST)

Learning to Segment the Tail [arXiv] In this repository, we release code for Learning to Segment The Tail (LST). The code is directly modified from th

47 Nov 07, 2022

Supplementary materials for ISMIR 2021 LBD paper "Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes"

Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes Supplementary materials for ISMIR 2021 LBD submission: K. N. W

2 Oct 25, 2021

PyTorch code of my WACV 2022 paper Improving Model Generalization by Agreement of Learned Representations from Data Augmentation

Improving Model Generalization by Agreement of Learned Representations from Data Augmentation (WACV 2022) Paper ArXiv Why it matters? When data augmen

5 Mar 04, 2022

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

Related tags

Overview

CLIP4CMR

Owner

Software associated to AAAI paper "Planning with Biological Neurons and Synapses"

An efficient and easy-to-use deep learning model compression framework

Code accompanying the paper "ProxyFL: Decentralized Federated Learning through Proxy Model Sharing"

TensorFlow implementation of ENet, trained on the Cityscapes dataset.

[Official] Exploring Temporal Coherence for More General Video Face Forgery Detection(ICCV 2021)

IOT: Instance-wise Layer Reordering for Transformer Structures

SwinTrack: A Simple and Strong Baseline for Transformer Tracking

3ds-Ghidra-Scripts - Ghidra scripts to help with 3ds reverse engineering

Code for Learning to Segment The Tail (LST)

Supplementary materials for ISMIR 2021 LBD paper "Evaluation of Latent Space Disentanglement in the Presence of Interdependent Attributes"

PyTorch code of my WACV 2022 paper Improving Model Generalization by Agreement of Learned Representations from Data Augmentation

Code for Multinomial Diffusion

[TIP2020] Adaptive Graph Representation Learning for Video Person Re-identification

Individual Treatment Effect Estimation

ML From Scratch

DL & CV-based indicator toolset for the vehicle drivers via live dash-cam footage.

1st place solution to the Satellite Image Change Detection Challenge hosted by SenseTime

The Official PyTorch Implementation of DiscoBox.

Simple Baselines for Human Pose Estimation and Tracking

On the Limits of Pseudo Ground Truth in Visual Camera Re-Localization