MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieva

Last update: Aug 24, 2022

Related tags

Deep Learning MARS_TCSVT2021

Overview

Introduction

This is the source code of our TCSVT 2021 paper "MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieval". Please cite the following paper if you use our code.

Yunbo Wang and Yuxin Peng, "MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieval", IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021.

Preparation

We use Python 3.7.2, PyTorch 1.1.0, cuda 9.0, and evaluate on Ubuntu 16.04.12

Install anaconda downloaded from https://repo.anaconda.com/archive. And create a new environment sh Anaconda3-2018.12-Linux-x86_64.sh conda create -n MARS python=3.7.2 conda activate MARS
Run the followed commands conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=9.0 -c pytorch pip install -r requirements.txt

Training and evaluation

We use the Wikipedia dataset as example, and the data is placed in ./datasets/Wiki. In addition, the XMedia&XMediaNet datasets are obtiand via http://59.108.48.34/tiki/XMediaNet/. The NUS-WIDE dataset is obtained via https://lms.comp.nus.edu.sg/wp-content/uploads/2019/research/nuswide/NUS-WIDE.html.

Run the followed command for traning&evaluation, and the configure can be found in main_MARS.py. python main_MARS.py --datasets wiki --output_shape 128 --batch_size 64 --epochs 50 --lr [1e-4, 5e-4] # for Wikipedia

The common representations can be found in folder "features".

For any questions, fell free to contact us. ([email protected])

Welcome to our Laboratory Homepage for more information.

MARS: Learning Modality-Agnostic Representation for Scalable Cross-media Retrieva

Related tags

Overview

Introduction

Preparation

Training and evaluation

Owner

[PAMI 2020] Show, Match and Segment: Joint Weakly Supervised Learning of Semantic Matching and Object Co-segmentation

Retina blood vessel segmentation with a convolutional neural network

A Python library created to assist programmers with complex mathematical functions

Python based Advanced AI Assistant

MEND: Model Editing Networks using Gradient Decomposition

Locationinfo - A script helps the user to show network information such as ip address

Notebooks for my "Deep Learning with TensorFlow 2 and Keras" course

PyTorch Implementation of Vector Quantized Variational AutoEncoders.

The pyrelational package offers a flexible workflow to enable active learning with as little change to the models and datasets as possible

On-device speech-to-intent engine powered by deep learning

Megaverse is a new 3D simulation platform for reinforcement learning and embodied AI research

Automatically Build Multiple ML Models with a Single Line of Code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

NLMpy - A Python package to create neutral landscape models

TensorFlow implementation of "A Simple Baseline for Bayesian Uncertainty in Deep Learning"

Implementation of Heterogeneous Graph Attention Network

Deep and online learning with spiking neural networks in Python

PyTorch implementation of SimSiam: Exploring Simple Siamese Representation Learning

PyTorch implementation of Higher Order Recurrent Space-Time Transformer

Art Project "Schrödinger's Game of Life"

Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI