Code for "Retrieving Black-box Optimal Images from External Databases" (WSDM 2022)

Last update: Apr 13, 2022

Related tags

Overview

Retrieving Black-box Optimal Images from External Databases (WSDM 2022)

We propose how a user retreives an optimal image from external databases of web services (e.g., Flickr) with respect to user-defined functions (e.g., deep learning-based score functions.)

💿 Dependency

Please install

wget and unzip, e.g., by sudo apt install wget unzip,
PyTorch from the official website, and
other dependencies by pip install -r requirements.txt.

📂 Files

download.sh downloads and preprocesses the Open Image dataset.
environments.py implements wrappers of APIs, i.e., the oracles in the paper.
evaluate.py is the evaluation script.
methods.py implements Tiara, Tiara-S, and baseline methods.
openimage_feature_extract.py preprocess the Open Image dataset. Please run this script after you download images. This script is automatically run by download.sh.
preprocess_openimage.py preprocess the Open Image dataset. Please run this script before you download images. This script is automatically run by download.sh.
utils.py implements miscellaneous functions, i.e., the word embbeding loader.

🗃️ Download and Preprocess Datasets

$ bash ./download.sh

🧪 Evaluation

Try with Open Image datasets by

$ python evaluate.py --env open --verbose --num_seeds 1 -c 0

The results are saved in outputs directiory.

Please refer to the help command for further options.

$ python evaluate.py -h
usage: evaluate.py [-h] [--tuning] [--extra] [--env {open,flickr,flickrsim}]
                   [--num_seeds NUM_SEEDS] [--budget BUDGET]
                   [--api_key API_KEY] [--api_secret API_SECRET]
                   [--font_path FONT_PATH] [--verbose]
                   [-c [CLASSES [CLASSES ...]]]

optional arguments:
  -h, --help            show this help message and exit
  --tuning
  --extra
  --env {open,flickr,flickrsim}
  --num_seeds NUM_SEEDS
  --budget BUDGET
  --api_key API_KEY     API key for Flickr.
  --api_secret API_SECRET
                        API secret key for Flickr.
  --font_path FONT_PATH
                        Font path for wordclouds.
  --verbose
  -c [CLASSES [CLASSES ...]], --classes [CLASSES [CLASSES ...]]

Flickr API

The Flickr experiments require a Flickr API key. Please get a key from Flickr official website.

🖋️ Citation

@inproceedings{sato2022retrieving,
  author    = {Ryoma Sato},
  title     = {Retrieving Black-box Optimal Images from External Databases},
  booktitle = {Proceedings of the Fifteenth {ACM} International Conference on Web Search and Data Mining, {WSDM}},
  year      = {2022},
}

Code for "Retrieving Black-box Optimal Images from External Databases" (WSDM 2022)

Related tags

Overview

Retrieving Black-box Optimal Images from External Databases (WSDM 2022)

💿 Dependency

📂 Files

🗃️ Download and Preprocess Datasets

🧪 Evaluation

Flickr API

🖋️ Citation

Owner

joisino

Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.

Human segmentation models, training/inference code, and trained weights, implemented in PyTorch

dyld_shared_cache processing / Single-Image loading for BinaryNinja

Code for the paper "On the Power of Edge Independent Graph Models"

This repository contains PyTorch code for Robust Vision Transformers.

Implementation of "Semi-supervised Domain Adaptive Structure Learning"

AAAI 2022: Stationary diffusion state neural estimation

Source Code of NeurIPS21 paper: Recognizing Vector Graphics without Rasterization

This repository contains code for the paper "Disentangling Label Distribution for Long-tailed Visual Recognition", published at CVPR' 2021

Scripts and outputs related to the paper Prediction of Adverse Biological Effects of Chemicals Using Knowledge Graph Embeddings.

A PyTorch implementation of "Graph Wavelet Neural Network" (ICLR 2019)

Autotype on websites that have copy-paste disabled like Moodle, HackerEarth contest etc.

Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" https://arxiv.org/abs/2201.13433

Principled Detection of Out-of-Distribution Examples in Neural Networks

Classifying cat and dog images using Kaggle dataset

Code for KiloNeRF: Speeding up Neural Radiance Fields with Thousands of Tiny MLPs

PyTorch implementation of PSPNet

CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

The implementation our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".

Blind visual quality assessment on 360° Video based on progressive learning