Neural Caption Generator with Attention

Last update: Nov 30, 2022

Overview

Neural Caption Generator with Attention

Tensorflow implementation of "Show, attend and Tell" http://arxiv.org/abs/1502.03044
Borrowed most of the idea from the author's source code https://github.com/kelvinxu/arctic-captions

Code

make_flickr_dataset.py: Extracts conv5_3 layer activations of VGG Network for flickr30k images, and save them in 'data/feats.npy'
model_tensorflow.py: Main codes

Usage

Download flickr30k Dataset.
Extract VGG conv5_3 features using make_flickr_dataset.py
Train: run train() in model_tensorflow.py
Test: run test() in model_tensorflow.py

Owner

Taeksoo Kim

GitHub Repository

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

NeRF-pytorch NeRF (Neural Radiance Fields) is a method that achieves state-of-the-art results for synthesizing novel views of complex scenes. Here are

3.2k Jan 08, 2023

An open source library for face detection in images. The face detection speed can reach 1000FPS.

libfacedetection This is an open source library for CNN-based face detection in images. The CNN model has been converted to static variables in C sour

11.4k Dec 27, 2022

This repository is for Contrastive Embedding Distribution Refinement and Entropy-Aware Attention Network (CEDR)

CEDR This repository is for Contrastive Embedding Distribution Refinement and Entropy-Aware Attention Network (CEDR) introduced in the following paper

3 Feb 27, 2022

KGDet: Keypoint-Guided Fashion Detection (AAAI 2021)

KGDet: Keypoint-Guided Fashion Detection (AAAI 2021) This is an official implementation of the AAAI-2021 paper "KGDet: Keypoint-Guided Fashion Detecti

35 Dec 29, 2022

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks.

145 Jan 01, 2023

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

ReDet: A Rotation-equivariant Detector for Aerial Object Detection ReDet: A Rotation-equivariant Detector for Aerial Object Detection (CVPR2021), Jiam

334 Dec 23, 2022

Bald-to-Hairy Translation Using CycleGAN

GANiry: Bald-to-Hairy Translation Using CycleGAN Official PyTorch implementation of GANiry. GANiry: Bald-to-Hairy Translation Using CycleGAN, Fidan Sa

10 Oct 27, 2022

Graph Representation Learning via Graphical Mutual Information Maximization

GMI (Graphical Mutual Information) Graph Representation Learning via Graphical Mutual Information Maximization (Peng Z, Huang W, Luo M, et al., WWW 20

93 Dec 29, 2022

Reusable constraint types to use with typing.Annotated

annotated-types PEP-593 added typing.Annotated as a way of adding context-specific metadata to existing types, and specifies that Annotated[T, x] shou

125 Dec 26, 2022

Voice control for Garry's Mod

WIP: Talonvoice GMod integrations Very work in progress voice control demo for Garry's Mod. HOWTO Install https://talonvoice.com/ Press https://i.imgu

5 Nov 15, 2022

Official implementation of Rethinking Graph Neural Architecture Search from Message-passing (CVPR2021)

Rethinking Graph Neural Architecture Search from Message-passing Intro The GNAS can automatically learn better architecture with the optimal depth of

48 Sep 30, 2022

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning

4.2k Dec 29, 2022

Stochastic Tensor Optimization for Robot Motion - A GPU Robot Motion Toolkit

STORM Stochastic Tensor Optimization for Robot Motion - A GPU Robot Motion Toolkit [Install Instructions] [Paper] [Website] This package contains code

101 Dec 12, 2022

Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition (NeurIPS 2019)

MLCR This is the source code for paper Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition. Xuesong Niu, Hu Han, Shiguang

60 Nov 29, 2022

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation This the repository for this paper. Find extensions of this w

14 Oct 26, 2022

A curated list of awesome Machine Learning frameworks, libraries and software.

Awesome Machine Learning A curated list of awesome machine learning frameworks, libraries and software (by language). Inspired by awesome-php. If you

57.1k Jan 03, 2023

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Recurrent Fast Weight Programmers This is the official repository containing the code we used to produce the experimental results reported in the pape

36 Nov 15, 2022

Detectron2 for Document Layout Analysis

Detectron2 trained on PubLayNet dataset This repo contains the training configurations, code and trained models trained on PubLayNet dataset using Det

163 Nov 21, 2022

Code for our paper: Online Variational Filtering and Parameter Learning

Variational Filtering To run phi learning on linear gaussian (Fig1a) python linear_gaussian_phi_learning.py To run phi and theta learning on linear g

16 Aug 14, 2022

Spatial Transformer Nets in TensorFlow/ TensorLayer

MOVED TO HERE Spatial Transformer Networks Spatial Transformer Networks (STN) is a dynamic mechanism that produces transformations of input images (or

36 Nov 23, 2022

Neural Caption Generator with Attention

Related tags

Overview

Neural Caption Generator with Attention

Code

Usage

Owner

Taeksoo Kim

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

An open source library for face detection in images. The face detection speed can reach 1000FPS.

This repository is for Contrastive Embedding Distribution Refinement and Entropy-Aware Attention Network (CEDR)

KGDet: Keypoint-Guided Fashion Detection (AAAI 2021)

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

Official code of the paper "ReDet: A Rotation-equivariant Detector for Aerial Object Detection" (CVPR 2021)

Bald-to-Hairy Translation Using CycleGAN

Graph Representation Learning via Graphical Mutual Information Maximization

Reusable constraint types to use with typing.Annotated

Voice control for Garry's Mod

Official implementation of Rethinking Graph Neural Architecture Search from Message-passing (CVPR2021)

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Stochastic Tensor Optimization for Robot Motion - A GPU Robot Motion Toolkit

Multi-label Co-regularization for Semi-supervised Facial Action Unit Recognition (NeurIPS 2019)

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

A curated list of awesome Machine Learning frameworks, libraries and software.

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

Detectron2 for Document Layout Analysis

Code for our paper: Online Variational Filtering and Parameter Learning

Spatial Transformer Nets in TensorFlow/ TensorLayer