A Sign Language detection project using Mediapipe landmark detection and Tensorflow LSTM's

Last update: Feb 06, 2022

Overview

sign-language-detection

A Sign Language detection project using Mediapipe landmark detection and Tensorflow LSTM. The project is built for a vocabulary of 3 words, but more can be added by collecting and adding data for other words.

Vocabulary

Open
to
Work

Output

Disclaimer

Colab doesn't detect webcam and you can't use it for mediapipe detection and dataset collection through webcam so most of that was done locally and then training and inference using Tensorflow was performed on Colab.

You can uncomment the commented part if you wish to do all that locally. In my case, I had some clash between mediapipe and tensorflow on the ARM architecture m1 mac.

The notebook uses the approach to Sign Language Detection by Nicholas Renotte, of course with a whole bunch of tweaks to suit my usecase 🙂

Tweaks:

Input and output in the form of videos to work with colab.
Remove face landmarks as they end up just being noise.
Use tanh activation as it works way better with LSTMs compared to relu.
Colors and Cosmetics.
Disclaimer at bottom.
Different threshold value for inference.

A Sign Language detection project using Mediapipe landmark detection and Tensorflow LSTM's

Related tags

Overview

sign-language-detection

Vocabulary

Output

Disclaimer

Tweaks:

Owner

Hashim

Repo for EchoVPR: Echo State Networks for Visual Place Recognition

[ICLR2021] Unlearnable Examples: Making Personal Data Unexploitable

Pytorch implementation of YOLOX、PPYOLO、PPYOLOv2、FCOS an so on.

DanceTrack: Multiple Object Tracking in Uniform Appearance and Diverse Motion

Python package to generate image embeddings with CLIP without PyTorch/TensorFlow

Official implementation of the ICCV 2021 paper "Joint Inductive and Transductive Learning for Video Object Segmentation"

Implementation of Artificial Neural Network Algorithm

Implicit Model Specialization through DAG-based Decentralized Federated Learning

Histocartography is a framework bringing together AI and Digital Pathology

CV backbones including GhostNet, TinyNet and TNT, developed by Huawei Noah's Ark Lab.

PyTorch Implementation of Region Similarity Representation Learning (ReSim)

Generalized Data Weighting via Class-level Gradient Manipulation

ECCV2020 paper: Fashion Captioning: Towards Generating Accurate Descriptions with Semantic Rewards. Code and Data.

disentanglement_lib is an open-source library for research on learning disentangled representations.

PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing"

A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition"

[MICCAI'20] AlignShift: Bridging the Gap of Imaging Thickness in 3D Anisotropic Volumes

PassAPI is a password generator in hash format and fully developed in Python, with the aim of teaching how to handle and build

Deep Inside Convolutional Networks - This is a caffe implementation to visualize the learnt model

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.