GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled

Last update: Nov 24, 2021

Related tags

Overview

Guidedog

Authors: Kyuhee Jo, Steven Gunarso, Jacky Wang, Raghav Sharma

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled. You may as well think of it as "speaking guide dog," as the name suggests. It has three key features based on the scene captured by your mobile phone:

Reads text upon command
Describes the scene around you upon command
Warns you if there is an obstacle in front of you

Check out this demo video to learn more about our app!

Android App

UI/UX
- Simple and Responsive
- Voice Assistant architecture for targeted audience
Libraries / APIs
- GC Speech-to-text and Text-to-Speech
- Android SDK , androidX
- ML Kit object detection and tracking api
- TensorFlow Lite MobileNet Image Classification Model

Backend

Flask API
- Image Captioning
- Optical Character Recognition
Deployment
- Google App Engine
- fast central API with different endpoints

Image Captioning

We used tensorflow to build and train model for image captioning on MS-COCO 2014 based on the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. The model uses standard convolutional network as an encoder to extract features from images (we use Inception V3) and feed the generated features into an attention-based decoder generate sentences. While the paper used LSTM model as a decoder, we use a simpler RNN instead.

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled

Related tags

Overview

Guidedog

Android App

Backend

Image Captioning

Get more insights : Devpost

Owner

Kyuhee Jo

An official implementation of "SFNet: Learning Object-aware Semantic Correspondence" (CVPR 2019, TPAMI 2020) in PyTorch.

Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification

Repo for the Tutorials of Day1-Day3 of the Nordic Probabilistic AI School 2021 (https://probabilistic.ai/)

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

Introduction to AI assignment 1 HCM University of Technology, term 211

Continual Learning of Long Topic Sequences in Neural Information Retrieval

A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)

Face-Recognition-Attendence-System - This face recognition Attendence system using Python

code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"

Implementation of ICCV 2021 oral paper -- A Novel Self-Supervised Learning for Gaussian Mixture Model

The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

⚖️🔁🔮🕵️‍♂️🦹🖼️ Code for Measuring the Contribution of Multiple Model Representations in Detecting Adversarial Instances paper.

Code for the paper BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

The King is Naked: on the Notion of Robustness for Natural Language Processing

Simple Pixelbot for Diablo 2 Resurrected written in python and opencv.

YOLOv5🚀 reproduction by Guo Quanhao using PaddlePaddle

GLM (General Language Model)

CARL provides highly configurable contextual extensions to several well-known RL environments.

VACA: Designing Variational Graph Autoencoders for Interventional and Counterfactual Queries

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled

Related tags

Overview

Guidedog

Android App

Backend

Image Captioning

Get more insights : Devpost

Owner

Kyuhee Jo

An official implementation of "SFNet: Learning Object-aware Semantic Correspondence" (CVPR 2019, TPAMI 2020) in PyTorch.

Next-Best-View Estimation based on Deep Reinforcement Learning for Active Object Classification

Repo for the Tutorials of Day1-Day3 of the Nordic Probabilistic AI School 2021 (https://probabilistic.ai/)

The codes and related files to reproduce the results for Image Similarity Challenge Track 2.

Introduction to AI assignment 1 HCM University of Technology, term 211

Continual Learning of Long Topic Sequences in Neural Information Retrieval

A Simple Framwork for CV Pre-training Model (SOCO, VirTex, BEiT)

Face-Recognition-Attendence-System - This face recognition Attendence system using Python

code for our BMVC 2021 paper "HCV: Hierarchy-Consistency Verification for Incremental Implicitly-Refined Classification"

Implementation of ICCV 2021 oral paper -- A Novel Self-Supervised Learning for Gaussian Mixture Model

The Wearables Development Toolkit - a development environment for activity recognition applications with sensor signals

⚖️🔁🔮🕵️‍♂️🦹🖼️ Code for *Measuring the Contribution of Multiple Model Representations in Detecting Adversarial Instances* paper.

Code for the paper BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks

Digitalizing-Prescription-Image - PIRDS - Prescription Image Recognition and Digitalizing System is a OCR make with Tensorflow

The King is Naked: on the Notion of Robustness for Natural Language Processing

Simple Pixelbot for Diablo 2 Resurrected written in python and opencv.

YOLOv5🚀 reproduction by Guo Quanhao using PaddlePaddle

GLM (General Language Model)

CARL provides highly configurable contextual extensions to several well-known RL environments.

VACA: Designing Variational Graph Autoencoders for Interventional and Counterfactual Queries

⚖️🔁🔮🕵️‍♂️🦹🖼️ Code for Measuring the Contribution of Multiple Model Representations in Detecting Adversarial Instances paper.