WeKws

Production First and Production Ready End-to-End Keyword Spotting Toolkit.

The goal of this toolkit is to...

Small-footprint keyword spotting (KWS), or more specifically wake-up word (WuW) detection, is a typical and important module in Internet of Things (IoT) devices. It gives users a hands-free way to control IoT devices. A WuW detection system usually runs locally and persistently on the device, so it needs low power consumption, few model parameters, and low computational complexity. It must also detect the predefined keyword in a streaming fashion, i.e., with low latency.
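To make the streaming requirement concrete, here is a minimal sketch of how a detector might consume per-frame keyword scores one at a time and report a detection as soon as a smoothed score crosses a threshold. It is not WeKws code: the `StreamingDetector` class, the `detect` function, the window size, and the threshold are all hypothetical, and the scores are assumed to come from some acoustic model that emits one posterior per frame.

```python
from collections import deque
from typing import Iterable, Iterator


class StreamingDetector:
    """Hypothetical helper: smooths per-frame keyword posteriors over a short window."""

    def __init__(self, window: int = 3, threshold: float = 0.8) -> None:
        # Only the last `window` scores are kept, so memory use stays constant.
        self.scores = deque(maxlen=window)
        self.threshold = threshold

    def push(self, posterior: float) -> bool:
        """Consume one frame score and return True if the keyword is detected."""
        self.scores.append(posterior)
        if len(self.scores) < self.scores.maxlen:
            return False  # wait until the smoothing window is full
        smoothed = sum(self.scores) / len(self.scores)
        return smoothed >= self.threshold


def detect(posteriors: Iterable[float], window: int = 3,
           threshold: float = 0.8) -> Iterator[int]:
    """Yield the index of every frame at which a detection would be reported."""
    detector = StreamingDetector(window, threshold)
    for index, posterior in enumerate(posteriors):
        if detector.push(posterior):
            yield index


if __name__ == "__main__":
    # Made-up scores for illustration only.
    print(list(detect([0.1, 0.2, 0.9, 0.95, 0.9, 0.2])))
```

Because each call to `push` touches only a fixed-size window, the decision for a frame is available as soon as that frame arrives, which is the low-latency property described above.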

Typical Scenario

We plan to support the following typical wake-up word applications:

Single wake-up word
Multiple wake-up words
Customizable wake-up word
Personalized wake-up word, i.e., a combination of wake-up word detection and voiceprint

Installation

Clone the repo

git clone https://github.com/wenet-e2e/wekws.git

Install Conda: please see https://docs.conda.io/en/latest/miniconda.html
Create Conda env:

conda create -n wenet python=3.8
conda activate wenet
pip install -r requirements.txt
conda install pytorch=1.10.0 torchaudio=0.10.0 cudatoolkit=11.1 -c pytorch -c conda-forge

Dataset

We plan to support a variety of open source wake-up word datasets, including but not limited to:

All well-trained models on these datasets will be made publicly available.

Runtime

We plan to support a variety of hardware and platforms, including:

Web browser
x86
Android
Raspberry Pi
