Speech Recognition Database Management with Python

Overview

The main aim of this project is to take the user's voice as input and convert it into text.

Libraries Used Inside the Project

We used Python's SpeechRecognition module to accomplish this. It works together with modules such as PyAudio, which lets us play and record audio.

We also used the MySQL Connector module to connect our Python program to our MySQL database.
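
As a rough illustration of this connection step, the sketch below opens a connection with MySQL Connector/Python and runs a parameterised query. The helper names connect_to_mysql and fetch_all, the credentials, and the table name are assumptions made for illustration; they are not taken from the project itself.

```python
def connect_to_mysql(host, user, password, database):
    """Open a MySQL connection (requires: pip install mysql-connector-python)."""
    # Imported lazily so the rest of the module loads without the driver.
    import mysql.connector

    return mysql.connector.connect(
        host=host, user=user, password=password, database=database
    )


def fetch_all(connection, sql, params=()):
    """Run one query on any DB-API 2.0 connection and return all rows."""
    cursor = connection.cursor()
    try:
        cursor.execute(sql, params)
        return cursor.fetchall()
    finally:
        # Always release the cursor, even if the query fails.
        cursor.close()


# Example (placeholder credentials, hypothetical table name):
# conn = connect_to_mysql("localhost", "root", "secret", "shop")
# rows = fetch_all(conn, "SELECT * FROM customers WHERE city = %s", ("Pune",))
# conn.close()
```

Passing values through params rather than formatting them into the SQL string keeps spoken input from being interpreted as SQL.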

Libraries Created During the Project

We created a library named MySQLvoice, which helps our Artificial Intelligence manage and organise databases.

The main aim of this module is to select the keywords from the given input. After selecting the keywords, our Artificial Intelligence starts working on the database and provides the required results.
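
The keyword selection step could look roughly like the sketch below. The vocabulary and the keyword-to-SQL table are hypothetical examples, since the project's actual keywords and matching logic are not reproduced in this text; select_keywords and keywords_to_sql are illustrative names, not MySQLvoice functions.

```python
import re

# Hypothetical vocabulary; the project's real keywords are not listed here.
EXAMPLE_KEYWORDS = frozenset({"show", "tables", "databases"})

# Hypothetical mapping from a keyword sequence to a fixed SQL statement.
EXAMPLE_COMMANDS = {
    ("show", "tables"): "SHOW TABLES",
    ("show", "databases"): "SHOW DATABASES",
}


def select_keywords(transcript, vocabulary=EXAMPLE_KEYWORDS):
    """Return the vocabulary words found in the transcript, in spoken order."""
    words = re.findall(r"[a-z]+", transcript.lower())
    return [word for word in words if word in vocabulary]


def keywords_to_sql(keywords):
    """Look up the SQL statement for a keyword sequence, or None if unknown."""
    return EXAMPLE_COMMANDS.get(tuple(keywords))


# Example:
# keywords = select_keywords("Please show me all the tables")
# statement = keywords_to_sql(keywords)
```

Using a fixed lookup table means only known statements ever reach the database; unrecognised phrases simply yield None.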

How We Converted the Voice into Text

To get the user's voice input, we used Python's pre-built SpeechRecognition library. We captured the voice input from the system's microphone and stored it in a variable. We then used the library's recognize function to work out what the user said and stored the result in another variable.

After recognition, we printed the input as text to check the reliability of our program.
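
A minimal sketch of these steps with the SpeechRecognition package follows. It assumes the recognize_google backend (the text does not say which recognize function was used) and a working default microphone; listen_and_transcribe and clean_transcript are illustrative names.

```python
def clean_transcript(text):
    """Lower-case the text and collapse repeated whitespace."""
    return " ".join(text.lower().split())


def listen_and_transcribe(timeout=5):
    """Record one phrase from the default microphone and return it as text.

    Returns None when nothing was heard or the speech was not understood.
    Raises speech_recognition.RequestError if the web service is unreachable.
    Requires: pip install SpeechRecognition PyAudio
    """
    # Imported lazily; this is a third-party package.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        # Calibrate the energy threshold against background noise.
        recognizer.adjust_for_ambient_noise(source)
        try:
            audio = recognizer.listen(source, timeout=timeout)
        except sr.WaitTimeoutError:
            return None
    try:
        # Assumption: Google's web speech API, which needs internet access.
        text = recognizer.recognize_google(audio)
    except sr.UnknownValueError:
        return None
    return clean_transcript(text)


# Example: print what was heard to check the recognition by eye.
# print(listen_and_transcribe())
```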

Description

With the MySQLvoice library, users don't need to know SQL to query or change their database. The library introduces eight new keywords for this purpose.

How to Install and Run the Project

Once the MySQLvoice pip package is uploaded to PyPI, you can run "pip install MySQLvoice" in your terminal to install it on your system. After installing, you can import it in your Python program and start using it.

How to Use the Project

This project is limited to MySQL database operations, but it can be used anywhere in the world for handling databases because it is easy to adapt to regional languages. We currently work mostly in English, but the project can be extended to any spoken language, such as Kannada, Korean, Japanese, Hindi, or Gujarati. It will help non-technical people handle databases with ease.

Advantages

  1. It supports multitasking.

  2. Users don’t need to code.

  3. It can be used in any industry sector that relies on databases.

  4. It saves the user's time, which improves workflow and reduces cost.

Disadvantages

  1. It may stop working during a hardware failure.

  2. Training the speech recognition on data may take time.

  3. Background noise can degrade the quality of the voice input.

  4. Improper pronunciation can affect the voice input.

Future Plans

We hope to add support for regional languages (such as Kannada, Gujarati, and Marathi), which will help non-technical people handle their databases.

We plan to bring this developer tool's features to small-scale industries to improve their productivity through time-saving database handling.

Conclusion

This project will help many industries and businesses, as they will be able to manage and organize their databases with their voice. It will also reduce the workload considerably.

This project is just a small example of Artificial Intelligence-related database management.

This project was jointly created by:

Owner
Abhishek Kumar Jha