ConferencingSpeech 2021 challenge

This repository contains the datasets list and scripts required for the ConferencingSpeech challenge. For more details about the challenge, please see our website.

Details

baseline, this folder contains baseline system include inference model exported by onnx and inference scripts;
eval, this folder contains evaluation scripts to calculate PESQ, STOI and SI-SNR;
selected_lists, the selected wave about train speech and noise wave name from aishell-1, aishell-3, librispeech-360, VCTK, MUSAN, Audioset. Each participant is only allowed to use the selected speech and noise data below :
- selected_lists/dev/circle.name circle RIR wave utt name of dev set
- selected_lists/dev/linear.name linear RIR wave utt name of dev set
- selected_lists/dev/non_uniform.name non uniform linear RIR wave utt name of dev set
- selected_lists/dev/clean.name wave utt name of dev set used clean set
- selected_lists/dev/noise.name wave utt name of dev set used noise set
- selected_lists/train/aishell_1.name wave utt name from aishell-1 set used in train set
- selected_lists/train/aishell_3.name wave utt name from aishell-3 set used in train set
- selected_lists/train/librispeech_360.name wave utt name from librispeech-360 set used in train set
- selected_lists/train/vctk.name wave utt name from VCTK set used in train set
- selected_lists/train/audioset.name wave utt name from Audioset used in train set
- selected_lists/train/musan.name wave utt name from MUSAN used in train set
- selected_lists/train/circle.name circle wave RIR name of train set
- selected_lists/train/linear.name linear wave RIR name of train set
- selected_lists/train/non_uniform.name non unifrom linear RIR utt name of train set
simulation, about simulation scripts, how to use to see ReadMe
- simulation/mix_wav.py simulate dev set and train set
- simulation/prepare.sh use selected_lists/*/*name to select used wave from downloaded raw data, or you can select them by yourself scripts.
- simulation/quick_select.py quickly select the name by a name list instead of grep -r -f
- simulation/challenge_rirgenerator.py the script to simulate RIRs in train and dev set
- simulation/data/dev_circle_simu_mix.config dev circle set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point
- simulation/data/dev_linear_simu_mix.config dev linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point
- simulation/data/dev_non_uniform_linear_simu_mix.config dev non uniform linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point
- simulation/data/train_simu_circle.config train circle set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.
- simulation/data/train_simu_linear.config train linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.
- simulation/data/train_simu_non_uniform.config train non uniform linear set simulation setup, include clean wave, noise wave, rir wave, snr, volume scale, start point; please download it from dropbox.
requirements.txt, dependency

Notes:

1. \*.config file should be replaced with correct path of audio files.
2. Training config files have been released together with challenge data.

Requirements

python3.6 or above

pip install -r requirements.txt

if you simulation RIRs by yourself with our scripts, you may better install this:

pyrirgen

Code license

Apache 2.0

Conferencing Speech Challenge

Related tags

Overview

ConferencingSpeech 2021 challenge

Details

Requirements

Code license

Owner

DaisyXmusic ❤ A bot that can play music on Telegram Group and Channel Voice Chats

Sync Toolbox - Python package with reference implementations for efficient, robust, and accurate music synchronization based on dynamic time warping (DTW)

Telegram Bot to play music in VoiceChat with Channel Support and autostarts Radio.

A python program to cut longer MP3 files (i.e. recordings of several songs) into the individual tracks.

Terminal-based music player written in Python for the best music in the world 🎵 🎧 💻

GNOME powered sound conversion

A collection of free MIDI chords and progressions ready to be used in your DAW, Akai MPC, or Roland MC-707/101

Minimal command-line music player written in Python

𝙰 𝙼𝚞𝚜𝚒𝚌 𝙱𝚘𝚝 𝙲𝚛𝚎𝚊𝚝𝚎𝚍 𝙱𝚢 𝚃𝚎𝚊𝚖𝙳𝚕𝚝 💖

Free and Open Source Channel/Group Voice chat music player for telegram with button support saavn playback support.

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Delta TTA(Text To Audio) SoftWare

Noinoi music is smoothly playing music on voice chat of telegram.

Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.

Speech recognition module for Python, supporting several engines and APIs, online and offline.

A telegram bot for which is help to play songs in vc 🥰 give 🌟 and fork this repo before use 😏

This bot can stream audio or video files and urls in telegram voice chats

A python library for working with praat, textgrids, time aligned audio transcripts, and audio files.

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

A simple music player, powered by Python, utilising various libraries such as Tkinter and Pygame