当前位置:网站首页>About vctk datasets
About vctk datasets
2022-07-01 01:19:00 【Wsyoneself】
- download vctk Data sets ( Download path :https://datashare.ed.ac.uk/download/DS_10283_3443.zip)
- vctk Data set understanding :
- CSTR VCTK The corpus includes 110 Voice data from English speakers using different accents . Each speaker reads about 400 A sentence , These sentences are from a newspaper 、rainbow The article and an inspirational paragraph for voice stress files .
- The text is selected according to the greedy algorithm , Greedy algorithm can increase context and voice coverage .
- All voice data are recorded using the same recording settings : An omnidirectional microphone (DPA 4035) And a small diaphragm condenser microphone , The bandwidth is very wide (Sennheiser MKH 800), Sampling frequency is 96kHz,24 position , Located in the semi anechoic room of the University of Edinburgh .
- All records are converted to 16 position , Downsampling to 48 kHz
- This corpus was originally used to base on HMM Text to speech synthesis system , Especially based on speaker adaptation HMM Voice synthesis of , The synthesis uses the average speech model of multiple speakers and speaker adaptation technology . The corpus is also applicable to DNN Multi spoken human language synthesis system and waveform modeling .** The idea here and PCA The idea of extracting face features and averaging faces to synthesize a given face is similar **
- VCTK There are several variants of corpus :
- Voice enhancement : For training speech enhancement algorithms and TTS Model noise speech database , Audio is artificially directed to VCTK Various types of noise are added :http://dx.doi.org/10.7488/ds/2117
- Reverberation voice database , Used to train speech de reverberation algorithm and TTS Model ,VCTK Various types of reverberation have been artificially added in http://dx.doi.org/10.7488/ds/1425
- For training speech enhancement algorithms and TTS Model noise reverberation speech database http://dx.doi.org/10.7488/ds/2139
- Equipment records VCTK, among VCTK The speech signal of the corpus is played back , And use relatively cheap consumer equipment to re record in the office environment http://dx.doi.org/10.7488/ds/2316
- Microsoft Scalable noisy speech data set (MS-SNSD)https://github.com/microsoft/MS-SNSD
- ASV And anti deception :
- Deception and anti deception (SAS) corpus , It is a collection of synthetic speech signals produced by nine technologies , Two of them are speech synthesis , Seven are voice conversion . All of these are using VCTK Corpus construction .http://dx.doi.org/10.7488/ds/252
- Automated speaker verification deception and countermeasure challenges (ASVspoof 2015) database . The database is composed of synthetic speech signals generated by ten technologies , It has been used in the first automatic speaker verification deception and challenge confrontation (ASVspoof 2015)http://dx.doi.org/10.7488/ds/298
- ASVspoof 2019: The third automatic speaker verification deception and countermeasure challenge database . The database has been used for the third automatic speaker verification deception and countermeasure challenge (ASVspoof 2019)https://doi.org/10.7488/ds/2555
- To use the corpus, you need to add references :
Christophe Veaux, Junichi Yamagishi, Kirsten MacDonald, "CSTR VCTK Corpus: English Multi-speaker Corpus for CSTR Voice Cloning Toolkit", The Centre for Speech Technology Research (CSTR), University of Edinbur
边栏推荐
猜你喜欢

孔乙己第一问之服务通信知多少?

Get to know the drawing component of flutter - custompaint

The longest selling mobile phone in China has been selling well since its launch, crushing iphone12

Win11安装redis 数据库以及redis desktop manager的下载

Impact relay zc-23/dc220v

Locking relay ydb-100, 100V

Dx-11q signal relay

dc_labs--lab1的学习与总结

5. TPM module initialization

A letter to 5000 fans!
随机推荐
JS to convert numbers into Chinese characters for output
酒旅板块复苏,亚朵继续上市梦,距离“新住宿经济第一股“还有多远?
人穷志不短,穷学生也能玩转树莓派
闭锁继电器YDB-100、100V
[learning notes] double + two points
关于VCTK数据集
For the first time in more than 20 years! CVPR best student thesis awarded to Chinese college students!
Install redis database and download redis Desktop Manager in win11
Sword finger offer 19 Regular Expression Matching
集群与LVS介绍及原理解析
Shift operators
Training discipline principle of robot programming
The real topic of the 11th provincial competition of Bluebridge cup 2020 - crop hybridization
Mustache syntax
What is the difference between Pipeline and Release Pipeline in azure devops?
友盟(软件异常实时监听的好帮手:Crash)接入教程(有点基础的小白最易学的教程)
Problem solving: how to manage thread_local pointer variables
ASCII、Unicode、GBK、UTF-8之间的关系
Principes de formation de la programmation robotique
探索互联网时代STEAM教育创新之路