Speech recognition (ASR) paper selection: TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline
2022-07-06 19:49:00 【My name is Yongqiang】
Statement: These are notes I take while reading papers and share here. Mistakes are inevitable, so please bear with me. I have also collected some resources for easy reference and study: http://yqli.tech/page/speech.html. For a list of papers on speech synthesis, see http://yqli.tech/page/tts_paper.html; for statistics on papers in speech recognition, see http://yqli.tech/page/asr_paper.html. For how to find speech resources, see the article https://mp.weixin.qq.com/s/eJcpsfs3OuhrccJ7_BvKOg. If you repost this, please credit the source. You are welcome to follow the WeChat official account: Keep a low profile.
TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline
This paper was updated by TAL on 2022.06.27. Its main contribution is open-sourcing the largest Mandarin-English mixed training corpus, to support code-switching research in speech recognition.
(Statistics on open-source datasets can be found at http://yqli.tech/page/data.html.)
Because the main work of this paper is open-sourcing the world's largest Mandarin-English mixed dataset, we skip the background and go straight to the data. The dataset consists of audio recorded in TAL English classes and contains mixed Mandarin and English speech. Each audio file has only one speaker, and the dataset has more than 100 speakers (file size 63.36G). The data includes both intra-sentence and inter-sentence mixing, with examples shown in Figure 1. The ratio of Chinese characters to English words in the data is 13:1, and the top 20 are shown in Figure 2. Table 1 shows how the corpus is divided into training and test sets, and Table 2 shows the experimental results obtained with this dataset on ESPnet and WeNet.
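A ratio like the 13:1 above can be measured on any set of transcripts. The following is a minimal sketch, not the authors' procedure: it assumes each transcript is a plain text string, counts one unit per Chinese character and one per run of Latin letters, and labels an utterance "mixed" when both appear. The function names and the sample sentences are made up for illustration.

```python
import re

# Assumption: characters in the basic CJK Unified Ideographs block count as Chinese characters.
HAN = re.compile(r"[\u4e00-\u9fff]")
# Assumption: a run of ASCII letters (with inner apostrophes, e.g. "don't") is one English word.
WORD = re.compile(r"[A-Za-z]+(?:'[A-Za-z]+)*")


def count_units(text):
    """Return (number of Chinese characters, number of English words) in one transcript."""
    return len(HAN.findall(text)), len(WORD.findall(text))


def classify(text):
    """Label one utterance by the languages it contains."""
    han, eng = count_units(text)
    if han and eng:
        return "mixed"      # both languages inside one utterance
    if han:
        return "mandarin"
    if eng:
        return "english"
    return "empty"


def corpus_ratio(lines):
    """Chinese characters per English word over all lines, or None if there are no English words."""
    han_total = 0
    eng_total = 0
    for line in lines:
        han, eng = count_units(line)
        han_total += han
        eng_total += eng
    return han_total / eng_total if eng_total else None


if __name__ == "__main__":
    # Hypothetical transcripts, not taken from the corpus.
    samples = ["这个单词是 apple 对不对", "very good", "我们今天学习新的内容"]
    for s in samples:
        print(classify(s), count_units(s))
    print(corpus_ratio(samples))
```

Utterances labeled "mixed" here correspond roughly to intra-sentence switching; inter-sentence switching would show up as neighbouring "mandarin" and "english" utterances from the same recording.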
Data scale | 587 hours of audio |
Sampling rate | 16 kHz |
Bit depth | 16 bit |
Recording device | Ordinary microphone |
Speakers | 200+ |
Recording time | 2019 |
Data format | Audio: .wav; transcripts: .txt |
Audio length | 1~60 s |
Data type | English teachers' audio |
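After downloading, the figures in the table can be used as a sanity check on individual files. The sketch below is only an illustration under stated assumptions: it uses Python's standard `wave` module, treats 16 kHz, 16 bit, and 1~60 s as the expected values, and the names `check_wav` and `make_silence` are hypothetical. The helper that builds a silent mono clip exists only so the check can be tried without the corpus.

```python
import io
import wave

EXPECTED_RATE = 16000        # 16 kHz sampling rate, from the table
EXPECTED_WIDTH = 2           # 16 bit = 2 bytes per sample
MIN_SEC, MAX_SEC = 1.0, 60.0  # audio length range, from the table


def check_wav(source):
    """Return a list of mismatches for one .wav file (a path or a binary file object)."""
    with wave.open(source, "rb") as wav:
        rate = wav.getframerate()
        width = wav.getsampwidth()
        duration = wav.getnframes() / rate
    problems = []
    if rate != EXPECTED_RATE:
        problems.append(f"sample rate {rate} Hz")
    if width != EXPECTED_WIDTH:
        problems.append(f"sample width {width * 8} bit")
    if not MIN_SEC <= duration <= MAX_SEC:
        problems.append(f"duration {duration:.2f} s")
    return problems


def make_silence(seconds, rate=EXPECTED_RATE, width=EXPECTED_WIDTH):
    """Build an in-memory silent mono .wav clip (test helper, not part of the corpus)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)  # assumption: mono; the table does not state a channel count
        wav.setsampwidth(width)
        wav.setframerate(rate)
        wav.writeframes(b"\x00" * width * int(rate * seconds))
    buf.seek(0)
    return buf


if __name__ == "__main__":
    print(check_wav(make_silence(2.0)))             # matches the table
    print(check_wav(make_silence(2.0, rate=8000)))  # wrong sampling rate
```

For real data, `check_wav` can be called with the path of each `.wav` file; an empty list means the file agrees with the table.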