当前位置:网站首页>语音识别(ASR)论文优选:全球最大的中英混合开源数据TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech
语音识别(ASR)论文优选:全球最大的中英混合开源数据TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech
2022-07-06 11:48:00 【我叫永强】
声明:平时看些文章做些笔记分享出来,文章中难免存在错误的地方,还望大家海涵。搜集一些资料,方便查阅学习:http://yqli.tech/page/speech.html。语音合成领域论文列表请访问http://yqli.tech/page/tts_paper.html,语音识别领域论文统计请访问http://yqli.tech/page/asr_paper.html。如何查找语音资料请参考文章https://mp.weixin.qq.com/s/eJcpsfs3OuhrccJ7_BvKOg)。如有转载,请注明出处。欢迎关注微信公众号:低调奋进。
TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline
本文是好未来在2022.06.27更新的文章,主要开源最大的中英混合训练语料,为语音识别的Code-switching方向研究做贡献。
(开源数据统计可参见http://yqli.tech/page/data.html)
由于本文主要工作是开源全球最大的中英混合数据,我们就不再介绍背景,直接查看数据集的情况。该数据集为好未来英语课授课音频,包含中英文混合讲话的情况,每条音频只有一位说话人,该数据集有100多说话人。(文件63.36G)该数据包含了如图1所示的句内和句间混合的样例。该数据中的中文汉字和英文单词之间的比例为13:1,其中top 20如图2所示。table 1展示了语库的训练集合测试集的划分情况,table 2展示使用该数据集在espnet和wenet上的实验结果。
| 数据规模 | 587小时音频 |
| 采样率 | 16KHz |
| 采样位声 | 16bit |
| 录制设备 | 普通麦克风 |
| 说话人 | 200+ |
| 录制时间 | 2019年 |
| 数据格式 | 音频:.wav;标注结果:.txt |
| 音频长度 | 1~60s |
| 数据类型 | 英语课教师授课音频 |



边栏推荐
- 激进技术派 vs 项目保守派的微服务架构之争
- Hudi vs Delta vs Iceberg
- CF960G - Bandit Blues(第一类斯特林数+OGF)
- [translation] linkerd's adoption rate in Europe and North America exceeded istio, with an increase of 118% in 2021.
- Translation D28 (with AC code POJ 26:the nearest number)
- LeetCode_格雷编码_中等_89.格雷编码
- Teach you to learn JS prototype and prototype chain hand in hand, a tutorial that monkeys can understand
- Dark horse -- redis
- Looting iii[post sequence traversal and backtracking + dynamic planning]
- It's super detailed in history. It's too late for you to read this information if you want to find a job
猜你喜欢

Information System Project Manager - Chapter VIII project quality management

Spark foundation -scala
![[translation] micro survey of cloud native observation ability. Prometheus leads the trend, but there are still obstacles to understanding the health of the system](/img/63/3addcecb69dcb769c4736653952f66.png)
[translation] micro survey of cloud native observation ability. Prometheus leads the trend, but there are still obstacles to understanding the health of the system

学习探索-无缝轮播图
In depth analysis, Android interview real problem analysis is popular all over the network
Classic 100 questions of algorithm interview, the latest career planning of Android programmers

The "white paper on the panorama of the digital economy" has been released with great emphasis on the digitalization of insurance

Hudi vs Delta vs Iceberg

Transformer model (pytorch code explanation)

Configuration and simple usage of the EXE backdoor generation tool quasar
随机推荐
Transformer model (pytorch code explanation)
Configuration and simple usage of the EXE backdoor generation tool quasar
Swiftui game source code Encyclopedia of Snake game based on geometryreader and preference
激进技术派 vs 项目保守派的微服务架构之争
Learning and Exploration - function anti shake
Carte de réflexion + code source + notes + projet, saut d'octets + jd + 360 + tri des questions d'entrevue Netease
How to customize animation avatars? These six free online cartoon avatar generators are exciting at a glance!
Example of applying fonts to flutter
C # - realize serialization with Marshall class
面试突击63:MySQL 中如何去重?
Black Horse - - Redis Chapter
Hudi vs Delta vs Iceberg
POJ 3207 Ikki's Story IV – Panda's Trick (2-SAT)
理解 YOLOV1 第二篇 预测阶段 非极大值抑制(NMS)
The "white paper on the panorama of the digital economy" has been released with great emphasis on the digitalization of insurance
USB host driver - UVC swap
[玩转Linux] [Docker] MySQL安装和配置
IC设计流程中需要使用到的文件
Elastic search indexes are often deleted [closed] - elastic search indexes gets deleted frequently [closed]
保证接口数据安全的10种方案