ASR Paper Picks: The World's Largest Open-Source Mandarin-English Code-Switching Dataset, TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech
2022-07-06 11:48:00 【我叫永强】
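Disclaimer: these are notes I take while reading papers and share here. They will inevitably contain mistakes, so please bear with me. I have collected some resources for reference and study: http://yqli.tech/page/speech.html. For a list of speech synthesis papers, see http://yqli.tech/page/tts_paper.html; for statistics on speech recognition papers, see http://yqli.tech/page/asr_paper.html. For how to find speech resources, see https://mp.weixin.qq.com/s/eJcpsfs3OuhrccJ7_BvKOg. If you repost this article, please credit the source. You are welcome to follow the WeChat official account 低调奋进.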
TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline
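This paper from TAL (好未来), updated on 2022-06-27, open-sources the largest Mandarin-English code-switching training corpus as a contribution to code-switching research in speech recognition.
(For statistics on open-source datasets, see http://yqli.tech/page/data.html)
Since the paper's main contribution is releasing the world's largest open-source Mandarin-English code-switching dataset, I skip the background and go straight to the dataset itself. The corpus consists of audio recorded in TAL's English classes, in which the speech mixes Mandarin and English. Each recording contains a single speaker, and the text of the paper puts the number of speakers at more than 100 (the specification table below lists 200+). The files total 63.36 GB. The data contains both intra-sentential and inter-sentential code-switching, as in the examples in Figure 1. The ratio of Chinese characters to English words in the data is 13:1, and the top 20 are shown in Figure 2. Table 1 shows how the corpus is split into training and test sets, and Table 2 shows the experimental results obtained with this dataset on ESPnet and WeNet.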
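As a rough illustration of how a character-to-word ratio like the 13:1 figure could be measured, here is a minimal sketch. It assumes transcripts are available as plain strings; the function names, the regular expressions, and the sample sentences are all hypothetical and are not taken from the paper or the corpus.

```python
import re

# Assumption: the basic CJK Unified Ideographs block covers the Mandarin text.
HAN_CHAR = re.compile(r"[\u4e00-\u9fff]")
# Assumption: an English word is a run of ASCII letters, with optional apostrophes inside.
EN_WORD = re.compile(r"[A-Za-z]+(?:'[A-Za-z]+)*")


def count_units(text):
    """Return (number of Chinese characters, number of English words) in text."""
    return len(HAN_CHAR.findall(text)), len(EN_WORD.findall(text))


def is_code_switched(text):
    """True if a single utterance contains both Chinese characters and English words."""
    zh, en = count_units(text)
    return zh > 0 and en > 0


def char_word_ratio(transcripts):
    """Ratio of Chinese characters to English words over a list of transcripts."""
    zh_total = en_total = 0
    for line in transcripts:
        zh, en = count_units(line)
        zh_total += zh
        en_total += en
    return zh_total / en_total if en_total else float("inf")


# Made-up example transcripts, not taken from the corpus.
samples = ["我们今天学习 apple 这个单词", "请大家跟我读", "very good"]
print(char_word_ratio(samples))
print([is_code_switched(s) for s in samples])
```

Counting characters on the Chinese side and whole words on the English side mirrors the units used in the ratio above; a different tokenization would give a different number.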
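Data size | 587 hours of audio |
Sampling rate | 16 kHz |
Bit depth | 16-bit |
Recording device | Ordinary microphone |
Speakers | 200+ |
Recording period | 2019 |
Data format | Audio: .wav; transcripts: .txt |
Audio length | 1~60 s |
Data type | Audio of teachers lecturing in English classes |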
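The specifications above can be checked against a downloaded file with Python's standard `wave` module. The sketch below is only an illustration: the function name `check_wav` and its default thresholds are my own choices based on the table, and the demo runs on a synthetic silent mono file because no corpus path or directory layout is assumed here.

```python
import os
import tempfile
import wave


def check_wav(path, rate=16000, sample_width=2, min_sec=1.0, max_sec=60.0):
    """Return a list of mismatches against the expected format (empty if none)."""
    problems = []
    with wave.open(path, "rb") as wf:
        if wf.getframerate() != rate:
            problems.append(f"sample rate {wf.getframerate()} != {rate}")
        # Sample width is in bytes, so 2 bytes corresponds to 16-bit audio.
        if wf.getsampwidth() != sample_width:
            problems.append(f"sample width {wf.getsampwidth()} != {sample_width}")
        duration = wf.getnframes() / wf.getframerate()
        if not (min_sec <= duration <= max_sec):
            problems.append(f"duration {duration:.2f}s outside [{min_sec}, {max_sec}]")
    return problems


# Demo on a synthetic 2-second silent mono file (an assumption for illustration only).
demo_path = os.path.join(tempfile.mkdtemp(), "demo.wav")
with wave.open(demo_path, "wb") as wf:
    wf.setnchannels(1)
    wf.setsampwidth(2)
    wf.setframerate(16000)
    wf.writeframes(b"\x00\x00" * 16000 * 2)

print(check_wav(demo_path))
```

Returning a list of problems instead of raising on the first mismatch makes it easy to scan a whole directory and report every file that deviates from the table.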