当前位置:网站首页>Speech recognition (ASR) paper selection: talcs: an open source Mandarin English code switching corps and a speech
Speech recognition (ASR) paper selection: talcs: an open source Mandarin English code switching corps and a speech
2022-07-06 19:49:00 【My name is Yongqiang】
Statement : Usually read some articles, take some notes and share them , There are inevitably mistakes in the article , I hope you will have a better understanding of Haihan . Collect some information , It's easy to check and learn :http://yqli.tech/page/speech.html. For a list of papers in the field of speech synthesis, please visit http://yqli.tech/page/tts_paper.html, For the statistics of papers in the field of speech recognition, please visit http://yqli.tech/page/asr_paper.html. How to find voice information, please refer to the article https://mp.weixin.qq.com/s/eJcpsfs3OuhrccJ7_BvKOg). If reproduced , Please indicate the source . Welcome to WeChat official account. : Keep a low profile .
TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline
This article is tal in 2022.06.27 Updated articles , Mainly open source the largest Chinese English mixed training corpus , For speech recognition Code-switching Contribute to research .
( Open source data statistics can be found in http://yqli.tech/page/data.html)
Because the main work of this paper is to open source the world's largest Chinese English mixed data , We will not introduce the background , View the data set directly . This data set is the audio of Tal English class , Including mixed Chinese and English speech , There is only one speaker per audio , The dataset has 100 More speakers .( file 63.36G) The data includes the following figure 1 Examples of intrasentence and inter sentence mixing shown . The ratio between Chinese characters and English words in this data is 13:1, among top 20 Pictured 2 Shown .table 1 It shows the division of the training set and test set of the corpus ,table 2 Show how to use this data set in espnet and wenet The results of the experiment on .
Data scale | 587 Hour audio |
Sampling rate | 16KHz |
Sampling bit sound | 16bit |
Recording devices | Ordinary microphone |
The speaker | 200+ |
Recording time | 2019 year |
data format | Audio :.wav; Mark the results :.txt |
Audio length | 1~60s |
data type | English teacher's audio |
边栏推荐
- 算法面试经典100题,Android程序员最新职业规划
- 【翻译】云原生观察能力微调查。普罗米修斯引领潮流,但要了解系统的健康状况仍有障碍...
- VMware virtual machine cannot open the kernel device "\.\global\vmx86"
- Information System Project Manager - Chapter VIII project quality management
- 【翻译】供应链安全项目in-toto移至CNCF孵化器
- Logstash expressway entrance
- Vscode debug run fluent message: there is no extension for debugging yaml. Should we find yaml extensions in the market?
- 121. 买卖股票的最佳时机
- Mysql Information Schema 学习(一)--通用表
- [translation] linkerd's adoption rate in Europe and North America exceeded istio, with an increase of 118% in 2021.
猜你喜欢
MySQL information schema learning (II) -- InnoDB table
Blue Bridge Cup microbial proliferation C language
Leetcode 30. Concatenate substrings of all words
学习打卡web
redisson bug分析
学习探索-无缝轮播图
蓝桥杯 微生物增殖 C语言
理解 YOLOV1 第二篇 预测阶段 非极大值抑制(NMS)
系统性详解Redis操作Hash类型数据(带源码分析及测试结果)
How to access localhost:8000 by mobile phone
随机推荐
Appx代码签名指南
LeetCode_双指针_中等_61. 旋转链表
利用 clip-path 绘制不规则的图形
学习探索-使用伪元素清除浮动元素造成的高度坍塌
Li Kou 101: symmetric binary tree
Lick the dog until the last one has nothing (simple DP)
Cesium 点击绘制圆形(动态绘制圆形)
Configuration and simple usage of the EXE backdoor generation tool quasar
深入浅出,面试突击版
爬虫(14) - Scrapy-Redis分布式爬虫(1) | 详解
Reflection and illegalaccessexception exception during application
The "white paper on the panorama of the digital economy" has been released with great emphasis on the digitalization of insurance
Standardized QCI characteristics
logstash高速入口
腾讯T3手把手教你,真的太香了
Alibaba data source Druid visual monitoring configuration
LeetCode_ Double pointer_ Medium_ 61. rotating linked list
(3) Web security | penetration testing | basic knowledge of network security construction, IIS website construction, EXE backdoor generation tool quasar, basic use of
mod_wsgi + pymssql通路SQL Server座
腾讯字节等大厂面试真题汇总,网易架构师深入讲解Android开发