Speech recognition (ASR) paper selection: TALCS: an open-source Mandarin-English code-switching corpus and a speech
2022-07-06 19:49:00 【My name is Yongqiang】
Statement: I usually read papers, take notes, and share them. Mistakes are inevitable, so please bear with me. I have also collected some resources for easy lookup and study: http://yqli.tech/page/speech.html. For a list of papers in the field of speech synthesis, see http://yqli.tech/page/tts_paper.html; for statistics on papers in the field of speech recognition, see http://yqli.tech/page/asr_paper.html. For how to find speech resources, see the article https://mp.weixin.qq.com/s/eJcpsfs3OuhrccJ7_BvKOg. If you repost this, please credit the source. You are welcome to follow the WeChat official account: Keep a low profile.
TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline
This paper was updated by TAL on 2022.06.27. It mainly open-sources the largest Mandarin-English code-switching training corpus, as a contribution to code-switching research in speech recognition.
(Open-source data statistics can be found at http://yqli.tech/page/data.html)
Because the main contribution of this paper is open-sourcing the world's largest Mandarin-English code-switching dataset, we skip the background and look directly at the data. The dataset consists of audio from TAL English classes and contains mixed Mandarin and English speech. Each audio file has only one speaker, and the dataset has more than 100 speakers (file size 63.36G). The data includes both intra-sentence and inter-sentence code-switching; examples are shown in Figure 1. The ratio of Chinese characters to English words in the data is 13:1, and the top 20 are shown in Figure 2. Table 1 shows the split of the corpus into training and test sets, and Table 2 shows the results of experiments with this dataset on ESPnet and WeNet.
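To make the 13:1 figure concrete, here is a minimal sketch of how a character-to-word ratio of this kind could be computed from transcript text. The counting rules are assumptions, not the paper's definition: a Chinese character is taken to be any code point in the basic CJK block, and an English word is a run of ASCII letters. The sample lines are made up and are not taken from TALCS.

```python
import re

# Assumption: "Chinese character" = code point in the basic CJK Unified Ideographs block.
HAN = re.compile(r"[\u4e00-\u9fff]")
# Assumption: "English word" = run of ASCII letters, with internal apostrophes allowed.
WORD = re.compile(r"[A-Za-z]+(?:'[A-Za-z]+)*")


def count_units(text):
    """Return (chinese_characters, english_words) for one transcript line."""
    return len(HAN.findall(text)), len(WORD.findall(text))


def corpus_ratio(lines):
    """Chinese-character to English-word ratio over an iterable of transcript lines."""
    han_total = 0
    word_total = 0
    for line in lines:
        han, words = count_units(line)
        han_total += han
        word_total += words
    # With no English words at all, the ratio is unbounded.
    return han_total / word_total if word_total else float("inf")


# Hypothetical code-switched lines for illustration only.
sample = ["我们今天学习 apple 这个单词", "Good morning 同学们"]
print(count_units(sample[0]))
print(corpus_ratio(sample))
```

Running the same function over all transcript files of a corpus would give a single corpus-level ratio; whether that matches 13:1 depends on how the authors tokenized the text.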
| Item | Value |
| --- | --- |
| Data scale | 587 hours of audio |
| Sampling rate | 16 kHz |
| Bit depth | 16 bit |
| Recording device | Ordinary microphone |
| Speakers | 200+ |
| Recording time | 2019 |
| Data format | Audio: .wav; annotations: .txt |
| Audio length | 1~60 s |
| Data type | English teachers' audio |
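The properties in the table above are easy to check on a downloaded file. The following is a rough sketch using Python's standard `wave` module. The function name `check_wav` and its default thresholds are hypothetical, chosen only to mirror the table (16 kHz, 16 bit, 1 to 60 s); the channel count is not checked because the table does not state it.

```python
import wave


def check_wav(path, rate=16000, sample_width=2, min_s=1.0, max_s=60.0):
    """Return a list of mismatches between a .wav file and the expected spec.

    sample_width is in bytes, so 2 corresponds to 16-bit audio.
    An empty list means the file matches every property that is checked.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        actual_rate = wav.getframerate()
        actual_width = wav.getsampwidth()
        duration = wav.getnframes() / actual_rate
    if actual_rate != rate:
        problems.append(f"sample rate {actual_rate} Hz, expected {rate} Hz")
    if actual_width != sample_width:
        problems.append(f"sample width {actual_width} bytes, expected {sample_width}")
    if not (min_s <= duration <= max_s):
        problems.append(f"duration {duration:.2f} s outside {min_s}-{max_s} s")
    return problems
```

Calling `check_wav` on each audio file and collecting the non-empty results would give a quick sanity report before feeding the data to a training recipe.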


