当前位置:网站首页>语音识别-基础(一):简介【语音转文本】
语音识别-基础(一):简介【语音转文本】
2022-06-30 10:24:00 【u013250861】

一、什么是语音识别
语音识别,通常称为自动语音识别(AutomaticSpeechRecognition,ASR),主要是将人类语音中的词汇内容转换为计算机可读的输入,一般都是可以理解的文本内容,也有可能是二进制编码或者字符序列。但是,我们一般理解的语音识别其实都是狭义的语音转文字的过程,简称 语音转文本识别(Speech-To-Text,STT)更合适,这样就能与 语音合成(Text-To-Speech,TTS)对应起来。

参考资料:
语音识别(一):简介
边栏推荐
猜你喜欢

Cp2112 teaching example of using USB to IIC communication

Anhui "requirements for design depth of Hefei fabricated building construction drawing review" was printed and distributed; Hebei Hengshui city adjusts the pre-sale license standard for prefabricated

Deep dive kotlin synergy (16): Channel

LVGL 8.2 Image

DQN笔记

Every time I look at my colleagues' interface documents, I get confused and have a lot of problems...

电化学氧气传感器寿命、工作原理及应用介绍

Rejuvenated Dell and apple hit each other, and the two old PC enterprises declined rapidly

The two e-commerce bigwigs' lacy news screens represent the return of e-commerce to normal, which will be beneficial to the real economy

Use keil5 software to simulate and debug gd32f305 from 0
随机推荐
IDEA 又出新神器,一套代码适应多端!
Q-Learning笔记
Skill combing [email protected] somatosensory manipulator
同事的接口文档我每次看着就头大,毛病多多。。。
LVGL 8.2 Drop down in four directions
59 websites programmers need to know
05_ Node JS file management module FS
iptables目标TPROXY
[STL source code analysis] iterator
程序员需知的 59 个网站
Android 开发面试真题进阶版(附答案解析)
小程序中读取腾讯文档的表格数据
Mysql database foundation: views and variables
Gd32 RT thread flash driver function
File sharing server
MySQL导出sql脚本文件
Skill sorting [email protected]+adxl345+ Motor vibration + serial port output
Gd32 RT thread PWM drive function
LVGL 8.2 re-coloring
Qt之实现QQ天气预报窗体翻转效果