Voiceprint Technology (III): voiceprint recognition technology
2022-06-25 09:05:00 【u013250861】
3.1 Voiceprint recognition: the core of voiceprint technology
3.1.1 Name and concept
Broadly speaking, voiceprint technology is an umbrella concept that covers many different technologies and applications. Among them, voiceprint recognition is the foundation of all the others. Whether it is the voiceprint segmentation and clustering introduced in Chapter 5, or the voiceprint-based speech synthesis, speech separation, and voice activity detection introduced in Chapter 6, none of them can work without a voiceprint recognition model. That model may be pre-trained, or it may be obtained through joint training. This chapter is therefore the most important, core chapter of the book.
Voiceprint recognition, also known as speaker recognition, goes by several names in English: voice recognition, speaker recognition, voiceprint recognition, and talker recognition all refer to the same concept, namely the technique of distinguishing the speech of different speakers according to the identity of the speaker.
Voiceprint recognition must be distinguished from speech recognition. Speech recognition converts a speech signal into text content; in most cases it does not care who the speaker is, and it needs to be robust to different speakers' voices [1]. Voiceprint recognition, especially text-independent voiceprint recognition, is the opposite: it must identify the speaker regardless of the text being spoken. In this sense, speech recognition and voiceprint recognition can be regarded as two mutually "orthogonal" problems. Speech recognition aims to filter out the information related to the speaker's identity and keep only the text information, whereas voiceprint recognition aims to filter out the information related to the text and keep only the speaker's identity information.
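The "orthogonal" relationship can be illustrated with a toy sketch. The example below is only an illustration under stated assumptions: the speaker names, the sentences, and the helper `group_by` are all hypothetical, and plain labels stand in for real audio. It shows the same four utterances being grouped along two independent axes: by text, which is what speech recognition cares about, and by speaker, which is what voiceprint recognition cares about.

```python
from collections import defaultdict

# Hypothetical toy data: each utterance is a (speaker, text) pair.
# Real systems work on audio; labels are used here only for illustration.
utterances = [
    ("alice", "turn on the light"),
    ("bob", "turn on the light"),
    ("alice", "what time is it"),
    ("bob", "what time is it"),
]


def group_by(items, key_index):
    """Group tuples by the field at position key_index."""
    groups = defaultdict(list)
    for item in items:
        groups[item[key_index]].append(item)
    return dict(groups)


# Speech recognition view: utterances with the same text belong together,
# no matter who spoke them.
by_text = group_by(utterances, 1)

# Voiceprint recognition view: utterances from the same speaker belong
# together, no matter what was said.
by_speaker = group_by(utterances, 0)

for text, items in by_text.items():
    print("text:", text, "-> speakers:", sorted(s for s, _ in items))
for speaker, items in by_speaker.items():
    print("speaker:", speaker, "-> texts:", sorted(t for _, t in items))
```

Every text group contains both speakers and every speaker group contains both texts, which is why a feature that is useful for one task is a nuisance variable for the other.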
As mentioned in Section 1.1, everyone's ...