当前位置:网站首页>Voiceprint Technology (II): Fundamentals of audio signal processing
Voiceprint Technology (II): Fundamentals of audio signal processing
2022-06-25 09:05:00 【u013250861】
2.1 To understand voiceprint , Learn audio first
In terms of discipline classification , Voiceprint technology is a branch of speech signal processing , and Speech signal processing belongs to the category of audio signal processing .
Voice signals and sound signal , The difference between the two is :
- Voice signals It refers to the voice with social significance when human beings speak ,
- sound signal It generally refers to all the sounds that human beings can hear . For example, the sound of an instrument , Sounds made by animals , The sound of a car engine , And people snore 、 Sneeze 、 The sound of coughing , These are audio signals in a broad sense , But they are not voice signals , Therefore, it is usually not within the scope of voiceprint technology research .
Many basic concepts and knowledge in audio signal processing , It is very important for learning voiceprint technology .
Any voiceprint system , No matter how advanced the model is , How sophisticated the algorithm is , Can not do without dealing with sound . Only when the correct audio signal is connected , The meaningful feature representation is extracted from it , The latter model can play its role to the greatest extent .
So this chapter , We will specifically and systematically learn these concepts and knowledge related to sound . This chapter covers a wide range , Involving human auditory perception 、 Audio interface 、 Coding technology 、 Discrete signal processing and many other sub fields . At first glance, these sub areas , It doesn't seem to have much to do with each other . However , When we really embark on research or engineering projects in the field of voiceprint , You will find that all the knowledge in these sub domains will inevitably be used . In an enterprise or research institution , Yes
边栏推荐
- Notes on key words in the original English work biography of jobs (III) [chapter one]
- 备战2022年金九银十必问的1000道Android面试题及答案整理,彻底解决面试的烦恼
- 《乔布斯传》英文原著重点词汇笔记(六)【 chapter three 】
- ICer必须知道的35个网站
- [MySQL] understanding of transactions
- flutter 多语言的intl: ^0.17.0导不进去
- Easyplayer streaming media player plays HLS video. Technical optimization of slow starting speed
- 3大问题!Redis缓存异常及处理方案总结
- Jmeter中的断言使用讲解
- 对常用I/O模型进行比较说明
猜你喜欢

【OpenCV】—离散傅里叶变换

Numpy numpy中的meshgrid()函数
![[opencv] - Discrete Fourier transform](/img/03/10ce3d7c5d99ead944b2cae8d0cec0.png)
[opencv] - Discrete Fourier transform

wav文件(波形文件)格式分析与详解

C#程序终止问题CLR20R3解决方法

Make a skylearn high-dimensional dataset_ Circles and make_ moons

WebGL谷歌提示内存不够(RuntimeError:memory access out of bounds,火狐提示索引超出界限(RuntimeError:index out of bounds)

城鏈科技平臺,正在實現真正意義上的價值互聯網重構!

(translation) the use of letter spacing to improve the readability of all capital text

Jmeter中的断言使用讲解
随机推荐
Le labyrinthe des huit diagrammes de la bataille de cazy Chang'an
CSV parameterization in JMeter
Object. Can defineproperty also listen for array changes?
[opencv] - Discrete Fourier transform
《乔布斯传》英文原著重点词汇笔记(三)【 chapter one】
Webgl Google prompt memory out of bounds (runtimeerror:memory access out of bounds, Firefox prompt index out of bounds)
Oracle one line function Encyclopedia
Are the top ten securities companies at great risk of opening accounts and safe and reliable?
声纹技术(一):声纹技术的前世今生
使用Navicat对比多环境数据库数据差异和结构差异,以及自动DML和DDL脚本
从别人库里拷贝的游戏如何再自己的库里显示
华泰证券在上面开股票账户安全吗?
Matplotlib plt grid()
Analysis of a video website m3u8 non perceptual encryption
Notes on key words in the original English work biography of jobs (II) [chapter one]
打新债安全性有多高啊
IC研发常用英文术语缩写
Unknown table 'column of MySQL_ STATISTICS‘ in information_ schema (1109)
QSS buttons of different styles
Specific usage of sklearn polynomialfeatures