当前位置:网站首页>Speech breakpoint detection (short time improved subband spectral entropy)

Speech breakpoint detection (short time improved subband spectral entropy)

2022-06-21 22:42:00 qq-120

1. Audio analysis

1. Output speech segmentation time point information , The time point is expressed in milliseconds ;
2. Split the speech into multiple wav file ;

Endpoint detection : Determine the time starting point and ending point of the sentence , Ignore a small number of non voice frames in the middle ,
For speech recognition .(Speech Endpoint Detection)

Entropy is a quantity that reflects information measurement in information theory . The greater the randomness of a random event ,
That is, the higher the uncertainty , The greater the entropy , So the more information you carry .
This operation adopts Spectral entropy method End point detection for voice .

2. Spectral entropy method

 Insert picture description here
 Insert picture description here

3. Preprocessing

 Insert picture description here

4. Double threshold method endpoint detection

 Insert picture description here

5. experimental result

 Insert picture description here
 Insert picture description here
 Insert picture description here
 Insert picture description here
 Insert picture description here

Handle PHONE_001.wav Information obtained
(1)time.csv: Segment information for voice ;
(2)PHONE_001_vad.wav: For voice VAD After processing , Speech segment synthetic wav;
(3)segmentation Folder : It is the speech of each segment after speech segmentation ;
(4)main_VAD.m: The main function ;
(5)vad.m: It is the endpoint detection function of double threshold method ;
(6)houzhichuli.m: Is the interval length decision function ;
(7)frame2time.m: As a function of time for a frame ;

原网站

版权声明
本文为[qq-120]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/172/202206212048573602.html