当前位置:网站首页>Speech breakpoint detection (short time improved subband spectral entropy)
Speech breakpoint detection (short time improved subband spectral entropy)
2022-06-21 22:42:00 【qq-120】
1. Audio analysis
1. Output speech segmentation time point information , The time point is expressed in milliseconds ;
2. Split the speech into multiple wav file ;
Endpoint detection : Determine the time starting point and ending point of the sentence , Ignore a small number of non voice frames in the middle ,
For speech recognition .(Speech Endpoint Detection)
Entropy is a quantity that reflects information measurement in information theory . The greater the randomness of a random event ,
That is, the higher the uncertainty , The greater the entropy , So the more information you carry .
This operation adopts Spectral entropy method End point detection for voice .
2. Spectral entropy method


3. Preprocessing

4. Double threshold method endpoint detection

5. experimental result





Handle PHONE_001.wav Information obtained
(1)time.csv: Segment information for voice ;
(2)PHONE_001_vad.wav: For voice VAD After processing , Speech segment synthetic wav;
(3)segmentation Folder : It is the speech of each segment after speech segmentation ;
(4)main_VAD.m: The main function ;
(5)vad.m: It is the endpoint detection function of double threshold method ;
(6)houzhichuli.m: Is the interval length decision function ;
(7)frame2time.m: As a function of time for a frame ;
边栏推荐
- About Eureka starting successfully but accessing 404
- Six possible challenges when practicing Devops
- FPGA之道——FPGA开发流程之项目方案与FPGA设计方案
- Verilog参数例化时自动计算位宽的函数
- 左手代码,右手开源,开源路上的一份子
- 牛客月賽-環上食蟲
- 必讀書籍
- KVM virtual machine rescue mode modifying root password -- the road to building a dream
- Second understanding microservice
- The way of FPGA -- interface level standard between digital systems
猜你喜欢

力扣刷題集結4(mysql版本)

Apache shardingsphere 5.1.2 release | new driving API + cloud native deployment to create a high-performance data gateway

更好的管理各种音乐,专业的DJ音乐管理软件Pioneer DJ rekordbox

Nacos安装指南

Games101 job 7- detailed explanation of implementation steps of multi thread speed up

左手代码,右手开源,开源路上的一份子

The way of FPGA -- project scheme and FPGA design scheme of FPGA development process

promise错误捕获处理——Promisifying技术

语音信号处理之多阶MFCC提取(matlab)

翻译软件Bob安装教程
随机推荐
C# 报错:未通过等待任务或访问任务的 Exception 属性观察到任务的异常。因此,终结器线程重新引发了未观察到的异常。
电脑屏幕分辨率怎么调?电脑屏幕修改分辨率SwitchResX
力扣刷题集结4(mysql版本)
2022-06-21:golang选择题,以下golang代码输出什么?A:3;B:4;C:100;D:编译失败。 package main import (
翻译软件Bob安装教程
小程序如何关联微信小程序二维码,并实现二码合一聚合
分布式数据库使用逻辑卷管理存储之扩容
Uwp confirms whether there is pop-up display
WPF 路由
Pi4j GPIO pin pull-up resistance, pull-down resistance concept
关于lg(n!)的渐进紧确界
WPF ListBox虚拟化
pyenv安装anaconda修改清华源
力扣:零钱兑换
Matlab2020a how to export exe using app Designer
Use the for loop to calculate n! Value of
力扣刷題集結4(mysql版本)
Electronic bidding procurement mall system: optimize traditional procurement business and speed up enterprise digital upgrading
WPF dependent properties
.bmp图片的文件头解析