Introduction to opensmile and notes on problems encountered during installation
2022-07-23 01:13:00 【Wsyoneself】
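- Definition: a tool for extracting audio features
- Advantages:
  - Features can be extracted incrementally as new data arrives
  - Almost all intermediate data generated during feature extraction (windowed audio data, spectra, etc.) can be accessed and saved to file for visualization or further processing
- Characteristics:
  - Supported file formats: WAV audio files; CSV text data files (comma-separated values, spreadsheet format); ARFF files as used by the Weka data mining toolkit; HTK (Hidden Markov Toolkit) parameter files; and a simple binary float matrix format for binary feature data
  - For compatibility reasons, the official binaries are not built with ffmpeg support, so audio stored in compressed formats (such as mp3, mp4, ogg) must be converted to uncompressed WAVE format before opensmile can analyze it. The alternative is to build opensmile yourself with ffmpeg support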
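The conversion step mentioned above can be scripted. The following is a minimal sketch, assuming the `ffmpeg` command-line tool is installed separately and on the PATH. The helper names `ffmpeg_to_wav_command` and `convert_to_wav` are hypothetical, and the choice of mono 16 kHz 16-bit PCM output is an assumption for illustration, not something opensmile requires.

```python
import shutil
import subprocess
from pathlib import Path


def ffmpeg_to_wav_command(src, dst=None, rate=16000):
    """Build the ffmpeg argument list that decodes `src` to uncompressed WAV."""
    src = Path(src)
    # Default output: same name as the input, with a .wav suffix.
    dst = Path(dst) if dst else src.with_suffix(".wav")
    return [
        "ffmpeg",
        "-y",                 # overwrite the output file if it exists
        "-i", str(src),       # input file (mp3, mp4, ogg, ...)
        "-ac", "1",           # assumption: downmix to mono
        "-ar", str(rate),     # assumption: resample to `rate` Hz
        "-c:a", "pcm_s16le",  # uncompressed 16-bit little-endian PCM
        str(dst),
    ]


def convert_to_wav(src, dst=None, rate=16000):
    """Run ffmpeg and return the path of the WAV file it wrote."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    cmd = ffmpeg_to_wav_command(src, dst, rate)
    subprocess.run(cmd, check=True)  # raises CalledProcessError on failure
    return Path(cmd[-1])
```

Calling `convert_to_wav("clip.mp3")` would write `clip.wav` next to the input, which can then be passed to opensmile. Building the command in a separate function makes it easy to inspect before anything is executed.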
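- Application areas: speech recognition (feature-extraction front end, keyword spotting, etc.), affective computing (emotion recognition), music information retrieval (chord recognition, beat tracking)
- Variants:
  - The C++ version: installation requires cloning the source and building it with CMake, and it is then used through command-line commands. (The author's use case is calling it from Python, so this route was not tried.)
  - The Python module (the installation steps and the problems encountered are described below)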
- Installation:
    pip install opensmile
- Errors encountered:
  - ERROR: Cannot uninstall 'PyYAML'. It is a distutils installed project and thus we cannot accurately determine which files belong to it which would lead to only a partial uninstall.
    - Fix: append --ignore-installed PyYAML to the install command, i.e.
        pip install opensmile --ignore-installed PyYAML
  - ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
      wandb 0.12.18 requires GitPython>=1.0.0, which is not installed.
      datasets 2.2.2 requires aiohttp, which is not installed.
      datasets 2.2.2 requires multiprocess, which is not installed.
      datasets 2.2.2 requires pyarrow>=6.0.0, which is not installed.
      pytorch-lightning 0.9.0 requires tensorboard==2.2.0, but you have tensorboard 2.9.1 which is incompatible.
      ltp 4.1.5.post2 requires transformers<=4.7.0,>=4.0.0, but you have transformers 4.19.4 which is incompatible.
      huggingface-hub 0.7.0 requires packaging>=20.9, but you have packaging 20.4 which is incompatible.
      datasets 2.2.2 requires fsspec[http]>=2021.05.0, but you have fsspec 0.8.3 which is incompatible.
    - Fix (in practice, install whatever the error message reports as missing):
        pip install wandb
        pip install pytorch-lightning
        pip install --user datasets
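When the resolver message is long, it can help to pull out just the package names before deciding what to install. The sketch below is one possible way to do that, assuming the message follows the two sentence shapes seen in the log above ("... which is not installed." and "... but you have ... which is incompatible."). The names `summarize_conflicts` and `bare_name` are hypothetical helpers, not part of pip, and the code only reads the message; it does not install or change anything.

```python
import re

# One pattern per sentence shape found in pip's conflict message.
MISSING = re.compile(r"(\S+) (\S+) requires (\S+), which is not installed\.")
INCOMPATIBLE = re.compile(
    r"(\S+) (\S+) requires (\S+), but you have (\S+) (\S+) which is incompatible\."
)


def bare_name(requirement):
    """Strip version specifiers and extras: 'fsspec[http]>=2021.05.0' -> 'fsspec'."""
    return re.split(r"[<>=!~\[;]", requirement, maxsplit=1)[0]


def summarize_conflicts(message):
    """Return (missing, incompatible) package names found in the message."""
    missing = [bare_name(m.group(3)) for m in MISSING.finditer(message)]
    incompatible = [m.group(4) for m in INCOMPATIBLE.finditer(message)]
    return missing, incompatible


# A shortened copy of the log above, used as sample input.
message = (
    "wandb 0.12.18 requires GitPython>=1.0.0, which is not installed. "
    "datasets 2.2.2 requires pyarrow>=6.0.0, which is not installed. "
    "pytorch-lightning 0.9.0 requires tensorboard==2.2.0, "
    "but you have tensorboard 2.9.1 which is incompatible. "
    "datasets 2.2.2 requires fsspec[http]>=2021.05.0, "
    "but you have fsspec 0.8.3 which is incompatible."
)

missing, incompatible = summarize_conflicts(message)
print(missing)       # ['GitPython', 'pyarrow']
print(incompatible)  # ['tensorboard', 'fsspec']
```

The first list names packages that are absent altogether, and the second names packages that are installed at a version another package rejects. The two cases call for different pip commands, so separating them is a reasonable first step.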
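- Test program:
    # Set up a feature extractor for the functionals of a predefined feature set
    import opensmile

    smile = opensmile.Smile(
        feature_set=opensmile.FeatureSet.eGeMAPSv02,  # feature set
        feature_level=opensmile.FeatureLevel.Functionals,  # feature level, e.g. low-level descriptors or functionals
    )
    smile.feature_names  # outputs the names of the features in the selected set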
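The test program above only lists feature names. As a hedged next step, the sketch below extracts actual feature values from a file using the `process_file` method of the `opensmile` Python package. Since no recording accompanies this post, it first writes a synthetic sine tone with the standard library; the helper names `write_test_tone` and `extract_functionals`, the file name, and the tone parameters are all assumptions made for illustration.

```python
import math
import struct
import wave


def write_test_tone(path, freq_hz=220.0, seconds=1.0, rate=16000):
    """Write a mono 16-bit PCM sine tone so there is a WAV file to analyse."""
    n_frames = int(seconds * rate)
    samples = (
        int(0.5 * 32767 * math.sin(2 * math.pi * freq_hz * i / rate))
        for i in range(n_frames)
    )
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)     # mono
        wav.setsampwidth(2)     # 2 bytes per sample = 16-bit
        wav.setframerate(rate)  # samples per second
        wav.writeframes(b"".join(struct.pack("<h", s) for s in samples))
    return n_frames


def extract_functionals(path):
    """Return a pandas DataFrame of eGeMAPSv02 functionals for one WAV file."""
    import opensmile  # imported lazily so the tone helper works without it

    smile = opensmile.Smile(
        feature_set=opensmile.FeatureSet.eGeMAPSv02,
        feature_level=opensmile.FeatureLevel.Functionals,
    )
    return smile.process_file(path)
```

With opensmile installed, `write_test_tone("tone.wav")` followed by `extract_functionals("tone.wav")` should return a DataFrame whose columns match `smile.feature_names`. A pure tone is not meaningful speech, so the values only confirm that the installation works.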
