当前位置:网站首页>Clip +json parsing converts the sound in the video into text
Clip +json parsing converts the sound in the video into text
2022-07-06 22:26:00 【fangye945a】
Preface
Sometimes we want to convert the audio in a video into text , The simplest and dumbest way is to listen through human ears , Then type it out . But this method is undoubtedly very laborious . Smart friends must know that we can do this with the help of speech recognition tools , But the more realistic point is , Many identification software is not charged , That is, the recognition accuracy is relatively low . So what can we do to convert the audio in the video into text without spending money and accurately ?
resolvent
Friends with editing experience must know , Cut and reflect The subtitle recognition function of this software , Behind it is ByteDance speech recognition technology , The recognition accuracy of Putonghua is extremely high . But one thing is , The subtitles he recognizes are segmented , Copy them one by one to form a complete text , Its workload is also a lot .
that , Here is a simple method :
Specific operation steps
- 1. download windows Version clipping
- 2. Import video
- 3. Identify subtitles ( After importing the video , Click on the text -> Smart subtitles -> Just start to recognize )

- 4. Wait a while ( The length of time depends on the size of the video file ), Subtitles will be automatically generated .
- 5. After generating subtitles , Close clip , Will be automatically saved as a draft . At this time, open the clip , The box of draft parameters appears in the upper right corner , In the box, we can find the path to save the draft .

- 6. stay windows Open the save location in Explorer , You can see the following documents .

- 7. It is not difficult to find out according to the English name ,draft_content.json Is our draft content , The subtitle text just generated is also saved in this json In file .
- 8. Since it is json file , The next step is naturally json Parsed , We can extract the subtitle content from this file .
- 9. analysis json file , Find the fields to parse , simplified json The structure is as follows :
{"materials":{"texts":[{"content":"<font id=\"\" path=\"D:/Program Files/JianyingPro/3.0.5.8542/Resources/Font/SystemFont/zh-hans.ttf\"><color=(1.000000, 1.000000, 1.000000, 1.000000)><size=5.000000> Hello, students </size></color></font>"}]}}
And we need subtitles , By three xml Element label wrap .
- 10. Let's go through json Parse get content The contents of the field , And then eliminate by string processing xml label , Get the final text subtitle content .
Here is a simple one I wrote python Script , It can be easily and quickly processed to clip this one with subtitles json file :
import json
if __name__ == '__main__':
with open("draft_content.json", "r", encoding='utf-8') as f:
json_str = f.read()
json_content = json.loads(json_str)
texts_data = json_content["materials"]["texts"]
for text_data in texts_data:
content = text_data["content"]
content = content[:-22]
result = content.split("><size=")[1].split(">")[1]
print(result)
- 11. Directly connect the script to json The files are placed in the same directory , perform python The script can get the subtitle content you need .

We can also redirect symbols , Directly generate the output content into a file
python3 parse_jianyin.py >> output.txt

边栏推荐
- Insert sort and Hill sort
- Applet system update prompt, and force the applet to restart and use the new version
- How does the uni admin basic framework close the creation of super administrator entries?
- 硬件开发笔记(十): 硬件开发基本流程,制作一个USB转RS232的模块(九):创建CH340G/MAX232封装库sop-16并关联原理图元器件
- 3DMax指定面贴图
- 枚举与#define 宏的区别
- MySQL----初识MySQL
- Daily question 1: force deduction: 225: realize stack with queue
- labelimg的安装与使用
- The SQL response is slow. What are your troubleshooting ideas?
猜你喜欢

Crawler obtains real estate data

Barcodex (ActiveX print control) v5.3.0.80 free version

硬件開發筆記(十): 硬件開發基本流程,制作一個USB轉RS232的模塊(九):創建CH340G/MAX232封裝庫sop-16並關聯原理圖元器件

2021 geometry deep learning master Michael Bronstein long article analysis

AI enterprise multi cloud storage architecture practice | Shenzhen potential technology sharing

在IPv6中 链路本地地址的优势

2020 Bioinformatics | GraphDTA: predicting drug target binding affinity with graph neural networks

Web APIs DOM 时间对象

二叉(搜索)树的最近公共祖先 ●●

基于 QEMUv8 搭建 OP-TEE 开发环境
随机推荐
NetXpert XG2帮您解决“布线安装与维护”难题
图像的spatial domain 和 frequency domain 图像压缩
墨西哥一架飞往美国的客机起飞后遭雷击 随后安全返航
Shortcut keys in the terminal
Inno Setup 打包及签名指南
小程序系统更新提示,并强制小程序重启并使用新版本
labelimg的安装与使用
Spatial domain and frequency domain image compression of images
Powerful domestic API management tool
SQL Server生成自增序号
Aardio - 封装库时批量处理属性与回调函数的方法
ZABBIX proxy server and ZABBIX SNMP monitoring
QT | UDP broadcast communication, simple use case
lora同步字设置
A Mexican airliner bound for the United States was struck by lightning after taking off and then returned safely
How do I write Flask's excellent debug log message to a file in production?
Solve project cross domain problems
case 关键字后面的值有什么要求吗?
中国固态氧化物燃料电池技术进展与发展前景报告(2022版)
(十八)LCD1602实验