当前位置:网站首页>Clip +json parsing converts the sound in the video into text
Clip +json parsing converts the sound in the video into text
2022-07-06 22:26:00 【fangye945a】
Preface
Sometimes we want to convert the audio in a video into text , The simplest and dumbest way is to listen through human ears , Then type it out . But this method is undoubtedly very laborious . Smart friends must know that we can do this with the help of speech recognition tools , But the more realistic point is , Many identification software is not charged , That is, the recognition accuracy is relatively low . So what can we do to convert the audio in the video into text without spending money and accurately ?
resolvent
Friends with editing experience must know , Cut and reflect
The subtitle recognition function of this software , Behind it is ByteDance speech recognition technology , The recognition accuracy of Putonghua is extremely high . But one thing is , The subtitles he recognizes are segmented , Copy them one by one to form a complete text , Its workload is also a lot .
that , Here is a simple method :
Specific operation steps
- 1. download windows Version clipping
- 2. Import video
- 3. Identify subtitles ( After importing the video , Click on the text -> Smart subtitles -> Just start to recognize )
- 4. Wait a while ( The length of time depends on the size of the video file ), Subtitles will be automatically generated .
- 5. After generating subtitles , Close clip , Will be automatically saved as a draft . At this time, open the clip , The box of draft parameters appears in the upper right corner , In the box, we can find the path to save the draft .
- 6. stay windows Open the save location in Explorer , You can see the following documents .
- 7. It is not difficult to find out according to the English name ,draft_content.json Is our draft content , The subtitle text just generated is also saved in this json In file .
- 8. Since it is json file , The next step is naturally json Parsed , We can extract the subtitle content from this file .
- 9. analysis json file , Find the fields to parse , simplified json The structure is as follows :
{"materials":{"texts":[{"content":"<font id=\"\" path=\"D:/Program Files/JianyingPro/3.0.5.8542/Resources/Font/SystemFont/zh-hans.ttf\"><color=(1.000000, 1.000000, 1.000000, 1.000000)><size=5.000000> Hello, students </size></color></font>"}]}}
And we need subtitles , By three xml Element label wrap .
- 10. Let's go through json Parse get content The contents of the field , And then eliminate by string processing xml label , Get the final text subtitle content .
Here is a simple one I wrote python Script , It can be easily and quickly processed to clip this one with subtitles json file :
import json
if __name__ == '__main__':
with open("draft_content.json", "r", encoding='utf-8') as f:
json_str = f.read()
json_content = json.loads(json_str)
texts_data = json_content["materials"]["texts"]
for text_data in texts_data:
content = text_data["content"]
content = content[:-22]
result = content.split("><size=")[1].split(">")[1]
print(result)
- 11. Directly connect the script to json The files are placed in the same directory , perform python The script can get the subtitle content you need .
We can also redirect symbols , Directly generate the output content into a file
python3 parse_jianyin.py >> output.txt
边栏推荐
- 剪映+json解析将视频中的声音转换成文本
- 第4章:再谈类的加载器
- 图像的spatial domain 和 frequency domain 图像压缩
- Spatial domain and frequency domain image compression of images
- What are the interface tests? What are the general test points?
- 2022-07-05 使用tpcc对stonedb进行子查询测试
- BarcodeX(ActiveX打印控件) v5.3.0.80 免费版使用
- 0 basic learning C language - interrupt
- signed、unsigned关键字
- [线性代数] 1.3 n阶行列式
猜你喜欢
网络基础入门理解
RESNET rs: Google takes the lead in tuning RESNET, and its performance comprehensively surpasses efficientnet series | 2021 arXiv
将MySQL的表数据纯净方式导出
硬件开发笔记(十): 硬件开发基本流程,制作一个USB转RS232的模块(九):创建CH340G/MAX232封装库sop-16并关联原理图元器件
2020 Bioinformatics | GraphDTA: predicting drug target binding affinity with graph neural networks
[sciter]: encapsulate the notification bar component based on sciter
Attack and defense world miscall
2021 geometry deep learning master Michael Bronstein long article analysis
Seata aggregates at, TCC, Saga and XA transaction modes to create a one-stop distributed transaction solution
【数字IC手撕代码】Verilog无毛刺时钟切换电路|题目|原理|设计|仿真
随机推荐
Mise en place d'un environnement de développement OP - tee basé sur qemuv8
C # realizes crystal report binding data and printing 4-bar code
MySQL----初识MySQL
NetXpert XG2帮您解决“布线安装与维护”难题
(十八)LCD1602实验
GPS from getting started to giving up (XI), differential GPS
Shortcut keys in the terminal
2022-07-05 使用tpcc对stonedb进行子查询测试
[10:00 public class]: basis and practice of video quality evaluation
i. Mx6ull build boa server details and some of the problems encountered
The SQL response is slow. What are your troubleshooting ideas?
Report on technological progress and development prospects of solid oxide fuel cells in China (2022 Edition)
Solve project cross domain problems
Lora sync word settings
GPS from getting started to giving up (12), Doppler constant speed
BarcodeX(ActiveX打印控件) v5.3.0.80 免费版使用
Management background --2 Classification list
2020 Bioinformatics | GraphDTA: predicting drug target binding affinity with graph neural networks
Research and investment strategy report of China's VOCs catalyst industry (2022 Edition)
3DMax指定面贴图