
Jianying + JSON parsing: converting the audio in a video into text

2022-07-06 22:26:00 【fangye945a】

Preface

Sometimes we want to convert the audio in a video into text. The simplest, crudest way is to listen and type it out by hand, but that is very laborious. Savvy readers will know that speech recognition tools can do this; in practice, though, many of them either charge a fee or have relatively low recognition accuracy. So how can we convert the audio in a video into text accurately, without spending money?

Solution

Anyone with video editing experience will know the subtitle recognition feature of Jianying. It is backed by ByteDance's speech recognition technology, and its recognition accuracy for Mandarin is very high. The catch is that the subtitles it recognizes are split into segments, and copying them one by one to assemble the full text is still a lot of work.
So, here is a simple method:

Specific operation steps

1. Download the Windows version of Jianying.
2. Import the video.
3. Recognize the subtitles (after importing the video, click Text -> Smart subtitles -> Start recognition).
4. Wait a while (how long depends on the size of the video file); the subtitles are generated automatically.
5. Once the subtitles are generated, close Jianying; the project is automatically saved as a draft. Open Jianying again, and a draft parameters box appears in the upper right corner, where we can find the path the draft is saved to.
6. Open that location in Windows Explorer, and you will see the following files.
7. As the English name suggests, draft_content.json holds the draft content, and the subtitle text just generated is stored in this JSON file as well.
8. Since it is a JSON file, the natural next step is to parse the JSON and extract the subtitle content from it.
9. Parse the JSON file and find the fields we need. The simplified JSON structure is as follows:

{"materials":{"texts":[{"content":"<font id=\"\" path=\"D:/Program Files/JianyingPro/3.0.5.8542/Resources/Font/SystemFont/zh-hans.ttf\"><color=(1.000000, 1.000000, 1.000000, 1.000000)><size=5.000000> Hello, students </size></color></font>"}]}}

The subtitle text we need is wrapped in three XML-style tags (font, color, and size).

10. Parse the JSON to get the value of the content field, then strip the XML tags with string processing to obtain the final subtitle text.
Here is a simple Python script I wrote that quickly processes the draft_content.json file containing the subtitles:

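import json

# Length of the fixed closing suffix "</size></color></font>" (22 characters).
CLOSING_TAGS_LEN = len("</size></color></font>")

if __name__ == '__main__':
    # draft_content.json must be in the current working directory.
    with open("draft_content.json", "r", encoding='utf-8') as f:
        json_content = json.load(f)
    # Each recognized subtitle segment is one entry under materials -> texts.
    for text_data in json_content["materials"]["texts"]:
        content = text_data["content"]
        # Drop the closing tags at the end of the string.
        content = content[:-CLOSING_TAGS_LEN]
        # Keep what follows the <size=...> opening tag; maxsplit=1 preserves
        # any ">" that appears inside the subtitle text itself.
        result = content.split("><size=")[1].split(">", 1)[1]
        print(result)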

11. Put the script in the same directory as the JSON file and run it from there to get the subtitle content you need.
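
The slicing in the script relies on the tags appearing in exactly the order shown in step 9. As an alternative, here is a minimal sketch that strips every angle-bracket tag with a regular expression instead. The names TAG_PATTERN, strip_tags, and extract_subtitles are made up for illustration, the font path in the sample is shortened, and the sketch assumes the subtitle text itself never contains a "<" character.

```python
import json
import re

# Matches any angle-bracket tag such as <font ...>, <color=(...)>, <size=...>
# and the closing forms. Assumption: the subtitle text contains no "<".
TAG_PATTERN = re.compile(r"<[^<>]*>")


def strip_tags(content: str) -> str:
    """Remove every <...> tag and trim the surrounding whitespace."""
    return TAG_PATTERN.sub("", content).strip()


def extract_subtitles(draft: dict) -> list:
    """Return the plain text of every entry under materials -> texts."""
    return [strip_tags(item["content"]) for item in draft["materials"]["texts"]]


# The simplified sample from step 9, with the font path shortened.
sample = json.loads(
    r'{"materials":{"texts":[{"content":"<font id=\"\" path=\"zh-hans.ttf\">'
    r'<color=(1.000000, 1.000000, 1.000000, 1.000000)><size=5.000000>'
    r' Hello, students </size></color></font>"}]}}'
)
print(extract_subtitles(sample))
```

With a real draft, the dictionary returned by json.load on draft_content.json could be passed to extract_subtitles in place of sample, and the resulting list joined with newlines to save the whole transcript as one text file.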