whisperer
Generate granular word-level captions in SRT format
Whisperer
This really messy script combines OpenAI's Whisper with a PyTorch tutorial on forced alignment to generate word-level .srt captions.
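For a sense of what "word-level .srt captions" means, here is a minimal sketch of the output step only. It assumes you already have one `(text, start_seconds, end_seconds)` tuple per word, for example from a forced-alignment pass. The helper names `format_timestamp` and `words_to_srt` are hypothetical and are not taken from the script itself.

```python
def format_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp form HH:MM:SS,mmm."""
    millis = int(round(seconds * 1000))
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def words_to_srt(words) -> str:
    """Build SRT text with one caption per word.

    `words` is an iterable of (text, start_seconds, end_seconds) tuples.
    This input shape is an assumption for illustration.
    """
    blocks = []
    for index, (text, start, end) in enumerate(words, start=1):
        # Each SRT block is: counter, time range, caption text, blank line.
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text}\n"
        )
    return "\n".join(blocks)


if __name__ == "__main__":
    print(words_to_srt([("hello", 0.0, 0.4), ("world", 0.4, 0.9)]))
```

Each word becomes its own numbered caption, which is what lets the text appear one word at a time.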
Usage
Install the dependencies, put a video.wav file with a 16000 Hz sample rate in the folder, run the script, and pray it works.
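Before running the script, it may help to confirm that video.wav really is sampled at 16000 Hz. This is a small sketch using only Python's standard `wave` module; the helper names `wav_sample_rate` and `is_16khz` are hypothetical, and it assumes the file is an uncompressed PCM WAV that `wave` can read.

```python
import wave


def wav_sample_rate(path: str) -> int:
    """Return the sample rate (frames per second) stored in a WAV header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def is_16khz(path: str) -> bool:
    """True if the WAV file is sampled at 16000 Hz."""
    return wav_sample_rate(path) == 16000


if __name__ == "__main__":
    import sys

    # Assumed default file name, matching the usage instructions.
    path = sys.argv[1] if len(sys.argv) > 1 else "video.wav"
    print(f"{path}: {wav_sample_rate(path)} Hz")
```

If the reported rate is not 16000, resample the audio before running the script.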
Example
https://twitter.com/thejohnfish/status/1574204788408926208?s=20&t=IeCezzbsOso508wPYH9ZXg