All the solutions I’m seeing are some third party service where I would have to upload my videos to them to get them transcribed.
Here’s one I’ve been playing with https://github.com/jhj0517/Whisper-WebUI
The small faster-whisper model has been amazing with all 3 input options it gives (files, YT, or recording), tho I’m keeping its limitations in mind and I’ve only used it with somewhat clear audio.
I second this. Also, I have been experimenting with automating it, using https://hub.docker.com/r/onerahmet/openai-whisper-asr-webservice as a provider for Bazarr
I used this CLI tool for it: https://github.com/m1guelpf/auto-subtitle
Whisper-based, worked great for subtitling S2 of ReGenesis as for some reason the DVDs lacked subtitles. I used the largest model, I think. It even managed to correctly identify which character was crying, which I thought was impressive.
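If you'd rather script it yourself than use a ready-made tool, here is a rough sketch of doing the same thing locally in Python. It is not what auto-subtitle does internally; it assumes the faster-whisper package is installed (`pip install faster-whisper`), and the helper names (`srt_timestamp`, `to_srt`, `transcribe_to_srt`) are made up for this example.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Turn an iterable of (start, end, text) tuples into SRT text."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{srt_timestamp(start)} --> {srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)


def transcribe_to_srt(media_path: str, model_size: str = "small") -> str:
    """Transcribe a local file with faster-whisper and return SRT text.

    Assumption: faster-whisper is installed. Nothing is uploaded to a
    third-party service, although the model weights are downloaded the
    first time a given model size is used.
    """
    # Imported here so the formatting helpers work without the package.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(media_path)
    return to_srt((seg.start, seg.end, seg.text) for seg in segments)
```

Something like `open("episode.srt", "w", encoding="utf-8").write(transcribe_to_srt("episode.mkv"))` would then write the subtitles next to the video; the file names are placeholders. Larger model sizes should be more accurate but slower, especially on CPU.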
That looks like pretty much what I’m looking for, thank you! I see instructions for running it on Windows, but does it also work on Macs?