qxtv

Results 4 comments of qxtv

I'll recommend using demuc to isolate the vocals first and pass it to whisper.

> it just used $15 for a very small amount of text to index on openai API. Is there any solution to reduce cost for this ? I was just...

@TedTimbrell any follow up on this?

> longer padding means a longer "speech_pad_ms"? By default, the speech_pad_ms is 400, should be changed to 800 or even higher? Yes. I went with 1000ms and 0.4 threshold iirc....