WhisperTimeSync icon indicating copy to clipboard operation
WhisperTimeSync copied to clipboard

[Feature request] Adaptation to the Whisper's JSON output

Open sensboston opened this issue 1 year ago • 8 comments

Hello, is it possible to adapt your project to the Whisper's JSON output? I'm working on karaoke program for Windows, and need all words in the lyrics to be timestamped. I'll be glad to issue a PR for this feature but unfortunately I'm not proficient in Python programming (mostly use a C# & C++).

sensboston avatar Feb 22 '24 20:02 sensboston

@sensboston

I will have a look at it as soon as possible. To be sure of what you expect, can you provide me with an example (JSON+TXT)?

WhisperTimeSync is not written in Python but in Java. ;-)

EtienneAb3d avatar Feb 23 '24 07:02 EtienneAb3d

Here we go: samples.zip There are two directories: English (Smokie, "Living Next Door to Alice" and Russian (Bit-quartet Secret, "Alice") with JSONs and original lyrics (my daughter's name is Alice 😉 ). Whisper's English output is kinda affordable but Russian is a complete mess.

sensboston avatar Feb 23 '24 16:02 sensboston

@sensboston Hmmm... The problem I see with this JSON format is that each word has a mandatory description including its timestamp. It will be very hard to decide what to do with not-matching words. 🤔

EtienneAb3d avatar Feb 23 '24 19:02 EtienneAb3d

Yeah, it's an issue, agree. But I haven't looked to your (or Java code you've ported) implementation (yet). Theoretically it's possible, even without involving AI - for example by using "soundex" algorithm. I thought about this but initially wanna check if someone already done this.

sensboston avatar Feb 23 '24 19:02 sensboston

@sensboston The problem is not to match word by word, this is what WhisperTimeSync is doing. The problem is to know what to do with unmatching words in this specific JSON description. I may adapt an algo I already have for similar cases, but this is quite a work. Do you have a budget for this?

EtienneAb3d avatar Feb 23 '24 20:02 EtienneAb3d

No budget at all, I do development just for fun, will publish open source here when it done.

P.S. If you want, I'll add you to this private repo (but you need a Windows PC to test at least).

sensboston avatar Feb 23 '24 20:02 sensboston

Any progress? Or you have no idea how to implement this? Please le me know - I don't wanna to waste a time.

sensboston avatar Feb 29 '24 05:02 sensboston

@sensboston I understood you were working on the subject on your side. On my side, without a budget, I have to find/allocate on my free time.

EtienneAb3d avatar Mar 05 '24 07:03 EtienneAb3d