jhj0517

Results 153 comments of jhj0517

Hi, I've added a TODO list to the README. I'll work on it later!

I'm testing whisperX and listing some issues here:
- Incompatible torch version
  - whisperX models were trained on `torch 1.10.0+cu102` and this WebUI uses `torch 2.3.1+cu121`
- Slow transcription -...

Yes, it seems that whisperX runs diarization as a post-processing step on the faster-whisper result. So I think I should modularize the diarization and integrate it with faster-whisper.

Speaker diarization is now enabled in #181. The speaker label is embedded into the text with a `|` divider. For example, w/ diarization: ``` 1 00:00:00,000 --> 00:00:04,879 SPEAKER_00|Now, as all books not...
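If you need to separate the speaker label from the text downstream, something like the following may work. This is only a minimal sketch, assuming each subtitle text line starts with a `SPEAKER_XX` label followed by the `|` divider as described above. The helper name `split_speaker` and the sample strings are hypothetical and not part of the WebUI.

```python
from typing import Optional, Tuple


def split_speaker(text: str, divider: str = "|") -> Tuple[Optional[str], str]:
    """Split 'SPEAKER_00|some text' into ('SPEAKER_00', 'some text').

    Hypothetical helper: assumes the label prefix starts with 'SPEAKER_'.
    Lines without such a prefix are returned unchanged with a None label.
    """
    # partition() splits on the first divider only, so any later '|'
    # characters stay in the spoken text.
    label, sep, rest = text.partition(divider)
    if sep and label.startswith("SPEAKER_"):
        return label, rest
    return None, text


if __name__ == "__main__":
    print(split_speaker("SPEAKER_00|Hello there"))
    print(split_speaker("No label on this line"))
```

Checking for the `SPEAKER_` prefix is a design choice to avoid misreading ordinary text that happens to contain a `|` as a speaker label.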

@moda20 Can you show the full log before the Traceback? This could happen if the model failed to load. To use the pyannote model, you need to go to the 1....

@moda20 Running the diarization models on the CPU may help in that case. You can change the device in the dropdown. ![image](https://github.com/jhj0517/Whisper-WebUI/assets/97279763/1a8a6d26-e4bd-4af8-91e2-89655cabc92d)

@cookiexND Thanks for reporting this. It's fixed in #183. @Tom-Neverwinter Can you provide more information about the error you received?

Hi @linuxlurak. I tried to fix the bug in #244. Can you check the latest version? You can update the WebUI with `update.sh`.

Hi @joshuachough, thanks for reporting. I just found a critical bug where VAD wasn't applied to the audio. It's fixed in #206. And according to [faster-whisper #452](https://github.com/SYSTRAN/faster-whisper/issues/452#issuecomment-1704859269),...

Hi @joshuachough. I recently found that VAD was incorrectly implemented (there were more bugs than in the earlier one...), and it's fixed in #216. VAD works by first removing the...