jhj0517 comments

Results 153 comments of


                                            jhj0517

WhisperX - Speaker Diarization

Hi, I've made a TODO list in the README and added it. I'll work on it later!

WhisperX - Speaker Diarization

I'm testing whisperX and listing some issues here: - Incompatible torch version - whisperX models were trained on `torch 1.10.0+cu102` and this WebUI uses `torch 2.3.1+cu121` - Slow transcription -...

WhisperX - Speaker Diarization

Yes, it seems that whisperX post-process diarization with the result of the faster-whisper. So I think I should modularize the diarization and integrate it with faster-whisper.

WhisperX - Speaker Diarization

Speaker diarization is now enabled in #181. Diarization is embedded into the text with `|` divider. For example, w/ diarization: ``` 1 00:00:00,000 --> 00:00:04,879 SPEAKER_00|Now, as all books not...

WhisperX - Speaker Diarization

@moda20 Can you show the full log before the Traceback? This could happen if the model failed to load. To use pyannote model, you need to go to the 1....

WhisperX - Speaker Diarization

@moda20 Trying to run diarization models with CPU may help in that case. You can change the device in the dropdown. ![image](https://github.com/jhj0517/Whisper-WebUI/assets/97279763/1a8a6d26-e4bd-4af8-91e2-89655cabc92d)

WhisperX - Speaker Diarization

@cookiexND Thanks for reporting this. It's fixed in #183 @Tom-Neverwinter Can you provide more information about the error you received?

WhisperX - Speaker Diarization

Hi @linuxlurak. I tried to fix the bug in #244, can you check the latest version? You can update the WebUI with `update.sh`.

Generated subtitles are too long

Hi @joshuachough thanks for reporting. I just found out that there was a critical bug that VAD didn't apply to audio, It's fixed in #206. And according to [faster-whisper #452](https://github.com/SYSTRAN/faster-whisper/issues/452#issuecomment-1704859269),...

Generated subtitles are too long

Hi @joshuachough. I recently found that VAD was incorrectly implemented (There were more bugs than in the earlier one...), and it's fixed in #216. VAD works by first removing the...