Greysuki

Results: 4 comments by Greysuki

I have tested pyannote-audio for speaker diarization. The error rate is about 30%, and it needs a lot of extra installation steps. On the other hand, segmentation (and VAD) works pretty well. I'll...

A successful example of Whisper + speaker diarization: https://github.com/MahmoudAshraf97/whisper-diarization

Another example: https://huggingface.co/spaces/vumichien/Whisper_speaker_diarization/blob/main/app.py

    device = 0 if torch.cuda.is_available() else "cpu"
    pipe = pipeline(
        task="automatic-speech-recognition",
        model=MODEL_NAME,
        chunk_length_s=30,
        device=device,
    )
    os.makedirs('output', exist_ok=True)
    pipe.model.config.forced_decoder_ids =...
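Both linked examples pair a Whisper transcript with a separate diarization pass, so at some point the two outputs have to be merged. The sketch below shows one common way to do that, by giving each transcript segment the speaker whose turns overlap it the most. It is not taken from either project: the tuple layouts, the `assign_speakers` helper, and the sample timestamps are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical layouts, chosen for this sketch only:
Segment = Tuple[float, float, str]  # (start_s, end_s, text) from the ASR step
Turn = Tuple[float, float, str]     # (start_s, end_s, speaker) from the diarizer


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Segment],
    turns: List[Turn],
    unknown: str = "UNKNOWN",
) -> List[Tuple[str, float, float, str]]:
    """Label each transcript segment with the speaker that overlaps it most."""
    labelled = []
    for seg_start, seg_end, text in segments:
        # Sum overlap per speaker, since one speaker may have several turns.
        totals: Dict[str, float] = {}
        for turn_start, turn_end, speaker in turns:
            shared = overlap(seg_start, seg_end, turn_start, turn_end)
            if shared > 0:
                totals[speaker] = totals.get(speaker, 0.0) + shared
        # Segments that no turn covers keep the placeholder label.
        best = max(totals, key=totals.get) if totals else unknown
        labelled.append((best, seg_start, seg_end, text))
    return labelled


if __name__ == "__main__":
    # Made-up timestamps, only to show the call shape.
    demo_segments = [(0.0, 2.0, "hello"), (2.0, 5.0, "hi there")]
    demo_turns = [(0.0, 2.5, "SPEAKER_00"), (2.5, 6.0, "SPEAKER_01")]
    for speaker, start, end, text in assign_speakers(demo_segments, demo_turns):
        print(f"[{start:.1f}-{end:.1f}] {speaker}: {text}")
```

Segment-level assignment like this is coarse: a segment that spans a speaker change gets a single label, which is one reason some pipelines merge at word level instead.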

Hi, @DavidDah. I can't reproduce the issue with the same filename.
- How big is the file?
- Could you please check whether the file `C:\Users\user\AppData\Local\Temp\tempfreesubtitle\main.MP4` exists?
- Additionally,...
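For anyone who wants to answer the first two questions in one go, here is a minimal sketch that reports whether a file exists and how large it is. The `describe_file` helper is hypothetical and not part of the application being discussed; only the path is taken from the comment.

```python
from pathlib import Path
from typing import Optional, TypedDict


class FileInfo(TypedDict):
    path: str
    exists: bool
    size_bytes: Optional[int]


def describe_file(path: str) -> FileInfo:
    """Report whether `path` is an existing regular file and, if so, its size in bytes."""
    p = Path(path)
    if not p.is_file():
        return {"path": str(p), "exists": False, "size_bytes": None}
    return {"path": str(p), "exists": True, "size_bytes": p.stat().st_size}


if __name__ == "__main__":
    # Path quoted in the comment; change it to the file being checked.
    info = describe_file(r"C:\Users\user\AppData\Local\Temp\tempfreesubtitle\main.MP4")
    print(info)
```

Pasting the printed dictionary into the thread would cover both the size and the existence check.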