Jun Chen
Jun Chen
> How did you get this error? > > Can you share the steps to reproduce? (model size, transcription options, input file if possible) Hi, sorry I can't share the...
I basically tried the notebook here: https://github.com/Majdoddin/nlp/blob/main/Pyannote_plays_and_Whisper_rhymes_v_2_0.ipynb It's not happening on every of my input file though
thanks, I'll try today or tomorrow
sorry was busy with some deadlines. I wasn't able to run that library to repro today after tried for 10+mins, but since I'll not use it, I assume this is...
I think encountered this issue as well, I split the stereo audio to two mono channels, there're expected long silence in each mono audio. Using vad_filter=True fixed a few, but...
Thanks @guillaumekln ! Yeah in my experience end is more accurate than start. So accounting for end will be more accurate. I have one question about `speech_pad_ms` Does it just...
> > So as long as it's smaller than min_speech_duration_ms, then should be fine? > > Did you mean `min_silence_duration_ms`? sorry yes I mean `min_silence_duration_ms` =.= Actually I have another...
Will try with predict speaker and not split
that doesn't work well, I probably still need to split and improve the timestamps matchings -.-
> > Tried to use a thread pool in python to submit jobs for audios, > > That's a good approach. Did you also increase `num_workers` when doing that? Normally...