Leo Huang
@ggerganov please help, I did exactly the same thing as @Dmitriuso did:

```sh
yt-dlp -xv --audio-format wav -o skillsfuture.wav "https://www.youtube.com/watch?v=girQacfWjMw&list=PLH2CR4s1lqyjFm4vQPKT0-hE8sh2T10I1"
ffmpeg -i skillsfuture.wav -acodec pcm_s16le -ar 16000 sf.wav
./main -m ../whisper-models/ggml-base.en.bin
```
...
Would it be possible to integrate the ECAPA-TDNN model from [SpeechBrain](https://github.com/speechbrain/speechbrain) into this project, as the following project has done? https://huggingface.co/spaces/vumichien/Whisper_speaker_diarization I tested it with this video, https://www.youtube.com/watch?v=girQacfWjMw&list=PLH2CR4s1lqyjFm4vQPKT0-hE8sh2T10I1, and it works pretty well. But it is...
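For context: that Space pairs Whisper transcription with per-segment ECAPA-TDNN speaker embeddings, then clusters the embeddings to assign speaker labels. Below is a rough, self-contained sketch of just the clustering step. The function names and the greedy centroid strategy are my own illustration, not SpeechBrain's API; real pipelines typically use agglomerative clustering over the full similarity matrix instead.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def assign_speakers(embeddings, threshold=0.75):
    """Greedy online clustering: each segment embedding joins the most
    similar existing speaker centroid (if above threshold), else it
    starts a new speaker. Returns one integer label per segment."""
    centroids = []  # one running-mean embedding per speaker
    counts = []     # how many segments each centroid averages
    labels = []
    for emb in embeddings:
        best, best_sim = None, threshold
        for i, c in enumerate(centroids):
            sim = cosine_similarity(emb, c)
            if sim >= best_sim:
                best, best_sim = i, sim
        if best is None:
            centroids.append(list(emb))
            counts.append(1)
            labels.append(len(centroids) - 1)
        else:
            counts[best] += 1
            # update the running mean of the matched centroid
            centroids[best] = [(c * (counts[best] - 1) + e) / counts[best]
                               for c, e in zip(centroids[best], emb)]
            labels.append(best)
    return labels
```

With real ECAPA-TDNN embeddings (192-dim vectors from `speechbrain/spkrec-ecapa-voxceleb`), the threshold would need tuning; the toy vectors here just show the mechanics.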
Thanks @bjnortier for the quick reply. I previously used the code from commit 09e90680072d8ecdf02eaf21c393218385d2c616, and it works perfectly on the same iPhone device. Does this mean there is much more memory usage since...
"When you load a CoreML model it is optimised on the device" - is the optimized model saved to local storage, or is it kept only in memory? If the answer is the latter...
> Hello, the download failed because the network connection dropped while the audio data was being downloaded. How can I resume the download from the point where it was interrupted?...
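The usual approach is an HTTP Range request: check how many bytes of the partial file are already on disk and ask the server for the rest. This is a minimal sketch, assuming the server supports Range requests (most static file hosts do); the function names are mine, not from any project discussed here.

```python
import os
import urllib.request

def resume_offset(dest_path):
    """Byte offset to resume from: the size of the partial file, or 0."""
    return os.path.getsize(dest_path) if os.path.exists(dest_path) else 0

def resume_download(url, dest_path, chunk_size=64 * 1024):
    """Resume an interrupted download using an HTTP Range request."""
    offset = resume_offset(dest_path)
    req = urllib.request.Request(url)
    if offset:
        req.add_header("Range", f"bytes={offset}-")
    with urllib.request.urlopen(req) as resp, open(dest_path, "ab") as f:
        # 206 Partial Content means the server honoured the Range header;
        # a plain 200 means it ignored it and is resending the whole file,
        # so discard the partial data and start over.
        if offset and resp.status != 206:
            f.truncate(0)
        while True:
            chunk = resp.read(chunk_size)
            if not chunk:
                break
            f.write(chunk)
```

Note this only works when the remote file has not changed between attempts; a robust downloader would also validate `ETag` or `Last-Modified` before appending.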
I'm also interested in this topic. Any update?
- "... it might change in the future": Does this mean pyannote/embedding will be optimized, or that there will be a better model than speechbrain/spkrec-ecapa-voxceleb?
- "... you need to optimize this..."