Juan Coria
Juan Coria
Hi @KentDes, Yes it sounds likely that the system would be much slower on a raspberry pi. The real-time latency should be at most 500ms per chunk to run in...
Changing microphone devices is now implemented in #136 and will be part of the next v0.7 release
Hi @RustX2802, faster-whisper is not supported yet, I'm assuming you implemented it manually? Could you share the part of the code where you align the transcription and diarization?
Hey @AbrarZShahriar, yeah that is the current behavior. I would welcome a PR to open everything in the same tab if you're up for it :) The key thing here...
@BlokusPokus it seemed to work last time I tried but I didn't merge because I wanted to include a faster implementation of Whisper and I needed to clean up the...
@GeorgeDeac @ywangwxd This feature is pretty much finished. But I think it would be nice to replace the default ASR by a faster one, like the ones you mention. Feel...
Hi @thieugiactu, that's an interesting idea. To do this in a streaming way we would need access to a pre-trained model for the enhancement task, then implement a `SpeechEnhancementModel` and...
@thieugiactu sure! Feel free to open a PR too, I'd be glad to discuss possible solutions to this
@thieugiactu something you could also do to reduce the inference time is to directly record audio at 44.1 khz. This way you avoid having to upsample in the first place
Hi @fedexman, Unlike `pyannote.audio`, `diart` is built for streaming (i.e. online) diarization. Even though it leverages pyannote models, this does not mean that the two pipelines are comparable. Sacrifices in...