diart
diart copied to clipboard
A python package to build AI-powered real-time audio applications
## Problem There's been some interest in adding compatibility with networking protocols (see #9 #27 #67 #68). ## Idea Add a `TCPAudioSource` and `UDPAudioSource` following a similar implementation to `MicrophoneAudioSource`.
## Problem There's been some interest in adding compatibility with networking protocols (see #9 #27 #67 #68). ## Idea Add a `WebsocketAudioSource` following a similar implementation to `MicrophoneAudioSource`.
Basic Colab example notebook, can be updated and modified by you as you like. https://colab.research.google.com/drive/1Bp4_v5SNU1anDDEDkvA1FLnqGIfxM7ov?usp=sharing
## Problem It's difficult to understand the shape and format expected from tensors. ## Idea Add [torchtyping](https://github.com/patrick-kidger/torchtyping). There's also the possibility to check that types are respected at runtime (instead...
## Problem Cannot-link constraints are currently hard-coded in `OnlineSpeakerClustering`. If a segmentation model over-segments speakers, it may be better to rely on speaker embeddings instead to determine the identity of...
There is some dead code inside `OnlineSpeakerClustering`. It would be better to remove it to improve the quality and readability of the code base.
I've been trying a while to get streaming diarization + recognition working together. I've failed trying many different strategies for streaming wav to pyannote. I started with an example that...
It would be nice to make diart compatible with the `torchaudio` streaming API. This would also make it possible to easily add other audio sources like the screen, videos, and...
_just putting it here as a note to myself :)_ * Looks like [it is now possible](https://gradio.app/real_time_speech_recognition/) to do live speech processing in [Gradio](https://gradio.app/). * There has also been [some](https://github.com/dataprofessor/realtime-transcription)...
Hi @zaouk, @Yagna24, @ckliao-nccu, @igordertigor, @AMITKESARI2000! I'm planning to submit a paper for diart at [The Journal of Open Source Software (JOSS)](https://joss.theoj.org/) and it seems natural that all of you,...