diart issues

Minimizing missed detection

7

I've been working on tuning the pipeline for my application, which is a real-time conversational system. The best results so far are: 36.02% DER, 2.41% false alarm, 20.04% missed...

arielrado

question
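
As a reading aid for the figures quoted in this issue: assuming the usual decomposition of diarization error rate (DER = false alarm + missed detection + speaker confusion, all relative to total reference speech), the remaining error can be attributed to speaker confusion. The helper name `confusion_rate` below is hypothetical and not part of diart; this is only a sketch of the arithmetic, not the author's evaluation code.

```python
def confusion_rate(der: float, false_alarm: float, missed: float) -> float:
    """Speaker confusion implied by DER = false_alarm + missed + confusion.

    All values are percentages. This assumes the standard DER decomposition;
    it is not taken from diart's own metric code.
    """
    return round(der - false_alarm - missed, 2)


# Figures quoted in the issue preview above.
confusion = confusion_rate(der=36.02, false_alarm=2.41, missed=20.04)
print(f"implied speaker confusion: {confusion}%")
```

Under that assumption, roughly 13.57 points of the reported DER would come from speaker confusion, with missed detection remaining the largest single component.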

quality concerns

1

It looks like the pipeline quickly forgets previous speakers and assigns their tags to new ones, so that a conversation of 4-5 people is inferred as a conversation of 2. I am...

DmitriyG228

duplicate

question

Create a Docker image

5

## Problem Setting up the project takes a bit too long with all the dependencies and the use of conda. ## Idea Create and publish Docker images with new diart...

juanmc2005

ops

Feature Request: Implementing Persistent Speaker Embeddings Across Conversations

3

### Feature Description I propose the addition of a feature to the DIART project that allows for the persistence and reuse of speaker embeddings across multiple conversations. I am willing...

DmitriyG228

feature

The latency of wespeaker model is too large

1

Hello @juanmc2005, I use the hbredin/wespeaker-voxceleb-resnet34-LM (ONNX) model to extract speaker embeddings in the diarization pipeline, but I found the latency is too high (1300 ms) when computing per chunk with the default...

SheenChi

question

Implement voicefixer for audio enhancement

6

Is there any way to integrate [voicefixer](https://github.com/haoheliu/voicefixer_main) into the speaker diarization pipeline? The package takes a wav file as input and gives an upsampled 44.1 kHz wav file as output, but that...

thieugiactu

feature

ImportError: cannot import name 'OnlineSpeakerDiarization' from 'diart'

9

I am trying to run your tutorial on [transcription coloring](https://betterprogramming.pub/color-your-captions-streamlining-live-transcriptions-with-diart-and-openais-whisper-6203350234ef). But I am getting the mentioned error. The library runs fine per "diart.stream microphone". Running on Windows 11 with Python...

ameer-kanaan

question

wip: add pseudo speaker diarization pipeline based on segmentation stitching

1

As segmentation models are getting better, it might make sense to revisit the idea of stitching based on segmentation alone. That's what this (WIP) pipeline does. Also, that was an...

hbredin

feature

Get rid of LazyModel

### Problem `LazyModel` makes it rather complicated for someone to add their own model, especially when some changes need to be made to the input/output. The reason `LazyModel` exists is...

juanmc2005

feature

API

Optimize weighted embedding extraction with pyannote 3.1

9

With pyannote 3.1, we could do only 1 forward pass of the audio instead of `num_speakers` when extracting embeddings with weights. This is probably at least one of the causes...

juanmc2005

feature
