diart issues

[joss] Need locations of benchmarking data!

2

Trying to find the data to run the benchmarks, and I can't find all the source data:

- [x] **VoxConverse** - found it!
- [ ] **AMI** - found the...

sneakers-the-rat

documentation

Add tutorials to the documentation page

- [ ] Getting started (audio sources, pipelines, inference)
- [ ] Speaker Diarization
- [ ] Voice Activity Detection
- [ ] Benchmark
- [ ] Hyper-parameter tuning
-...

juanmc2005

documentation

Add a caching mechanism for benchmark and tuning

### Problem

It's getting more and more difficult to tune and evaluate diarization pipelines with different models or combinations of models, even with a GPU.

### Idea

Implement a caching...
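The issue text is cut off before it describes the intended design, so the following is only a minimal sketch of one way such a cache could work, not the approach the project adopted. It assumes that the expensive step is a function of a model name and an audio file path, and that its result can be pickled. The names `cache_key`, `cached_call`, and `compute` are hypothetical and are not part of diart.

```python
import hashlib
import pickle
from pathlib import Path
from typing import Any, Callable


def cache_key(model_name: str, audio_path: str) -> str:
    """Build a stable key from the (hypothetical) model name and audio path."""
    digest = hashlib.sha256(f"{model_name}::{audio_path}".encode("utf-8"))
    return digest.hexdigest()


def cached_call(
    cache_dir: Path,
    model_name: str,
    audio_path: str,
    compute: Callable[[str], Any],
) -> Any:
    """Return the cached result for this model/file pair, computing it once."""
    cache_dir.mkdir(parents=True, exist_ok=True)
    entry = cache_dir / f"{cache_key(model_name, audio_path)}.pkl"
    if entry.exists():
        # Cache hit: skip the expensive computation entirely.
        with entry.open("rb") as f:
            return pickle.load(f)
    # Cache miss: compute, then store the result for later runs.
    result = compute(audio_path)
    with entry.open("wb") as f:
        pickle.dump(result, f)
    return result
```

With a cache like this, a second benchmark or tuning run over the same files with the same model would read stored outputs from disk instead of running inference again. A real implementation would also need to invalidate entries when model weights or hyper-parameters change, which this sketch does not handle.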

juanmc2005

feature

Load pipeline config from yaml file

### Problem

Configuring a pipeline and tracking changes is hard with the large amount of arguments. This also leads to duplicated code in the CLI scripts.

### Idea

Load configurations...

juanmc2005

API

[joss] Tests?

5

Hey! Sorry for the long delay. Just started a new job and things have been hectic. Orienting myself to the package, and I can't seem to find any tests? JOSS...

sneakers-the-rat

ops

Adapt "step" automagically

`step` controls the minimum _algorithmic latency_ of the speaker diarization pipeline. Targeting real-time processing, one needs to make sure that the _processing latency_ (i.e. the time it takes to process...
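The relationship between the two latencies can be illustrated with a small sketch. It assumes that real-time operation requires the time spent processing one chunk to stay below `step`, so it measures the worst-case processing time and picks the smallest step that keeps up. The function names, the safety `margin`, and the `min_step`/`max_step` bounds are hypothetical illustration values, not diart's API or defaults.

```python
import time
from typing import Callable, Sequence


def measure_processing_latency(
    process_chunk: Callable[[object], object],
    chunks: Sequence[object],
) -> float:
    """Return the worst-case wall-clock time (seconds) spent on one chunk."""
    worst = 0.0
    for chunk in chunks:
        start = time.perf_counter()
        process_chunk(chunk)
        worst = max(worst, time.perf_counter() - start)
    return worst


def choose_step(
    processing_latency: float,
    min_step: float = 0.1,   # hypothetical lower bound, in seconds
    max_step: float = 5.0,   # hypothetical upper bound, in seconds
    margin: float = 1.2,     # hypothetical safety factor above measured latency
) -> float:
    """Pick the smallest step (seconds) that the pipeline can keep up with."""
    candidate = processing_latency * margin
    # Clamp to the allowed range so the step is never unreasonably small or large.
    return min(max(candidate, min_step), max_step)
```

For example, if processing a chunk takes up to 0.25 s on a given machine, `choose_step(0.25)` suggests a step of about 0.3 s. A faster machine could use a smaller step and get lower algorithmic latency.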

hbredin

feature

Add speaker-aware transcription

6

**Depends on #144** This PR adds a new `SpeakerAwareTranscription` pipeline that combines streaming diarization and streaming transcription to determine "who says what" in a live conversation. By default, this is...

juanmc2005

feature

Add hooks for improved customization

This PR addresses issue #102

juanmc2005

feature

API

Speaker-blind speech recognition

6

**Depends on #143** Adding a streaming ASR pipeline needed a big refactoring (that began with #143). This PR continues this effort to allow a new type of pipeline that transcribes...

juanmc2005

bug

feature

API

refactoring

Running Diart_Whisper on Windows and nothing happens

2

Hello, I've been trying to get your colored text demo working but nothing seems to happen. I've gotten the basic demo working from this repo and it works fine, but...

ScottSump

question
