speakerbox icon indicating copy to clipboard operation
speakerbox copied to clipboard

Speakerbox: Fine-tune Audio Transformers for speaker identification.

Results 9 speakerbox issues
Sort by recently updated
recently updated
newest added

Bumps [codecov/codecov-action](https://github.com/codecov/codecov-action) from 3 to 4. Release notes Sourced from codecov/codecov-action's releases. v4.0.0 v4 of the Codecov Action uses the CLI as the underlying upload. The CLI has helped to...

dependencies

### Describe the Bug After a small training run, the program halted with the following error indicating a required library was missing. ### Expected Behavior Completion of training. ### Reproduction...

bug

### Describe the Bug `speakerbox` requires `datasets~=1.18` which in turn requires `numpy>=1.17`. The `datasets` `1.x` line hasn't been updated since mid-2022. `numpy` `v1.20` [deprecated use of aliases of built-in types](https://numpy.org/devdocs/release/1.24.0-notes.html#expired-deprecations)...

bug

### Describe the Bug The `apply()` function fails as a Hugging Face token is not supplied. The contained [`_diarize()` function calls](https://github.com/CouncilDataProject/speakerbox/blob/e9f1f4efd0199a5c0266873c345d11aa13d72ec8/speakerbox/main.py#L363): ```bash diarization_pipeline = Pipeline.from_pretrained("pyannote/speaker-diarization") ``` This results in an...

bug

### Describe the Bug When attempting an installation of the module, version 0.0.2 is installed by default. ### Expected Behavior PyPI has all of the versions, not sure why this...

bug

Hi! First of all, thank you for developing SpeakerBox. It has proven to be an invaluable tool for my current project. Allow me to provide you with some context regarding...

enhancement

### Feature Description Support a more direct directory structure of speakers instead of "conversations". i.e.: ``` data/ ├── bob/ | ├── 0.wav | ├── 1.wav | ├── 2.wav | ├──...

enhancement
good first issue

### Link to Relevant Issue This pull request resolves #14 ### Description of Changes Change only on 1 line, glob("*.wav") => chain of globs for every relevant audio file format....

### Feature Description In `preprocess.py`, you use `AudioSegment.from_file` to load the audio files. According to [definition](https://github.com/jiaaro/pydub#quickstart) of this function, it seems it accepts a bunch of file formats. but at...

enhancement