pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) databases
Part of my configuration:
```
Databases:
  # tell pyannote.database where to find AMI wav files.
  # {uri} is a placeholder for the session name (eg. ES2004c).
  # you might need...
```
Should all the RTTM files have unique speaker tags? For example, if there are three audio files and all of them have two different speakers (within files and across files too)....
Hello, I am trying to use my own YAML file with the command `from pyannote.database import get_protocol`, but it fails with "ImportError: cannot import name 'registry' from 'pyannote.database' (/home/xuan/anaconda3/envs/pyannote/lib/python3.8/site-packages/pyannote/database/__init__.py)". How can I solve this...
The `RTTMLoader` class is extremely slow for large RTTM files containing annotations for multiple audio files (e.g. the VoxCeleb dataset). We should make it faster!
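One possible direction, sketched here with the standard library only, is to read the RTTM file once and index speaker turns by file id, so that looking up one audio file no longer requires rescanning every line. This is an illustration and not how `RTTMLoader` is actually implemented; the helper name `group_rttm_by_uri` and the sample lines are hypothetical, and the column positions assume the usual RTTM layout (type, file id, channel, onset, duration, ..., speaker name in the eighth field).

```python
from collections import defaultdict
import io


def group_rttm_by_uri(lines):
    """Group speaker turns by file id in a single pass (hypothetical helper)."""
    turns = defaultdict(list)
    for line in lines:
        fields = line.split()
        # Only SPEAKER lines carry turns; skip blank or other line types.
        if len(fields) < 8 or fields[0] != "SPEAKER":
            continue
        uri = fields[1]
        onset = float(fields[3])
        duration = float(fields[4])
        speaker = fields[7]
        turns[uri].append((onset, onset + duration, speaker))
    return dict(turns)


# Made-up sample standing in for a large multi-file RTTM.
sample = io.StringIO(
    "SPEAKER fileA 1 0.00 1.50 <NA> <NA> alice <NA> <NA>\n"
    "SPEAKER fileB 1 0.25 2.00 <NA> <NA> bob <NA> <NA>\n"
    "SPEAKER fileA 1 2.00 0.50 <NA> <NA> carol <NA> <NA>\n"
)
index = group_rttm_by_uri(sample)
print(index["fileA"])
```

Building the index costs one pass over the file; each later lookup by file id is then a dictionary access.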
RTTM files sometimes contain a bunch of lines describing the list of speakers. Those are marked with `SPKR-INFO` in the first field. https://github.com/pyannote/pyannote-database/blob/11d8dcb72e54874fc9437cfbd7d944dcce7e9217/pyannote/database/util.py#L279
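As a rough sketch of how such lines could be handled, the snippet below separates `SPKR-INFO` metadata from `SPEAKER` turns before any further parsing. It is not the code at the linked location; the function name `split_rttm_lines` and the sample lines are hypothetical, and it assumes the speaker name sits in the eighth whitespace-separated field of a `SPKR-INFO` line.

```python
def split_rttm_lines(lines):
    """Separate SPEAKER turns from SPKR-INFO metadata (hypothetical helper)."""
    turns, declared_speakers = [], []
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # ignore blank lines
        if fields[0] == "SPKR-INFO":
            # Assumed layout: speaker name in the eighth field.
            if len(fields) >= 8:
                declared_speakers.append(fields[7])
        elif fields[0] == "SPEAKER":
            turns.append(line.rstrip("\n"))
    return turns, declared_speakers


# Made-up sample mixing both line types.
sample_lines = [
    "SPKR-INFO fileA 1 <NA> <NA> <NA> unknown alice <NA> <NA>\n",
    "SPEAKER fileA 1 0.00 1.50 <NA> <NA> alice <NA> <NA>\n",
    "\n",
    "SPEAKER fileA 1 2.00 0.50 <NA> <NA> alice <NA> <NA>\n",
]
turns, declared_speakers = split_rttm_lines(sample_lines)
print(len(turns), declared_speakers)
```

Keeping the declared speakers instead of discarding them leaves room to cross-check them against the labels that appear in the turns.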
Hey, @mmmaat has found this neat little lib: https://github.com/soundata/soundata Just leaving this here as a potential idea: wouldn't it be nice if there were wrappers for this library included...
Related to #99. Hi, I don't think `LABLoader` behaves as expected. If I call it directly, e.g. `loader = LABLoader(path='../only_words/labs/dev/{uri}.lab')`, it works smoothly. But when I use it with...