vosk-api
Add aligner
The aligner is a program that aligns the words of a transcript in time with the audio file in which they are spoken. The Gentle project used m3.cc and k3.cc as language and acoustic models for alignment. These approaches were reworked into the aligner, which makes it possible to use different language models and speeds up the alignment process. setup.py was also updated so that the aligner can be run from any directory, not only from the folder that contains it.
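To illustrate what "aligning words in time" means, here is a minimal sketch that is not the code in this PR. It assumes a recognizer has already produced words with timings, as dictionaries with "word", "start" and "end" keys (an assumed shape, chosen to resemble word-level recognizer output), and it uses a plain sequence diff from the Python standard library rather than the language and acoustic models described above. The function name align_transcript is hypothetical.

```python
from difflib import SequenceMatcher


def align_transcript(transcript_words, recognized):
    """Give each transcript word the timing of its matching recognized word.

    transcript_words: list of str, the known text in order.
    recognized: list of dicts with "word", "start", "end" (assumed shape).
    Words with no match keep start=None and end=None.
    """
    ref = [w.lower() for w in transcript_words]
    hyp = [r["word"].lower() for r in recognized]
    aligned = [{"word": w, "start": None, "end": None} for w in transcript_words]

    # Diff the two word sequences; autojunk is disabled so that frequent
    # words are not discarded on long inputs.
    matcher = SequenceMatcher(a=ref, b=hyp, autojunk=False)
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            hit = recognized[block.b + k]
            aligned[block.a + k]["start"] = hit["start"]
            aligned[block.a + k]["end"] = hit["end"]
    return aligned


if __name__ == "__main__":
    # Made-up timings, for illustration only.
    recognized = [
        {"word": "glorious", "start": 0.1, "end": 0.6},
        {"word": "they", "start": 0.6, "end": 0.8},
        {"word": "today", "start": 0.8, "end": 1.2},
    ]
    for entry in align_transcript(["Glorious", "day", "today"], recognized):
        print(entry)
```

In this sketch a misrecognized word ("day" heard as "they") simply stays untimed. A real aligner would need a strategy for such gaps.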
How to use:
- Download a language model.
- Prepare a .wav file and a .txt file.
- When starting the program, specify the required arguments: a) path to the .wav file; b) path to the .txt file; c) path to the language model.
Example (how to run): python3 vosk_align.py example/glorious.wav example/glorious.txt example/model
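For anyone who wants to script the run, the three arguments can be checked before the aligner is started. This is only a sketch: the helper name build_align_command is hypothetical, and it assumes vosk_align.py takes exactly the three positional arguments shown in the example, in that order.

```python
import sys
import wave
from pathlib import Path


def build_align_command(wav_path, txt_path, model_path, script="vosk_align.py"):
    """Validate the three inputs and return the argv list for the aligner."""
    wav, txt, model = Path(wav_path), Path(txt_path), Path(model_path)
    if not wav.is_file():
        raise FileNotFoundError(f"audio file not found: {wav}")
    if not txt.is_file():
        raise FileNotFoundError(f"text file not found: {txt}")
    if not model.is_dir():
        raise NotADirectoryError(f"model directory not found: {model}")

    # Opening the file with the wave module fails early (wave.Error)
    # if it is not a valid WAV file.
    with wave.open(str(wav), "rb") as wf:
        if wf.getnframes() == 0:
            raise ValueError(f"audio file contains no frames: {wav}")

    # Assumed argument order: wav file, text file, model directory.
    return [sys.executable, script, str(wav), str(txt), str(model)]
```

The returned list can be passed to subprocess.run(cmd, check=True) from the directory that contains vosk_align.py.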
I'm also waiting for tests for the aligner so we can automatically verify the code.
In testing this locally, there needs to be an empty __init__.py file at python/vosk/aligner/__init__.py, otherwise I would get ModuleNotFoundError: No module named 'vosk.aligner' when trying to run vosk-aligner or vosk_align.py.
@ryanfb thx for the info. I will fix it.
I'm really interested in this PR, is there anything I can do to help?
@Laurian just ping me if I forget please, I'll try to merge it
I am very interested in this as well. What can I do to help?
@nshmyrev ping 🙏
@nshmyrev Ping, hoping this gets merged. I've been anxiously excited for this merge since last year. What can the community do to help?
@nshmyrev Really hoping this can get merged.