silero-models
Feature request - [Slovenian language]
🚀 Feature
Please add support for the Slovenian language. A quality dataset is available here:
- audio: https://www.clarin.si/repository/xmlui/handle/11356/1776
- transcriptions: https://www.clarin.si/repository/xmlui/handle/11356/1772
Artur_B_Studio (inside the dataset) contains 50 hours of a single speaker recorded in a studio (high quality). In total there are 800 transcribed hours (multiple speakers, varying quality).
For the phonemizer, you can use espeak-ng with the "sl" language (the "slovenian" voice).
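To illustrate the espeak-ng suggestion, here is a minimal sketch that calls the `espeak-ng` command-line tool for IPA output with the "sl" voice. It assumes `espeak-ng` is installed and on `PATH`. The helper names `build_espeak_command` and `phonemize_sl` are hypothetical and are not part of silero-models.

```python
import shutil
import subprocess


def build_espeak_command(text, voice="sl"):
    """Build the espeak-ng argument list (hypothetical helper).

    -v selects the voice, -q suppresses audio playback,
    --ipa prints the phonemes in IPA to stdout.
    """
    return ["espeak-ng", "-v", voice, "-q", "--ipa", text]


def phonemize_sl(text):
    """Return the IPA string for Slovenian text, or None if unavailable."""
    # Assumption: espeak-ng is installed; if it is not, give up gracefully.
    if shutil.which("espeak-ng") is None:
        return None
    result = subprocess.run(
        build_espeak_command(text),
        capture_output=True,
        text=True,
    )
    if result.returncode != 0:
        return None
    return result.stdout.strip()


if __name__ == "__main__":
    print(phonemize_sl("Dober dan"))
```

The Python `phonemizer` package wraps the same espeak-ng backend, so `phonemize(text, language="sl", backend="espeak")` should be an equivalent route if that package is already a dependency.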