low-resource-languages topic

List low-resource-languages repositories

Exploring the Limits of Low-Resource Neural Machine Translation

GlotLID

76 stars, 6 forks

GlotLID: Language Identification with Support for More Than 2000 Labels -- EMNLP 2023

Turkish-Speech-to-Text

28 stars, 1 fork

Fine-tuning for automatic speech recognition on low-resource languages with a character-based CTC model

BembaSpeech

30 stars, 2 forks

This is an ASR corpus for the Bemba language. It contains read speech from diverse publicly available Bemba sources: literature books, radio/TV show transcripts, YouTube video transcripts, online sources...

vad-sli-asr

18 stars, 3 forks

A pipeline to isolate and transcribe one language in mixed-language speech

thesis

20 stars, 4 forks

My thesis on "Open Source Code and Low Resource Languages" for an MSc in Language Science and Technology at Saarland University

relm_unmt

35 stars, 3 forks

Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".