parallel-corpus topic

List parallel-corpus repositories

indonesian-NLP-resources

220
Stars
50
Forks
Watchers

data resource untuk NLP bahasa indonesia

Classical-Modern

923
Stars
199
Forks
Watchers

非常全的文言文(古文)-现代文平行语料

banglanmt

145
Stars
45
Forks
Watchers

This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Pr...

Cross-Language-Dataset

60
Stars
21
Forks
Watchers

A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection

nepali-translator

47
Stars
16
Forks
Watchers

Neural Machine Translation on the Nepali-English language pair

Indian_ParallelCorpus

29
Stars
3
Forks
Watchers

Curated list of publicly available parallel corpus for Indian Languages

bertalign

88
Stars
40
Forks
Watchers

Multilingual sentence alignment using sentence embeddings

astred

18
Stars
0
Forks
Watchers

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text,...

TALPCo

48
Stars
13
Forks
Watchers

TUFS Asian Language Parallel Corpus