BUET CSE NLP Group

Results 7 repositories owned by BUET CSE NLP Group

xl-sum

245
Stars
42
Forks
Watchers

This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Compu...

banglabert

229
Stars
31
Forks
Watchers

This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining an...

banglanmt

145
Stars
45
Forks
Watchers

This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Pr...

BanglaNLG

80
Stars
11
Forks
Watchers

This repository contains the official release of the model "BanglaT5" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaNLG: Benchmarks and Resources for Eva...

CoDesc

48
Stars
9
Forks
Watchers

A large dataset of 4.2m Java source code and parallel data of their description from code search, and code summarization studies.

CrossSum

48
Stars
7
Forks
Watchers

This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for 1,500+ Language Pairs" published in Proceedings of the 61st An...

normalizer

34
Stars
7
Forks
Watchers

This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine trans...