Umar Butler

Results 3 repositories owned by Umar Butler

open-australian-legal-corpus-creator

60
Stars
7
Forks
Watchers

The code used to create and update the Open Australian Legal Corpus, the first and only multijurisdictional open corpus of Australian legislative and judicial documents.

semchunk

158
Stars
9
Forks
Watchers

A fast and lightweight pure Python library for splitting text into semantically meaningful chunks.

orjsonl

34
Stars
2
Forks
Watchers

A lightweight, high-performance Python library for parsing jsonl files.