daac-tools

Results 8 repositories owned by daac-tools

daachorse

190
Stars
12
Forks
Watchers

🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

vaporetto

218
Stars
10
Forks
Watchers

🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

vibrato

303
Stars
14
Forks
Watchers

🎤 vibrato: Viterbi-based accelerated tokenizer

crawdad

27
Stars
2
Forks
Watchers

🦞 Rust library of natural language dictionaries using character-wise double-array tries.

python-vaporetto

20
Stars
1
Forks
Watchers

🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.

python-vibrato

34
Stars
1
Forks
Watchers

Viterbi-based accelerated tokenizer (Python wrapper)

trie-match

31
Stars
0
Forks
Watchers

Fast match expression optimized for string comparison

find-simdoc

56
Stars
3
Forks
Watchers

Finding all pairs of similar documents time- and memory-efficiently