josh bowles

Results 12 comments of josh bowles

- [Punkt Tokenizer](https://github.com/ferristseng/rust-punkt) - [vtext General Tokenizer focused on Machine Learning applications](https://github.com/rth/vtext) Do we want to include string distance metrics as part of tokenization or a separate project? In this...

[Natural language detection for Rust with focus on simplicity and performance: whatlang-rs](https://github.com/greyblake/whatlang-rs) has been in development for 3 years; I've not used it but it looks solid.