nlprule icon indicating copy to clipboard operation
nlprule copied to clipboard

A fast, low-resource Natural Language Processing and Text Correction library written in Rust.

Results 26 nlprule issues
Sort by recently updated
recently updated
newest added

Many useful LanguageTool rules are written in Java and not in the XML rule format (e.g. the [A-Vs-AN rule](https://github.com/languagetool-org/languagetool/blob/master/languagetool-language-modules/en/src/main/java/org/languagetool/rules/en/AvsAnRule.java) which is kind of essential for me, because I always forget...

The biggest issue using this library currently is the fact, that on each startup a _lot_ of regular expressions are compiled. If `regex` (or whatever crate being used) implements serialization...

P2

Currently the core is still largely in the state it was during prototyping. While abstractions are good and the code is clean documentation is missing in the internals. So the...

P1

`rayon` is not necessary and should be behind a feature flag (like in e.g. [`ndarray`](https://docs.rs/ndarray/0.14.0/ndarray/#crate-feature-flags)). Resolving this issue in `rayon-cond` would make it trivial: https://github.com/cuviper/rayon-cond/issues/3 If this change does not...

P3

As suggested originally by @drahnr (https://github.com/bminixhofer/nlprule/issues/2#issuecomment-786509451) we should consider splitting nlprule into multiple sub-crates. What's certain is that there will be one `nlprule` crate combining sub-crates into one higher level...

P1

This meta-issue tracks what I plan to do with this library in the near future. I wrote this up to make it possible to comment on the direction and priorities...

help wanted

I love this library already, I've been looking for something like this for a project of mine for months now! However, I saw the README said this about the project:...

enhancement
P1

`Tokenizer.pipe`, `Rules.suggest` and maybe some other methods should return an iterator instead of a `Vec` for more flexibility.

good first issue
P3
api

More languages were understandably already requested multiple times (e. g. #14). This issue tracks progress in this area. The primary goal is to make it easy for contributors to add...

P2