Benjamin Minixhofer
Benjamin Minixhofer
I am trying to run a CTC Model in the browser. In keras, it looks like this: ``` __________________________________________________________________________________________________ Layer (type) Output Shape Param # Connected to ================================================================================================== the_input (InputLayer)...
This PR implements spellchecking using the algorithm described in [Error-tolerant Finite-state Recognition with Applications to Morphological Analysis and Spelling Correction](https://www.aclweb.org/anthology/J96-1003.pdf). I tried multiple other methods: - [SymSpell](https://github.com/wolfgarbe/SymSpell) which would probably...
There is now a benchmark in `bench/__init__.py`. It computes suggestions from LanguageTool via [language-tool-python](https://pypi.org/project/language-tool-python/) and NLPRule on 10k sentences from Tatoeba and compares the times. Heres's the output for German:...
Web demo
A web demo running client-side via WebAssembly would be really cool and useful. Ideally this should have: - a live text correction tool - text analysis like https://community.languagetool.org/analysis (this would...
This is the main modularization PR. Fixes #50. I've been quite busy lately but I've gotten around to doing what has become to some degree a rewrite now :) Now...
Currently the core is still largely in the state it was during prototyping. While abstractions are good and the code is clean documentation is missing in the internals. So the...
`rayon` is not necessary and should be behind a feature flag (like in e.g. [`ndarray`](https://docs.rs/ndarray/0.14.0/ndarray/#crate-feature-flags)). Resolving this issue in `rayon-cond` would make it trivial: https://github.com/cuviper/rayon-cond/issues/3 If this change does not...
As suggested originally by @drahnr (https://github.com/bminixhofer/nlprule/issues/2#issuecomment-786509451) we should consider splitting nlprule into multiple sub-crates. What's certain is that there will be one `nlprule` crate combining sub-crates into one higher level...
Roadmap
This meta-issue tracks what I plan to do with this library in the near future. I wrote this up to make it possible to comment on the direction and priorities...
`Tokenizer.pipe`, `Rules.suggest` and maybe some other methods should return an iterator instead of a `Vec` for more flexibility.