Benjamin Minixhofer

Results 28 issues of Benjamin Minixhofer

I am trying to run a CTC Model in the browser. In keras, it looks like this: ``` __________________________________________________________________________________________________ Layer (type) Output Shape Param # Connected to ================================================================================================== the_input (InputLayer)...

This PR implements spellchecking using the algorithm described in [Error-tolerant Finite-state Recognition with Applications to Morphological Analysis and Spelling Correction](https://www.aclweb.org/anthology/J96-1003.pdf). I tried multiple other methods: - [SymSpell](https://github.com/wolfgarbe/SymSpell) which would probably...

There is now a benchmark in `bench/__init__.py`. It computes suggestions from LanguageTool via [language-tool-python](https://pypi.org/project/language-tool-python/) and NLPRule on 10k sentences from Tatoeba and compares the times. Heres's the output for German:...

A web demo running client-side via WebAssembly would be really cool and useful. Ideally this should have: - a live text correction tool - text analysis like https://community.languagetool.org/analysis (this would...

help wanted
P3

This is the main modularization PR. Fixes #50. I've been quite busy lately but I've gotten around to doing what has become to some degree a rewrite now :) Now...

Currently the core is still largely in the state it was during prototyping. While abstractions are good and the code is clean documentation is missing in the internals. So the...

P1

`rayon` is not necessary and should be behind a feature flag (like in e.g. [`ndarray`](https://docs.rs/ndarray/0.14.0/ndarray/#crate-feature-flags)). Resolving this issue in `rayon-cond` would make it trivial: https://github.com/cuviper/rayon-cond/issues/3 If this change does not...

P3

As suggested originally by @drahnr (https://github.com/bminixhofer/nlprule/issues/2#issuecomment-786509451) we should consider splitting nlprule into multiple sub-crates. What's certain is that there will be one `nlprule` crate combining sub-crates into one higher level...

P1

This meta-issue tracks what I plan to do with this library in the near future. I wrote this up to make it possible to comment on the direction and priorities...

help wanted

`Tokenizer.pipe`, `Rules.suggest` and maybe some other methods should return an iterator instead of a `Vec` for more flexibility.

good first issue
P3
api