lttoolbox
lttoolbox copied to clipboard
Finite state compiler, processor and helper tools used by apertium
Then we could have bidix entries like ``. This would require changes to the compiler to allow it, and presumably trim and compose, perhaps also fst_processor?
At the very least, getting Lower and Upper ranges would be nice, so we could ```xml \\p{Lu}\\p{Ll}+ ``` and whatnot. If we do the "simple" thing and just expand like...
I imagine it will be called `lt-reweight` It should have two arguments: 1) a binary lttoolbox file e.g. ` grn.automorf.bin ` 2) a tagged corpus `grn.tagged` ``` $ lt-reweight grn.automorf.bin...
opening an issue so https://github.com/apertium/lttoolbox/issues/82#issuecomment-646959621 doesn't get lost
``` PATTERNS (C* V)* C* V C* LEXICON C k LEXICON V o:ou ``` ```bash $ echo oko | lt-proc blah.bin oukou $ echo oko | lt-proc -t blah.bin oukouu...
We need to implement a way to represent infinite weights. The current outcome is strange! ``` $ cat sample.att 0 1 a b 2 1 2 b c 1 1...
``` $ echo '~Detla umanitat' | lt-proc -p oci.autopgen.bin L'umanitat $ echo '"~Detla umanitat' | lt-proc -p oci.autopgen.bin "l'umanitat $ echo '~que ~Detla umanitat' | lt-proc -p oci.autopgen.bin que l'umanitat...
Due to the way `` is handled, `ab` will match arbitrary tags but `abc` will not because the two ``s are not perfectly aligned. It should be possible to automatically...
`hfst-proc` behaviour (expected): ``` $ echo "с." | hfst-proc sah.automorf.hfst ^с./с.$ $ echo "с.1" | hfst-proc sah.automorf.hfst ^с./с.$^1/1/1/1+э$ ``` `lt-proc` behaviour (second one is unexpected): ``` $ echo "с." |...