lttoolbox
Finite state compiler, processor and helper tools used by apertium
Either (1) update the code so that we never get these errors/warnings: ``` Error: Invalid dictionary (hint: the left side of an entry is empty) Error: Invalid dictionary (hint: entry...
This is a bit bikeshedding, but I think that we should change the epsilon symbol in lt-print from `ε` either to `` or to `@0@`. It probably needs to change...
I've attached an excerpt of about 25 lines that causes apertium to be busy for a long time and eventually crash with a bad_alloc exception. Troublesome excerpt: [apertium-bug-ca-es-bad-alloc.txt](https://github.com/apertium/apertium/files/4824939/apertium-bug-ca-es-bad-alloc.txt) I'm feeding...
It would be cool to be able to set the default weight for a section, e.g. ``` ... ```
``` $ lt-comp lr apertium-scn.scn.dix scn.automorf.bin apertium-scn.scn.acx Error: Trying to link nonexistent states (-1, 21, 0) ``` ``` $ lt-comp lr apertium-scn.scn.dix scn.automorf.bin apostrophes@postblank 166 230 final@inconditional 34 117 main@standard...
In addition to #78, it would be great to have a tool, let's call it `lt-segment`, that would calculate a segment vocabulary from a `.dix` file. E.g. ``` ... s...
It would be cool to be able to define, on a per-transducer, language-specific basis, certain characters which can appear anywhere in the stream but that don't affect the...
Before https://github.com/apertium/lttoolbox/commit/944ed2556c38f058a5118ab5e481b3412aa3e3d8 / https://github.com/apertium/lttoolbox/pull/52 it was possible to use monodix files with an empty `` in order to segment into all known analyses (presumably symbols without analyses were output as...
It would be useful to run a roundtrip test for `lt-print` and `lt-comp` over the available dictionaries, to make sure that nothing segfaults or causes infinite loops. See e.g....
Currently, weights are specified using the `w` attribute. Using it in a bidix gives an entry the same weight for both directions, potentially producing unwanted effects if there are lexical...