Francis Tyers
Francis Tyers
It would be cool to be able to define, on a per-transducer basis language specific basis certain characters which can appear anywhere in the stream but that don't effect the...
It would be useful to try and run a roundtrip test for `lt-print` and `lt-comp` over available dictionary, to make sure that nothing segfaults or causes infinite loops. See e.g....
A tool should be included in lttoolbox which calculates a BPE vocabulary as defined in this paper: https://arxiv.org/pdf/1508.07909.pdf The idea is to use BPE to weight our morphological transducers.
``` fran@matxine:~/source/apertium/staging/apertium-mlt-heb$ make apertium-validate-dictionary apertium-mlt-heb.mlt-heb.dix lt-comp rl apertium-mlt-heb.mlt-heb.dix heb-mlt.autobil.bin lt-trim .deps/heb.automorf.bin heb-mlt.autobil.bin heb-mlt.automorf.bin Error: empty set of final states Makefile:764: recipe for target 'heb-mlt.automorf.bin' failed make: *** [heb-mlt.automorf.bin] Error 1...
It seem a bit of overkill to have two separate file formats for this stuff.
``` apertium-ady apertium-ain apertium-alt apertium-asm apertium-aze apertium-bas apertium-bis apertium-btc apertium-byv apertium-cak apertium-chm apertium-ckb apertium-ckt apertium-cnh apertium-cro apertium-dar apertium-dlg apertium-dsb apertium-epo apertium-frp apertium-gld apertium-gle apertium-glk apertium-guj apertium-hau apertium-haw apertium-hsb apertium-hun apertium-ibo...
We should write some general guidelines for releases.
There is lots of dead wood in the repository. For example most of the pairs in incubator that are `*-eng` are unmaintained because they were generated automatically for a workshop...
Not sure if this is the right place for it, but it would be cool to add phonemisers to Apertium in a standard way. E.g. ``` $ echo "sibʼalaj" |...
https://github.com/abhinav-surendra/apertium-eng-kan