Francis Tyers comments

Results: 288 comments by Francis Tyers

Implement a tool to calculate a BPE vocabulary

@anjalibhavan well, the first version would be just the algorithm as described in the paper. Later the tool would support weighting lttoolbox transducers according to the vocabulary of the tool.
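
For orientation, here is a minimal sketch of byte-pair encoding (BPE) vocabulary learning as it is commonly described: repeatedly merge the most frequent adjacent symbol pair. It is not the tool proposed in this issue. The names `learn_bpe` and `merge_pair`, the `</w>` end-of-word marker, and the tie-breaking rule are all assumptions made for illustration.

```python
from collections import Counter


def merge_pair(symbols, pair):
    """Replace every adjacent occurrence of `pair` in `symbols` with one merged symbol."""
    out = []
    i = 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def learn_bpe(words, num_merges):
    """Learn up to `num_merges` merge operations from an iterable of words.

    Hypothetical helper: each word starts as a sequence of characters plus an
    end-of-word marker, and the most frequent adjacent pair is merged each round.
    """
    vocab = Counter(tuple(w) + ("</w>",) for w in words)
    merges = []
    for _ in range(num_merges):
        # Count adjacent symbol pairs, weighted by word frequency.
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break  # every word is already a single symbol
        # Ties are broken by comparing the pairs themselves (an assumption,
        # chosen only to make the result deterministic).
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            new_vocab[merge_pair(symbols, best)] += freq
        vocab = new_vocab
    return merges


if __name__ == "__main__":
    corpus = ["low", "low", "lower", "newest", "newest", "newest"]
    print(learn_bpe(corpus, 5))
```

The learned merge list is the BPE vocabulary in the sense that applying the merges in order segments any new word into subword units.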

Implement a tool to calculate a BPE vocabulary

There is an existing Python implementation at https://github.com/rsennrich/subword-nmt

More Consistent GET variables?

The way to do this right now is basically to use vocabulary coverage over a corpus. This is the best indicator of the quality of a pair. This is something...
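
As a rough illustration of vocabulary coverage over a corpus, the sketch below computes the share of corpus tokens that appear in a known word list. It assumes whitespace tokenisation and a plain set of known word forms. The function name `coverage` is hypothetical, and a real measurement for a language pair would run the pair's own analyser rather than a set lookup.

```python
from collections import Counter


def coverage(corpus_tokens, vocabulary):
    """Return the fraction of tokens in `corpus_tokens` that occur in `vocabulary`.

    Naive token-level coverage (an assumption for illustration): every
    occurrence counts, so frequent unknown words lower the score more.
    """
    counts = Counter(corpus_tokens)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    known = sum(c for token, c in counts.items() if token in vocabulary)
    return known / total


if __name__ == "__main__":
    tokens = "the cat sat on the mat".split()
    known_words = {"the", "cat", "on"}
    print(f"{coverage(tokens, known_words):.2%}")
```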

More Consistent GET variables?

@Ryu945 I can't do it, but you could! :) I don't expect that such a script should take longer than an hour or two to write.

lt-print segfaults on heb-mlt.autobil.bin

```
fran@matxine:~/source/apertium/staging/apertium-mlt-heb$ gdb lt-print
GNU gdb (Debian 8.1-4+b1) 8.1
Copyright (C) 2018 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later
This is free software: you are...
```

lt-print segfaults on heb-mlt.autobil.bin

Ok, this seems like a classic error and might be a duplicate. Here is the fix:

```
diff --git a/apertium-mlt-heb.mlt-heb.dix b/apertium-mlt-heb.mlt-heb.dix
index 8c75d3d..d9dd506 100644
--- a/apertium-mlt-heb.mlt-heb.dix
+++ b/apertium-mlt-heb.mlt-heb.dix
@@ -88,6...
```

apertium crashes with bad_alloc

New section type that doesn't minimise

Use Github readme tag things and tags to tag unmaintained stuff

Write a utility to assign weights to a compiled transducer based on a corpus