Daniel Swanson

Results 98 comments of Daniel Swanson

Due to -pes having been fixed, eng-pes now works. pes-eng, however, is still broken due to an invalid transfer file.

ICU provides an a way to define custom normalizations. The documentation isn't terribly helpful, but it looks to me like we just need to edit https://github.com/unicode-org/icu/blob/master/icu4c/source/data/unidata/norm2/nfc.txt to make a more...

> Excellent point, I am personally not very worried about Ångström sign but there might be something useful there as well... Perhaps we should go through the list cooperatiively somehow,...

https://gist.github.com/mr-martian/11dd5c4dad3861b55054a209393c1e0c And here's just the unconditional replacements, since I expect that's the part we're most interested in editing.

I would expect it to be in conjunction with format handling (either before or after, not sure which).

Inserting a normalizer between deformatting and analysis would handle it without requiring every deformatter to be updated and also deals with the issue (that I guess was discussed on IRC...

Are all our files pretty much the same license so that this could be automated? 1. Look up comment char by file suffix 2. Check for shebang 3. Add SPDX...

Does this look good? ```python3 import sys import os xml_comment = '\n' hfst_comment = '! %s\n' cpp_comment = '// %s\n' other_comment = '# %s\n' license = 'SPDX-License-Identifier: GPL-3.0-or-later' lines =...

I knew about -fin and was assuming that whatever this spit out would require manual checking, but would at least be mostly correct for language repos.

The lexc and lexd ones seem awkwardly long to me. The others look good though. https://github.com/mr-martian/apertium-lint/issues/2 might also be relevant if there is some question about the proper way to...