Maarten van Gompel

Results 365 comments of Maarten van Gompel

Partially addressed for first release, may need further work

This may be narrower than your domain, but for research software in the (digital) humanities, there is https://vocabs.dariah.eu/tadirah/en/ . In our project we use it with codemeta.

This looks very useful and works in `exec`. However, I notice that this variable is not set in `on-click` etc, it would help if it's also available there.

> One can access the last token with some basic scripting from the tokenizer, but the token's hyphenation information is not (that trivially) accessible. Or ist it? I think a...

Not at bad idea at all, the information is availabe inside ucto after all (and propagated to the FoliA output), so we could do something similar for the python binding.

Those lists are part of the [uctodata](https://github.com/LanguageMachines/uctodata) repository, and referred to from the individual configuration files (like ``tokconfig-nld``). Contributions there are welcome of course! (you can just send a pull...

Simply adding them to ``deu.abr`` should work yes, but those changes may be overwritten on LaMachine update again. Alternatively you could make your own ucto configuration (copy ``tokconfig-deu``) and refer...

This is already built-in functionality. You can just request FoLiA output from python-ucto using `foliaoutput=True`, see the example in the README.

Ah ok, you're feeding parts to the tokenizer on the fly, that probably doesn't combine well with `foliaoutput=True` indeed, as that produces entire documents for the input. You're on the...