Osma Suominen

Results 374 comments of Osma Suominen

The analysis by @mj seems correct - if you have a separate `run` step, then it will use the PHP from the system. The solution is not to use a...

While I can't speak for the maintainers, I've previously implemented support for CONSTRUCT, DESCRIBE and ASK queries in the `hdtsparql` command line tool (see PR #27). Your best bet may...

I've also seen the converter create `@ru-cyrl` language tags where the `-cyrl` is redundant and forbidden by BCP 47. I've chosen to ignore them for now.

+1 for avoiding duplicate assertions. However, depending on the amount of indirection involved, this could happen naturally - if the assertions was simply e.g. ` bf:mainLanguage "eng"` (see my proposal...

Some comments, from my perspective obviously. > Should the title property of the Work come from the 130/240 or the 245? In a translation, there are (at least) two Works...

BTW what's the relationship of this issue to #25?

This is great news! @nikopartanen and @mariguttorm are currently testing the Northern Sámi lemmatization on real world example texts.

Thanks again from my part as well. I will close the issue.

> It is not surprising that CLD3 is faster than Lingua. CLD3 has been implemented in C++ whereas my library is pure Python only (with exception of the internally used...

Thanks @pemistahl for the update, that is great news! I will try to do a new round of experiments soon, comparing language filtering with either pycld3, Lingua or the recently...