Osma Suominen
The analysis by @mj seems correct - if you have a separate `run` step, then it will use the PHP from the system. The solution is not to use a...
While I can't speak for the maintainers, I've previously implemented support for CONSTRUCT, DESCRIBE and ASK queries in the `hdtsparql` command line tool (see PR #27). Your best bet may...
I've also seen the converter create `@ru-cyrl` language tags where the `-cyrl` is redundant and forbidden by BCP 47. I've chosen to ignore them for now.
+1 for avoiding duplicate assertions. However, depending on the amount of indirection involved, this could happen naturally - if the assertion was simply e.g. `bf:mainLanguage "eng"` (see my proposal...
Some comments, from my perspective obviously. > Should the title property of the Work come from the 130/240 or the 245? In a translation, there are (at least) two Works...
BTW what's the relationship of this issue to #25?
This is great news! @nikopartanen and @mariguttorm are currently testing the Northern Sámi lemmatization on real-world example texts.
Thanks again on my part as well. I will close the issue.
> It is not surprising that CLD3 is faster than Lingua. CLD3 has been implemented in C++ whereas my library is pure Python only (with exception of the internally used...
Thanks @pemistahl for the update, that is great news! I will try to do a new round of experiments soon, comparing language filtering with either pycld3, Lingua or the recently...