Patrice Lopez

Results 390 comments of Patrice Lopez

Also consider the crosswiki resource for setting priors, although it is very noisy.
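To make the idea of "setting priors" concrete, here is a minimal sketch of a mention-to-entity prior estimated from anchor-style counts. It is only an illustration under stated assumptions: the input triples, the function name `mention_priors`, the `min_count` noise filter and the entity IDs are all hypothetical, and nothing here reflects how the crosswiki resource is actually formatted or how entity-fishing computes its priors.

```python
from collections import Counter, defaultdict


def mention_priors(anchor_counts, min_count=1):
    """Estimate P(entity | mention) from (mention, entity_id, count) triples.

    Pairs seen fewer than `min_count` times are dropped first, as a crude
    (assumed) way to reduce noise in the counts.
    """
    per_mention = defaultdict(Counter)
    for mention, entity_id, count in anchor_counts:
        if count < min_count:
            continue  # skip rare, likely noisy pairs
        per_mention[mention.lower()][entity_id] += count

    priors = {}
    for mention, counts in per_mention.items():
        total = sum(counts.values())
        # Normalise the counts so the priors for one mention sum to 1.
        priors[mention] = {e: c / total for e, c in counts.items()}
    return priors


# Hypothetical counts: two frequent candidates and one rare, noisy one.
sample = [("Paris", "Q_A", 90), ("Paris", "Q_B", 10), ("paris", "Q_C", 1)]
priors = mention_priors(sample, min_count=2)
```

With `min_count=2`, the rare `Q_C` pair is discarded before normalisation, so it does not dilute the priors of the two frequent candidates.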

See also #72, where a morphosyntactic variant is identified as a Wikidata label but is not in the Wikipedia vocabulary.

For some languages with a mid-sized Wikipedia, such as Arabic, it seems that the existing "redirects" clearly do not provide enough morphosyntactic variants.

Hello! I can't reproduce this error. It might be limited to a previous version or caused by loading issues. ![Screenshot from 2022-05-03 23-18-37](https://user-images.githubusercontent.com/2340795/166568094-eb6e4eb8-9912-4fe3-bf44-e29879e5be05.png)

`prob_i` requires another LMDB; Wikidata properties have not been experimented with yet.

Hello! Another issue with Finnish is the very rich morphology of the language; we would probably need a dedicated morphological analyzer to get enough mentions and candidates prior to disambiguation....

There are solutions for sure; my point was rather that supporting Finnish won't be just a matter of importing the fiwiki data and retraining. It has an impact on the current...

Thank you Olivier, yes, you're right, it's a much better idea to use the Wikidata English label as the value name. This is more consistent, but afaik there is no guarantee to...

It could be in entity-fishing. You may have seen that the labels are not loaded right now, but the statements are loaded from the Wikidata json dump file directly in...
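As an illustration of what reading labels from a Wikidata JSON dump can look like, here is a minimal sketch. It relies on the public dump layout (one large JSON array with one entity per line, each entity carrying a `labels` map keyed by language code). The function name, the `SUPPORTED_LANGS` tuple, the sample entity and the in-memory dict standing in for a key-value store are assumptions made for the example, not entity-fishing code.

```python
import json

# Hypothetical set of supported languages, for illustration only.
SUPPORTED_LANGS = ("en", "fr", "de")


def labels_from_dump_line(line, langs=SUPPORTED_LANGS):
    """Return (entity_id, {lang: label}) for one dump line, or None.

    Dump lines are elements of one big JSON array, so the surrounding
    brackets and trailing commas have to be skipped or stripped.
    """
    line = line.strip().rstrip(",")
    if line in ("", "[", "]"):
        return None
    entity = json.loads(line)
    labels = entity.get("labels", {})
    found = {lang: labels[lang]["value"] for lang in langs if lang in labels}
    return entity["id"], found


# A made-up entity in the dump's shape; a dict stands in for the store.
sample_lines = [
    "[",
    '{"id": "Q0", "labels": {"en": {"language": "en", "value": "example"}, '
    '"fi": {"language": "fi", "value": "esimerkki"}}},',
    "]",
]
label_store = {}
for raw in sample_lines:
    parsed = labels_from_dump_line(raw)
    if parsed is not None:
        entity_id, labels = parsed
        label_store[entity_id] = labels
```

Labels in languages outside `langs` are ignored, so the store only grows with the languages that are actually served.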

Fixed with #142, without a hacky workaround: we create an additional LMDB for the Wikidata labels of the supported languages. Concept look-up for Q492038 -> ``` Statements: --- subclass of |...