soweego
soweego copied to clipboard
Investigate Wikidata graph embeddings
A bleeding-edge research effort has produced Wikidata graph embeddings that could be useful to improve the performance of our probabilistic linkers.
References:
- Facebook research post: https://ai.facebook.com/blog/open-sourcing-pytorch-biggraph-for-faster-embeddings-of-extremely-large-graphs;
- Pytorch-BigGraph: https://github.com/facebookresearch/PyTorch-BigGraph;
- pre-trained Wikidata embeddings: https://dl.fbaipublicfiles.com/torchbiggraph/wikidata_translation_v1.tsv.gz;
- SocialLink with graph embeddings: https://rdcu.be/58fm
On the target catalog side, we could leverage simple relations between entities to build comparable graph embeddings, e.g., the knownForTitles and primaryProfession in IMDb.