ontology-tracker

page rank

Open VladimirAlexiev opened this issue 7 years ago • 10 comments

(Split from #19)

@chile12 said: I'd suggest a new property like: dbo:pageRankValue = http://purl.org/voc/vrank#hasRank ° http://purl.org/voc/vrank#rankValue (see http://www.aifb.kit.edu/images/4/4d/Dbrank2012.pdf). This will enable us to create page-rank datasets in NT format with just one triple per resource. At the moment it looks like this:

<http://dbpedia.org/resource/!!!_(Chk_Chk_Chk)>
  <http://purl.org/voc/vrank#hasRank>
    [ <http://purl.org/voc/vrank#rankValue>
      "0.15"^^<http://www.w3.org/2001/XMLSchema#float> ] .

Vladimir: I'd say simplify the name: dbo:pageRank (range xsd:double).
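
For illustration, here is a minimal Python sketch of what "one triple per resource" could look like in NT format. It assumes the proposed name dbo:pageRank with range xsd:double; that property is only a suggestion in this thread, and the helper name flat_rank_triple is hypothetical.

```python
# Assumption: dbo:pageRank is the name proposed above, not a confirmed property.
PROPOSED_PREDICATE = "http://dbpedia.org/ontology/pageRank"
XSD_DOUBLE = "http://www.w3.org/2001/XMLSchema#double"


def flat_rank_triple(resource_iri, rank, predicate_iri=PROPOSED_PREDICATE):
    """Format a single N-Triples line carrying the rank as a typed literal."""
    return f'<{resource_iri}> <{predicate_iri}> "{float(rank)}"^^<{XSD_DOUBLE}> .'


line = flat_rank_triple("http://dbpedia.org/resource/!!!_(Chk_Chk_Chk)", 0.15)
print(line)
```

Compared with the vrank form quoted above, this needs no blank node, so each resource fits on a single line of an NT dump.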

And how about computing RDF Rank? http://graphdb.ontotext.com/documentation/standard/rdf-rank.html. We've found such a measure quite useful for displaying "top" entities satisfying some condition. E.g., try this factforge query.

VladimirAlexiev avatar Sep 07 '17 07:09 VladimirAlexiev

That is the idea. Until now we were provided with the page-rank dataset. Starting with this release we will compute it on our own.

chile12 avatar Oct 02 '17 09:10 chile12

@chile12 To clarify: will your rank be based on RDF resource links, and not only page links?

VladimirAlexiev avatar Oct 04 '17 08:10 VladimirAlexiev

i'm not entirely sure what's planned either... both would be interesting, but computing page-rank on (mostly) human generated links (e.g., dbo:wikiPageWikiLinks) is probably more meaningful

joernhees avatar Oct 04 '17 09:10 joernhees

We will probably go with: https://github.com/athalhammer/danker by Andreas Thalhammer. Which would need minimal involvement from our side. Please have a look and share your thoughts.

chile12 avatar Oct 04 '17 09:10 chile12

yepp, i think that works on pagelinks only... https://github.com/athalhammer/danker/blob/master/lib/createLinks.sh downloads the files, resolves redirects etc... probably you have all necessary information already during release and could just re-use them... all the sort calls can probably be sped up a lot with -S 2G if you can spare 2 GB of RAM

joernhees avatar Oct 04 '17 11:10 joernhees

Yes, experience shows that the ranks of ontological/organizational concepts (such as male, English, etc.) are skyrocketing in pure RDF settings. This is often not useful for applications such as autocomplete and the like. I'm currently experimenting with mixed settings (see experimental branch of danker) which provide more coverage.
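
To make the effect concrete, here is a small power-iteration PageRank sketch in Python. It is not danker's implementation; the damping factor, the iteration count, and the toy graph (three hypothetical entities that all link to one shared concept node, "English") are assumptions chosen only for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Plain power-iteration PageRank over a dict {node: [link targets]}."""
    nodes = set(links)
    for targets in links.values():
        nodes.update(targets)
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing links is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if not links.get(node))
        new_rank = {node: (1.0 - damping) / n + damping * dangling / n for node in nodes}
        for source, targets in links.items():
            if not targets:
                continue
            share = damping * rank[source] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank


# Toy graph (hypothetical): every entity points at the shared concept "English".
toy_links = {
    "person_a": ["person_b", "English"],
    "person_b": ["person_a", "English"],
    "person_c": ["English"],
    "English": [],
}
ranks = pagerank(toy_links)
top_node = max(ranks, key=ranks.get)
print(top_node, ranks)
```

In this toy graph the shared concept node ends up with the highest rank, simply because every entity links to it.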

@chile12 I think we can extend the vrank vocabulary accordingly. Let me check whether it is possible to use the old namespace for that.

@joernhees pull requests welcome ;)

Andreas

athalhammer avatar Oct 16 '17 05:10 athalhammer

I made some suggestions on what the new property could look like. I would be happy about any feedback on https://github.com/6020peaks/vRank/issues/1

athalhammer avatar Oct 19 '17 17:10 athalhammer

I opened a pull request on vrank for a new release planned for 31.10.2017:

https://github.com/6020peaks/vRank/pull/2

I'm not entirely sure about the property chain axiom and the implications, so reviews are welcome.

athalhammer avatar Oct 25 '17 20:10 athalhammer

Dear all,

We just released a new version of vRank (including vrank:pagerank):

https://github.com/6020peaks/vRank/releases/tag/v2.0

Happy Halloween!

athalhammer avatar Oct 31 '17 19:10 athalhammer

Hey, is this issue still open? Are there any improvements needed in the existing implementation?

reeshabhranjan avatar Mar 05 '20 10:03 reeshabhranjan