giraph icon indicating copy to clipboard operation
giraph copied to clipboard

EdgeListFormat conversion to JSONLongDoubleFloatDoubleVertexInputFormat

Open skymanaditya1 opened this issue 9 years ago • 2 comments

Giraph jobs e.g (SingleSourceShortestPaths, PageRank) accepts input in JsonLongDoubleFloatDoubleVertexInputFormat. Stanford Analysis Network Platform (SNAP) contains a huge variety of network graphs that Giraph jobs can be employed upon. The big disadvantage is the Input is in Simple Edge List Format. Since Giraph does not inherently supports a simple edge list format. This java class converts the SimpleEdgeList file to JSON file format accepted by giraph

skymanaditya1 avatar Jul 31 '15 21:07 skymanaditya1

Probably useful but pretty badly implemented. Try running on a large-ish dataset (https://snap.stanford.edu/data/cit-Patents.html)

PayasR avatar Dec 05 '16 11:12 PayasR

Hahaha yeah, this was when I was in my second year. You should check out my python implementation. Thanks for pointing out @PayasR , I will update it.

skymanaditya1 avatar Dec 05 '16 16:12 skymanaditya1