incubator-graphar icon indicating copy to clipboard operation
incubator-graphar copied to clipboard

[Feat] Benchmarking with some common data format used in graph

Open acezen opened this issue 11 months ago • 2 comments

Is your feature request related to a problem? Please describe. Add some benchmark report about comparison with some common format used in graph like CSV, Parquet, ORC and json

Describe the solution you'd like benchmark

  • Size
  • Read / Write with C++ SDK
  • Get basic stats(min, max, vertex num, edge num)
  • Construct CSR/CSC structure
  • Neighborhood retrieval

datasets (to keep align to production environment)

  • twitter2010
  • com-friendster
  • ldbc snb

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

Additional context Add any other context or screenshots about the feature request here.

acezen avatar Mar 05 '24 07:03 acezen

I would also like to add something about neighborhood retrieval. For example, stats (min, max, etx.) for an ego-network.

SemyonSinchenko avatar Mar 05 '24 07:03 SemyonSinchenko

I would also like to add something about neighborhood retrieval. For example, stats (min, max, etx.) for an ego-network.

neighborhood retrieval is indeed a common operation in graph analysis, surely need to be add to benchmark. Thanks Sem.

acezen avatar Mar 05 '24 07:03 acezen