incubator-graphar
[Feat] Benchmark against common data formats used for graphs
Is your feature request related to a problem? Please describe.
Add a benchmark report comparing GraphAr with common formats used for graph data, such as CSV, Parquet, ORC, and JSON.
Describe the solution you'd like
Benchmark the following:
- Size
- Read / Write with C++ SDK
- Get basic stats (min, max, vertex count, edge count)
- Construct CSR/CSC structure
- Neighborhood retrieval
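To make the last two items concrete, here is a minimal sketch of what "construct CSR/CSC" and "neighborhood retrieval" could mean in a benchmark. It is plain Python over an in-memory edge list, not the GraphAr C++ SDK; the function names `build_csr`, `build_csc`, and `out_neighbors` are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple


def build_csr(num_vertices: int,
              edges: Sequence[Tuple[int, int]]) -> Tuple[List[int], List[int]]:
    """Build (offsets, neighbors) from a directed edge list of (src, dst) pairs.

    Assumes vertex ids are integers in [0, num_vertices).
    """
    # Count the out-degree of each vertex, shifted by one slot.
    offsets = [0] * (num_vertices + 1)
    for src, _ in edges:
        offsets[src + 1] += 1
    # Prefix sum turns the counts into start offsets.
    for v in range(num_vertices):
        offsets[v + 1] += offsets[v]
    # Scatter each destination into its source's slice.
    neighbors = [0] * len(edges)
    cursor = offsets[:-1].copy()  # next free slot per vertex
    for src, dst in edges:
        neighbors[cursor[src]] = dst
        cursor[src] += 1
    return offsets, neighbors


def build_csc(num_vertices: int,
              edges: Sequence[Tuple[int, int]]) -> Tuple[List[int], List[int]]:
    """CSC is the CSR of the reversed graph: slices hold in-neighbors."""
    return build_csr(num_vertices, [(dst, src) for src, dst in edges])


def out_neighbors(offsets: List[int], neighbors: List[int], v: int) -> List[int]:
    """Neighborhood retrieval: one contiguous slice per vertex."""
    return neighbors[offsets[v]:offsets[v + 1]]
```

A benchmark would time `build_csr` / `build_csc` after loading edges from each format, then time repeated `out_neighbors` calls over a sample of vertices.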
Datasets (to stay aligned with production environments):
- twitter2010
- com-friendster
- ldbc snb
I would also like to add something about neighborhood retrieval. For example, stats (min, max, etc.) for an ego-network.
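As a rough illustration of that kind of query, the sketch below computes min and max of a vertex property over a 1-hop ego-network. It assumes the ego-network is the center vertex plus its direct out-neighbors, and that each vertex carries a single numeric property; `ego_network_stats`, `adjacency`, and `values` are hypothetical names, not part of any GraphAr API.

```python
from typing import Dict, List, Tuple


def ego_network_stats(adjacency: Dict[int, List[int]],
                      values: Dict[int, float],
                      center: int) -> Tuple[float, float, int]:
    """Return (min, max, member count) over the 1-hop ego-network of `center`.

    Assumption: the ego-network is the center plus its direct out-neighbors.
    """
    members = {center, *adjacency.get(center, [])}
    member_values = [values[v] for v in members]
    return min(member_values), max(member_values), len(members)
```

In a benchmark, the interesting cost is fetching the neighbor list and the property values for those neighbors from each storage format; the aggregation itself is cheap.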
Neighborhood retrieval is indeed a common operation in graph analysis and should certainly be added to the benchmark. Thanks, Sem.