Gaffer icon indicating copy to clipboard operation
Gaffer copied to clipboard

Performance testing of Gaffer

Open james257 opened this issue 8 years ago • 4 comments

It would be really helpful to generate synthetic data for performance testing of Gaffer at a reasonable scale on a Gaffer/Accumulo cluster.

james257 avatar Oct 20 '16 08:10 james257

I've started looking at this issue from the perspective of the socialsensor Graph database comparison benchmarks.

I've started integrating the Gaffer Accumulo store into their benchmarking project so we can see a comparison against several other graph DB technologies (including Neo4J and Titan).

They use a variety of real and synthetic datasets and a number of graph-oriented benchmarks, most of which are easy to support, but some may require a bit more thought or some new analytical functionality.

Once I've made some progress I'll try to make this available in the gaffer-experimental project.

t616178 avatar Dec 08 '16 22:12 t616178

Steps

  1. Write install scripts for yarn/accumulo/gaffer
  2. Measure performance of adding data from hdfs (how many elements added per second?)
  3. Write install scripts for Gaffer REST API
  4. Measure performance of element retrieval from REST API
  5. Measure performance of streaming data into Gaffer
  6. Optimise

p013570 avatar Jan 11 '17 11:01 p013570

@gaffer01 it would be great to get some statistics included before we do version 1.0.0

p013570 avatar Aug 10 '17 12:08 p013570

See https://github.com/gchq/gaffer-tools/tree/develop/random-element-generation and https://github.com/gchq/gaffer-tools/tree/develop/performance-testing which may-or-may-not work or be relevant.

n3101 avatar Nov 17 '21 14:11 n3101

Duplicates #3027

GCHQDeveloper314 avatar Mar 13 '24 09:03 GCHQDeveloper314