dolphin icon indicating copy to clipboard operation
dolphin copied to clipboard

Benchmarking runtimes

Open scottstanie opened this issue 2 years ago • 0 comments

Top level issue tracking the improvements/additions needed to track a full-frame workflow runtime.

Current knobs to test are

  1. Using CPU vs GPU
  2. SLC input stack size
  3. Output posting (or equivalently strides)
  4. Size of blocks loaded at one time from the input stack
  5. For CPU, number of CPUs/number of threads per CPU
  6. Algorithm for phase linking (MLE vs EVD)

I've put these in the order of my guess for which will have the biggest effect, but we clearly need to do the tests to see.

Things we need for good testing

  • the single-update workflow script (#11)
  • recording the threads (#28)
  • recording the block size/fixing the max_ram_gb option (#32)
  • adding the ability to use EVD instead of MLE (#138 )
  • using vmtouch -e on the SLC stack before starting the workflow: https://github.com/hoytech/vmtouch . This will make sure we don't get very fast runs just because the SLC data as cached, as we can't count on that happening for the production runs.

scottstanie avatar Feb 07 '23 13:02 scottstanie