nebuly icon indicating copy to clipboard operation
nebuly copied to clipboard

Benchmarks

Open ogencoglu opened this issue 3 years ago • 1 comments

Thanks for this work. It would make sense:

  • to explain if a speed-accuracy trade-off exists and if so, how much
  • add benchmarks of your optimization for various applications and architectures.

ogencoglu avatar Feb 22 '22 14:02 ogencoglu

Thanks ogencoglu. We are working right these days on building use cases to show both how to use the library in a more granular way and to provide benchmarks on SOTA model acceleration on various hardware devices (mainly CPUs and GPUs). We will try to provide some benchmarks starting from the next release!

And as for the speed-accuracy tradeoff, we actually want to avoid any drop in accuracy. Exactly for this reason we are not using methods like pruning and quantization. It may happen that some of the supported AI compilers use some accuracy reduction strategies. However, in our tests so far we have not detected any significant drop in performance

diegofiori avatar Feb 22 '22 15:02 diegofiori