Verification of TunablePrecision parameters only works if using the AccuracyObserver

Open isazi opened this issue 11 months ago • 0 comments

Hi @stijnh, I assigned this to you to check if this is a bug or a feature :)

In practice, if you only want to use the TunablePrecision types to compare the performance of different floating-point types, and are not interested in measuring the loss of accuracy with the AccuracyObserver, verification no longer works.

A simple test: take the accuracy.py example in examples/cuda, remove the observers, and you get this error:

TypeError: Element 3 of the expected results list is not of the same dtype as the kernel output: float64 != float32.

If the observers are passed to tune_kernel, then everything works.
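To illustrate the kind of mismatch behind this error, here is a minimal NumPy-only sketch. It does not use Kernel Tuner's actual verification code: `check_answer_dtypes` is a hypothetical stand-in for a strict dtype check, and the arrays are made-up data. It assumes the expected result is stored as float64 while the kernel output for the tuned precision is float32.

```python
import numpy as np


def check_answer_dtypes(answer, outputs):
    """Hypothetical stand-in for a strict dtype check on expected results.

    This is NOT Kernel Tuner's implementation, only a sketch of the idea.
    """
    for i, (expected, result) in enumerate(zip(answer, outputs)):
        if expected is None:
            continue  # None means "do not verify this argument"
        if expected.dtype != result.dtype:
            raise TypeError(
                f"Element {i} of the expected results list is not of the same "
                f"dtype as the kernel output: {expected.dtype} != {result.dtype}."
            )


n = 16
# Expected result kept in double precision (assumption for this sketch).
answer = [None, None, None, np.ones(n, dtype=np.float64)]
# Kernel output produced with a lower, tunable precision (assumption).
outputs = [None, None, None, np.ones(n, dtype=np.float32)]

try:
    check_answer_dtypes(answer, outputs)
    message = None
except TypeError as err:
    message = str(err)
print(message)

# Casting the expected values to the output dtype makes the strict check pass.
cast_answer = [
    a if a is None else a.astype(o.dtype) for a, o in zip(answer, outputs)
]
check_answer_dtypes(cast_answer, outputs)
```

Whether such a cast is the right behaviour inside Kernel Tuner when no AccuracyObserver is passed is exactly the open question here.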

isazi, Mar 01 '24 09:03