gdrcopy icon indicating copy to clipboard operation
gdrcopy copied to clipboard

add autotuning support

Open drossetti opened this issue 7 years ago • 5 comments

optimized memcpy implementations should be chosen at run-time during a tuning phase, possibly in gdr_open()

drossetti avatar Dec 06 '17 22:12 drossetti

Curious: what dimensions are you going to be tuning over here in the autotuner?

maddyscientist avatar Dec 06 '17 23:12 maddyscientist

@maddyscientist that is a good question. I am not expecting a dependency on the buffer size, but I might be wrong.

drossetti avatar Jun 20 '18 00:06 drossetti

@drossetti BTW, is there any calculation formula, otherwise that would depend on experimental values on kinds of HW configuration

hongbilu avatar Aug 21 '23 09:08 hongbilu

@hongbilu any performance model would be HW dependent inherently, so it would involve maintaining a database of FOMs for each platform. That is why I was proposing a run-time autotuning phase instead.

drossetti avatar Aug 21 '23 19:08 drossetti

@drossetti that would be a big work and cpu's work frequency or workload also need to be considered in theory. Experiments show that cpu's work frequency is a key influence factor

hongbilu avatar Aug 22 '23 01:08 hongbilu