hipBLASLt
[tensilelite] Update NumWarmups and MaxEnqueuesPerSync
- NumWarmups: if set, the warmup phase is capped at 30 ms. The cap also affects MaxEnqueuesPerSync.
- MaxEnqueuesPerSync: the maximum number of solution enqueues to perform per synchronization. This value is restricted when NumWarmups reaches the 30 ms cap.
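For illustration only, here is a minimal Python sketch of one way the two parameters could interact. It is not the code in this PR: the helper name `clamp_benchmark_params`, the `est_kernel_ms` input, and the rule of capping MaxEnqueuesPerSync at the same launch count are all assumptions. Only the 30 ms figure comes from the description above.

```python
WARMUP_BUDGET_MS = 30.0  # the 30 ms limit named in the description


def clamp_benchmark_params(num_warmups, max_enqueues_per_sync, est_kernel_ms):
    """Hypothetical helper: cap warmups to a time budget.

    est_kernel_ms is an assumed estimate of one kernel launch in milliseconds.
    Returns (num_warmups, max_enqueues_per_sync, was_restricted).
    """
    if est_kernel_ms <= 0:
        raise ValueError("est_kernel_ms must be positive")

    # Largest number of launches that fits in the budget, but at least one.
    budget_launches = max(1, int(WARMUP_BUDGET_MS // est_kernel_ms))

    if num_warmups <= budget_launches:
        # Warmups fit in the budget (or are unset): nothing is restricted.
        return num_warmups, max_enqueues_per_sync, False

    # Warmups hit the budget: cap them, and (by assumption) cap the
    # enqueues per synchronization at the same launch count.
    return (
        budget_launches,
        min(max_enqueues_per_sync, budget_launches),
        True,
    )
```

Under these assumptions, a request for 100 warmups of a kernel estimated at 1 ms would be cut to 30 warmups, and a MaxEnqueuesPerSync of 50 would be cut to 30 as well. A request that already fits in the budget is returned unchanged.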
Question: should warmup be based on GPU time, as enqueue is?
FYI, there is an experimental implementation of a GPU time-based warmup mechanism for tensilelite and hipblaslt-bench.