auto-tuning topic
CLTune
CLTune: An automatic OpenCL & CUDA kernel tuner
kernel-ml
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
Tensile
Stretching GPU performance for GEMMs and tensor contractions.
neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
TLCBench
Benchmark scripts for TVM
ck-crowdtuning
Collective Knowledge crowd-tuning extension to let users crowdsource their experiments (using portable Collective Knowledge workflows) such as performance benchmarking, auto tuning and machine learnin...
uptune
A Generic Distributed Auto-Tuning Infrastructure