mlc-llm
mlc-llm copied to clipboard
Is tunning scripts available?
It seems like the tuning is per device, although the m1 tuning is applied when using any GPU. How would I use relax_integration.tune_relax on mod_deploy to create other databases?
We are not using tune_relax because it only supports static shape workloads. Will release a tutorial soon