Tianqi Chen
Tianqi Chen
@tlopex can you look a bit into this model?
Thanks @tlopex !
@MikeLP yes, we should keep such customization
https://github.com/mlc-ai/mlc-llm/pull/2279 brings an initial iOS ver of MLCEngine
https://github.com/mlc-ai/mlc-llm/pull/2380 transitions iOS ChatApp to MLCEngine
https://github.com/mlc-ai/mlc-llm/pull/2410 transitions the android to the MLCEngine
We have completed the transition steps
We are already applying auto-tune implicitly in many cases. The current mechanism dlight is already somewhat auto-tuned then coded into the rule, but indeed can be further tweaked, see examples...
Relax is actually a better iteration from relay that address some of the long time compilation issues. The prebuilt are already better space so it is better optimized. You can...