mlc-llm
mlc-llm copied to clipboard
Universal LLM Deployment Engine with ML Compilation
hello ,does there any speed throughout benchmark comparing with llama.cpp?
It seems there is no official support for interacting with MLC LLM using python api yet. For the convenience of anyone who wants to develop a program that integrates MLC...
Hi, do you happen to know how the optimization performance for this TVM based solution might compare to https://github.com/hidet-org/hidet for NVIDIA edge devices (like NVIDIA Jetson Xavier)? I'm curious to...
Does MLC support the non pro iPhone 13, or do I need to buy more RAM?
Hey, nice launch! Since LLama and its variant vicuna has commercial restriction and huggingface released openllama https://huggingface.co/openlm-research/open_llama_7b_preview_200bt. Can you support it?

trying to build ios app from the source and everything is ok except for running the app on the iPhone, the app shows ready to chat but after sending the...
Fixed a small typo. The two previous commits were just updating my forked branch.
I am new to TVM, and I encountered an error while compiling it according to the [instructions](https://github.com/mlc-ai/mlc-llm/blob/main/ios/README.md). I cannot install it successfully. It seems that TVM or something else cannot...
This actually do not provide a loadable in huggingface repo, how does the tokenizer can load? ``` OSError: ./dist/models/vicuna-v1-7b does not appear to have a file named config.json. Checkout 'https://huggingface.co/./dist/models/vicuna-v1-7b/None'...