mlc-llm

Universal LLM Deployment Engine with ML Compilation

Results: 578 mlc-llm issues

python3 build.py --hf-path databricks/dolly-v2-3b --quantization q3f16_0 --max-seq-len 768
Weights exist at dist/models/dolly-v2-3b, skipping download.
Using model path dist/models/dolly-v2-3b
Automatically configuring target: cuda -keys=cuda,gpu -arch=sm_80 -max_num_threads=1024 -thread_warp_size=32
Segmentation fault (core dumped)...

documentation

`mlc_chat_cli` is running on my WSL, but it seems to use only the CPU instead of the GPU, which is very slow. How can I use GPU...

documentation

I tested the `mlc_chat_cli` command in a Linux environment, and I want to deploy it as an API that other services can call. Is there a convenient way to deploy it,...

feature request

build code: `python build.py --model vicuna-v1-7b --quantization q4f16_0 --target android --max-seq-len 768` error: ``` Using model path dist/models/vicuna-v1-7b Load cached module from dist/vicuna-v1-7b-q4f16_0/mod_cache_before_build_android.pkl and skip tracing. You can use --use-cache=0...

documentation

Cannot find vicuna-v1-7b lib in preferred path "dist/vicuna-v1-7b/float16/vicuna-v1-7b_metal_float16.dylib" or other candidate paths

documentation

When I run `python3 build.py --model vicuna-v1-7b`, I get the following error: ![4eb38723413d8c2fd6c092530b46d6e](https://github.com/mlc-ai/mlc-llm/assets/51256576/87852775-e50d-49a4-a2aa-c191c617dd89) Has anyone resolved it?

documentation

Hello. I followed the build instructions in README.MD, but the build is not reproducible. It would be great if the team could fix the build instructions. One idea: create a new...

documentation

![image](https://github.com/mlc-ai/mlc-llm/assets/48009681/6546b2d3-85a9-4ee0-95cd-0aff5c76343d) I'm using a conda environment with python=3.10, and I ran: `pip install apache-tvm pytest`. Why did that happen? Did I install tvm the wrong way?

documentation

Is there a hacky way to monitor current system metrics on iPadOS?

enhancement

![image](https://github.com/mlc-ai/mlc-llm/assets/5838111/2b8df4c1-3ded-475f-9cf9-e848dc844029) Below is the output of `vulkaninfo --summary`: ![image](https://github.com/mlc-ai/mlc-llm/assets/5838111/5763fbfe-4809-4506-b055-63b76dc11477)

bug