mlc-llm
Universal LLM Deployment Engine with ML Compilation
`python3 build.py --hf-path databricks/dolly-v2-3b --quantization q3f16_0 --max-seq-len 768`
```
Weights exist at dist/models/dolly-v2-3b, skipping download.
Using model path dist/models/dolly-v2-3b
Automatically configuring target: cuda -keys=cuda,gpu -arch=sm_80 -max_num_threads=1024 -thread_warp_size=32
Segmentation fault (core dumped)...
```
`mlc_chat_cli` is running on my WSL, but it seems that it didn't use the GPU and instead ran entirely on the CPU, which is very slow. How can I use the GPU...
I tested the `mlc_chat_cli` command in a Linux environment, and I want to deploy it as an API that services can call. Is there a convenient way to deploy it,...
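One generic way to answer this kind of question is to wrap the CLI in a small HTTP service. The sketch below uses only the Python standard library; the chat command is a placeholder (`echo`), not mlc-llm's actual invocation — you would swap in the real `mlc_chat_cli` call and any argument handling your setup needs:

```python
import subprocess
from http.server import BaseHTTPRequestHandler, HTTPServer

# Placeholder command for illustration only; replace with the actual
# mlc_chat_cli invocation for your model and environment.
CHAT_COMMAND = ["echo"]

def run_chat(prompt: str) -> str:
    """Run the chat command once with the prompt and return its stdout."""
    result = subprocess.run(
        CHAT_COMMAND + [prompt],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()

class ChatHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read the raw request body as the prompt text.
        length = int(self.headers.get("Content-Length", 0))
        prompt = self.rfile.read(length).decode("utf-8")
        reply = run_chat(prompt).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(reply)))
        self.end_headers()
        self.wfile.write(reply)

def serve(port: int = 8000) -> None:
    """Block forever, serving POST /chat-style requests on localhost."""
    HTTPServer(("127.0.0.1", port), ChatHandler).serve_forever()
```

Note that spawning one subprocess per request reloads model weights every time; a production setup would keep a single long-lived chat process (or use a library binding) behind the server instead.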
build code: `python build.py --model vicuna-v1-7b --quantization q4f16_0 --target android --max-seq-len 768` error: ``` Using model path dist/models/vicuna-v1-7b Load cached module from dist/vicuna-v1-7b-q4f16_0/mod_cache_before_build_android.pkl and skip tracing. You can use --use-cache=0...
Cannot find vicuna-v1-7b lib in preferred path "dist/vicuna-v1-7b/float16/vicuna-v1-7b_metal_float16.dylib" or other candidate paths
When I run `python3 build.py --model vicuna-v1-7b`, I get the following error. Has anyone resolved it?
Hello. I followed your build instructions from README.md and they are not reproducible. It would be great if the team could fix the build instructions. One idea: create a new...
I'm using a conda environment with python=3.10, and I did: `pip install apache-tvm pytest`. Why did that happen? Did I install tvm the wrong way?
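For "did I install it the wrong way" questions, a quick first diagnostic is to check which file Python would actually import the package from — a pip-installed `apache-tvm` and a source-built TVM land in different places. This is a generic stdlib sketch, not mlc-llm-specific:

```python
import importlib.util

def module_location(name: str) -> str:
    """Return the path a module would be imported from, or a not-found message."""
    spec = importlib.util.find_spec(name)
    if spec is None or spec.origin is None:
        return f"{name} not found"
    return spec.origin

if __name__ == "__main__":
    # Prints e.g. ".../site-packages/tvm/__init__.py", or "tvm not found".
    print(module_location("tvm"))
```

If the printed path points at a site-packages copy when you expected a source build (or vice versa), the wrong TVM is shadowing the one you meant to use.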
Is there a hacky way to monitor current system metrics on iPadOS?
Below is the output of `vulkaninfo --summary`: