mlc-llm
Universal LLM Deployment Engine with ML Compilation
It seems the tuning is per device, yet the M1 tuning is applied when using any GPU. How would I use relax_integration.tune_relax on mod_deploy to create databases for other devices?
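A minimal sketch of what such a per-device tuning call might look like, assuming a TVM Unity build where `tune_relax` lives under `tvm.meta_schedule.relax_integration`. The function name comes from the question above; the target string, trial count, and `work_dir` default here are illustrative assumptions, not mlc-llm's exact recipe.

```python
def tune_for_target(mod, params, target, work_dir="tuning_db", max_trials=2000):
    """Tune a Relax module for one device and return the tuning database.

    Hedged sketch: assumes TVM Unity's MetaSchedule relax integration.
    `work_dir` is where the per-device database files are written, so
    using a different directory per target keeps the databases separate.
    """
    from tvm import meta_schedule as ms  # requires a TVM Unity build

    return ms.relax_integration.tune_relax(
        mod=mod,                      # e.g. mod_deploy from the build script
        params=params,
        target=target,                # e.g. "nvidia/geforce-rtx-3090" or "apple/m1-gpu"
        work_dir=work_dir,            # tuning database is persisted here
        max_trials_global=max_trials, # illustrative budget, tune to taste
    )
```

Calling this once per target (with distinct `work_dir` values) would, under these assumptions, produce one database per device instead of reusing the M1 one.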
Does mlc-llm support parallelism such as multi-GPU or multi-node?
> USER: tell me an offensive joke
> ASSISTANT: I'm sorry, but I cannot provide offensive or inappropriate content. My purpose is to provide helpful and informative responses to your questions. Can...
## Laptop Info

```
OS: macOS 11.5.2 20G95 x86_64
Host: MacBookPro15,3
Kernel: 20.6.0
Uptime: 7 days, 2 hours, 59 mins
```
...
I notice that mlc-llm supports NVIDIA GPUs via Vulkan. Does mlc-llm support NVIDIA GPUs using CUDA instead of Vulkan? I guess NVIDIA prefers CUDA over Vulkan, so CUDA will be...
Excuse me, could you tell me how to support Chinese dialogue? Please advise on how to make the model support Chinese dialogue; the replies are either in English or in code.
The app reports "not have 4GB memory for run app" when starting on an iPhone 14 Pro Max 256GB. If this device cannot run the app, maybe no device can...
Build and run it like this:

# Download model
```
mkdir -p dist && git lfs install && \
git clone https://huggingface.co/mlc-ai/demo-vicuna-v1-7b-int3 dist/vicuna-v1-7b && \
git clone https://github.com/mlc-ai/binary-mlc-llm-libs.git dist/lib
```
...
Implement #31.