mlc-llm issues

[Model Request] nemotron-nas

raise ValueError(f"Unknown model type: {model_type}. Available ones: {list(MODELS.keys())}") ValueError: Unknown model type: nemotron-nas. Available ones: ['llama', 'mistral', 'gemma', 'gemma2', 'gemma3', 'gemma3_text', 'gpt2', 'mixtral', 'gpt_neox', 'gpt_bigcode', 'phi-msft', 'phi', 'phi3', 'phi3_v', 'qwen',...

iEddie-cmd

new-models

[Bug] Trouble to run `mlc_llm chat` with Gemma 3 models

3

## 🐛 Bug I am having troubles when I run mlc-llm with Gemma 3 models on M3 Pro Macbook (details are below). The error is same as follows. ``` libc++abi:...

grf53

bug

[Bug] CMake Error at 3rdparty/tokenizers-cpp/msgpack/CMakeLists.txt during CMake iOS

2

## 🐛 Bug Followed the guidelines at [https://llm.mlc.ai/docs/deploy/ios.html](url) Getting error when running ``` cd mlc_llm/ios/MLCChat mlc_llm package ``` Verified that all prerequisites are correctly installed 1. Installed CMake (cmake 4.0.1)...

KingSlayer06

bug

[Question] Should we expect generation quality comparable to gguf with 4-bit quantization?

## ❓ General Questions While the inference speed is 2-3 times faster than llama.cpp, I observe some metrics degradation. For example, I have a simple test to do some punctuation/capitalization/correction...

bene-ges

question

[Question]

## ❓ General Questions How to crosscompile the mlc llm android with adb shell to run it on my android phone, I already finish the process of mlc-llm package, E:\mlc_llm\mlc-llm\3rdparty\xgrammar\cpp\tokenizer.cc(217,33):...

haoxuanWeng

question

[Android] Support LLaVA and Phi-V

2

This PR introduces the support of LLaVA and Phi-V on android device. Not thoroughly tested, but it works on my device. (Android 14.0) checkpoints: - https://huggingface.co/davidlightmysterion/llava-1.5-7b-hf-q4f16_1-MLC - https://huggingface.co/mlc-ai/Phi-3.5-vision-instruct-q4f16_1-MLC ![Screenshot_20250331_114427](https://github.com/user-attachments/assets/13e801de-0d11-4adb-8141-11d627776f77)

davidlightmysterion

[Bug] Binary was created using {relax.Executable} but a loader of that name is not registered. Available loaders are relax.VMExecutable, const_loader, static_library, metal. Perhaps you need to recompile with this runtime enabled.

7

## 🐛 Bug After following the steps for installation, running MLCChat, clicking on a model (in this case, i chose the model that came with it "Llama-3.2-3B-Instruct-q4f16_1-MLC") I get this...

Japp-Galang

bug

[Model Request] BitNet b1.58 2B4T - Scaling Native 1-bit LLM

## ⚙️ Request New Models - Link to an existing implementation (e.g. Hugging Face/Github): https://huggingface.co/microsoft/bitnet-b1.58-2B-4T - Is this model architecture supported by MLC-LLM? No It has just come out. May...

pikann

new-models

[Bug] Missing post layernorm in CLIP model

## 🐛 Bug ![Image](https://github.com/user-attachments/assets/a1762c05-5b1e-479c-8e60-23114a15eb44) I found clip model missing post_layernorm in forward, I tried to add post_layernorm, but I found there are some problem. ## To Reproduce Steps to reproduce...

vincentccc

bug

[Bug] gemma3 WebGPU <unnamed> panicked

2

## 🐛 Bug I've seen that in the last days support for gemma3 was added. Which is great! However, it does not seem to work with the webgpu target. I...

nico-martin

bug

mlc-llm
mlc-llm copied to clipboard

Metadata

[Model Request] nemotron-nas

[Bug] Trouble to run `mlc_llm chat` with Gemma 3 models

[Bug] CMake Error at 3rdparty/tokenizers-cpp/msgpack/CMakeLists.txt during CMake iOS

[Question] Should we expect generation quality comparable to gguf with 4-bit quantization?

[Question]

[Android] Support LLaVA and Phi-V

[Bug] Binary was created using {relax.Executable} but a loader of that name is not registered. Available loaders are relax.VMExecutable, const_loader, static_library, metal. Perhaps you need to recompile with this runtime enabled.

[Model Request] BitNet b1.58 2B4T - Scaling Native 1-bit LLM

[Bug] Missing post layernorm in CLIP model

[Bug] gemma3 WebGPU <unnamed> panicked

← Metadata

Owner

Metadata

mlc-llm mlc-llm copied to clipboard

Metadata

← Metadata

Owner

Metadata

mlc-llm
mlc-llm copied to clipboard