mlc-llm [Model Request] OpenChat 3.5 -


Open alpayariyak opened this issue 1 year ago • 4 comments


Link to an existing implementation (e.g. Hugging Face/Github): https://huggingface.co/openchat/openchat_3.5
Is this model architecture supported by MLC-LLM? (the list of supported models) Yes

Currently the best 7B OSS model, beating March GPT-3.5 on most benchmarks.

Dec 09 '23 17:12 alpayariyak

I've converted it, and it works well

python3 -m mlc_llm.build --hf-path openchat/openchat_3.5 --target metal_x86_64 --quantization q4f16_1

I have only one issue: the model sometimes doesn't stop streaming (as if the stop word isn't working). I'm looking for a solution.

Dec 11 '23 06:12 MikeLP

Have you uploaded it somewhere on Hugging Face? It seems like an EOS token issue, which might be fixable with a change to the config file.
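Until the config is sorted out, a client-side workaround may help: cut the streamed text as soon as the end-of-turn marker shows up. The following is only a minimal sketch, not part of MLC-LLM. It assumes the stop string is `<|end_of_turn|>` (the marker OpenChat 3.5 uses in its prompt format) and that the output arrives as an iterable of text chunks; `stream_until_stop` is a hypothetical helper name.

```python
from typing import Iterable, Iterator

# Assumption: "<|end_of_turn|>" is the end-of-turn marker for OpenChat 3.5.
STOP = "<|end_of_turn|>"


def stream_until_stop(chunks: Iterable[str], stop: str = STOP) -> Iterator[str]:
    """Yield text from `chunks` until `stop` appears, even if it is split across chunks."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        idx = buffer.find(stop)
        if idx != -1:
            # Emit whatever came before the stop string, then end the stream.
            if idx > 0:
                yield buffer[:idx]
            return
        # Hold back any tail that could be the beginning of the stop string.
        keep = 0
        for k in range(min(len(stop) - 1, len(buffer)), 0, -1):
            if buffer.endswith(stop[:k]):
                keep = k
                break
        emit = buffer[: len(buffer) - keep]
        if emit:
            yield emit
        buffer = buffer[len(buffer) - keep:]
    # The stream ended without a stop string: flush what was held back.
    if buffer:
        yield buffer
```

Wrapping whatever chunk iterator the runtime exposes in `stream_until_stop(...)` hides the runaway text from the user, but the model keeps generating in the background unless the caller also aborts generation, so this does not replace a proper EOS fix in the config.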

Dec 11 '23 21:12 alpayariyak

@alpayariyak Uploaded to HF

https://huggingface.co/iyanello/openchat_3.5-q4f16_1-mlc-llm

https://huggingface.co/iyanello/openchat_3.5-1210-q4f16_1-mlc-llm/tree/main

Dec 12 '23 21:12 MikeLP

https://github.com/mlc-ai/mlc-llm/issues/1776#issuecomment-1992550167

Mar 12 '24 20:03 MikeLP