mlc-llm icon indicating copy to clipboard operation
mlc-llm copied to clipboard

[Model Request] OpenChat 3.5 -

Open alpayariyak opened this issue 1 year ago • 4 comments
trafficstars

⚙️ Request New Models

  • Link to an existing implementation (e.g. Hugging Face/Github): https://huggingface.co/openchat/openchat_3.5
  • Is this model architecture supported by MLC-LLM? (the list of supported models) Yes

Additional context

Current best 7b OSS model, beating March GPT-3.5 on most benchmarks

alpayariyak avatar Dec 09 '23 17:12 alpayariyak

I've converted it, and it works well

python3 -m mlc_llm.build --hf-path  openchat/openchat_3.5 --target metal_x86_64 --quantization q4f16_1

I have only 1 issue the model doesn't stop streaming sometimes (like stop word doesn't work), looking for a solution.

MikeLP avatar Dec 11 '23 06:12 MikeLP

Have you uploaded it somewhere on HuggingFace? Seems like an eos token issue, might be fixable with a change to the config file

alpayariyak avatar Dec 11 '23 21:12 alpayariyak

@alpayariyak Uploaded to HF

OpenChat 3.5

https://huggingface.co/iyanello/openchat_3.5-q4f16_1-mlc-llm

OpenChat 3.5 - 1210 update

https://huggingface.co/iyanello/openchat_3.5-1210-q4f16_1-mlc-llm/tree/main

MikeLP avatar Dec 12 '23 21:12 MikeLP

https://github.com/mlc-ai/mlc-llm/issues/1776#issuecomment-1992550167

MikeLP avatar Mar 12 '24 20:03 MikeLP