Zihao Ye comments

Results 332 comments of


                                            Zihao Ye

Add support for downloading weights from HF path

I have already mentioned, all models under mlc-ai (including the mlc-ai/demo-vicuna-v1-7b-int4 you used) are already compiled by MLC-LLM, and you should find some pre-compiled models in raw huggingface format (like...

Will this work on an iPhone 14 pro?

> Tagging along here, i tried with my iPhone 13 with testflight, it also crashes after showing the system initializing message. iPhone 13 does not have enough RAM size (6GB...

Refactor the design of conversation.py

Another future working item is to transform the conversion template into a YAML/JSON file we can load from disk, rather than hardcoded in C++ file.

fix supported_models import error

Duplicate of #357

end-to-end GNN training and inference

No DGL still uses CuSparse.

[43_6] `cwith` tags in TeXmacs blocks are not translated properly when exporting to LaTeX

Sounds good to me.

[WIP] Convenient script for auto tuning

Hi @MarcelDelhez, what do we mean by "tuning" here is tuning the kernel performance (to be faster) instead of fine-tuning weights. Support fine-tuning in MLC-LLM is indeed very important but...

An alternative python interface for MLC LLM

> It seems there is no official support for interacting with MLC LLM using python api yet Actually, there is one: https://github.com/mlc-ai/mlc-llm/blob/main/tests/chat.py

Runing mlc-llm python code on windows fail

Some of the functionalities for dynamic shape have not been upstreamed yet, please use the [relax repo](https://github.com/mlc-ai/relax) for now.

python chat.py can not be run

Hi @lucasjinreal , to build models on your own, you should clone the models' original huggingface repository, for example, for LLaMA, you should: ``` git clone https://huggingface.co/decapoda-research/llama-7b-hf.git dist/models/llama-7b-hf ``` The...