Hao Zhang
@zxzhijia Is your chatbot behind the GFW?
Stale issue. Closing.
Please use the v1.1 weights, and in the meantime use the latest versions of FastChat and transformers.
Yes, the issue was caused by HF's refactoring of the llama tokenizer, which we have since fixed. Please make sure to use the latest version of FastChat and vicuna-v1.1...
@laidybug why don't you submit a PR and let us take a look?
Duplicate of #170. Please monitor that thread for updates.
@ShoubhikBanerjee Please follow the instructions **step-by-step** to get llama weights, then vicuna weights, and then run apply_delta.
Refer to this page for instructions on getting the llama weights: https://huggingface.co/docs/transformers/main/model_doc/llama
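Conceptually, the apply_delta step just adds the released delta to the matching base llama parameter to recover the Vicuna weights. A minimal sketch of that idea (toy scalar "weights" stand in for real tensors; this is not the actual FastChat implementation, which operates on full HF checkpoints):

```python
def apply_delta(base_weights, delta_weights):
    """Recover target weights: target = base + delta, per parameter."""
    assert base_weights.keys() == delta_weights.keys()
    return {name: base_weights[name] + delta_weights[name]
            for name in base_weights}

# Toy stand-ins for model tensors (exact binary fractions, so the
# sums below are exact in floating point).
base = {"layer0.weight": 0.25, "layer0.bias": -0.125}
delta = {"layer0.weight": 0.5, "layer0.bias": 0.375}

vicuna = apply_delta(base, delta)
print(vicuna["layer0.weight"])  # 0.75
```

The real apply_delta script does the same addition tensor-by-tensor over the downloaded llama and delta checkpoints, which is why both sets of weights must be obtained first.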
Refer to this reply: https://github.com/lm-sys/FastChat/issues/543#issuecomment-1520909606. It is unlikely you can fine-tune any version of Vicuna with only 96 GB of total VRAM.
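A rough back-of-envelope shows why 96 GB falls short for full fine-tuning. With Adam in mixed precision, a common rule of thumb is roughly 16 bytes per parameter (fp16 weights and gradients plus fp32 master weights and two optimizer moments), before counting activations; the 16-byte figure here is an assumption for illustration, not a measurement:

```python
def training_vram_gb(n_params, bytes_per_param=16):
    """Weights + gradients + Adam state, activations excluded."""
    return n_params * bytes_per_param / 1e9

for billions in (7, 13):
    gb = training_vram_gb(billions * 1e9)
    print(f"{billions}B params -> ~{gb:.0f} GB before activations")
```

Even the 7B model needs roughly 112 GB by this estimate, and 13B about 208 GB, so 96 GB cannot hold the optimizer state alone, let alone activations.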
For now, I think you can try a GPTQ-quantized Vicuna in other ecosystems like GPT4All.
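For context, the appeal of GPTQ is storing weights at 4 bits instead of 16, cutting memory roughly 4x. GPTQ itself compensates quantization error layer by layer; the hypothetical sketch below shows only the simpler round-to-nearest storage idea, not the real algorithm:

```python
def quantize_4bit(weights):
    """Round-to-nearest symmetric int4: codes in [-7, 7] plus one scale."""
    scale = max(abs(w) for w in weights) / 7
    return [round(w / scale) for w in weights], scale

def dequantize(codes, scale):
    return [c * scale for c in codes]

w = [0.7, -0.35, 0.14, 0.0]
codes, scale = quantize_4bit(w)
w_hat = dequantize(codes, scale)
# Each reconstructed value is within half a quantization step
# (scale / 2) of the original weight.
```

Real GPTQ backends pack these 4-bit codes tightly and keep per-group scales, which is what lets a quantized model fit on much smaller GPUs.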