FastChat
FastChat copied to clipboard
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Hello, I using default target_modules `q_proj, v_proj` the result looks good. Will it more good if I using more target_modules trainable? such as : ``` "q_proj", "v_proj", "down_proj", "gate_proj", "up_proj",...
path: https://github.com/lm-sys/FastChat/tree/main/fastchat/eval issue: missing field "metadata" example: { "reviewer_id":"gpt-4-0328-default", "prompt_id":1, "metadata":{ "temperature":0.2, "max_tokens":8192 }, "description":"GPT-4 for general questions", "category":"general" }
Hi, I'm training a FlanT5 network. The training completes successfully, but when I try to run a simple inference, I have a tensor of zeros, so the prediction is null....
Hi team, I use the command below without changing the delta-path and it was automatically downloaded. It still takes effective when I rerun the command but I just wonder where...
When using parallel computing with multiple GPUs, if I set "--num-gpus 4," only three GPUs are actually activated. How can I solve this issue?
Currently the weights saved are all models weights include optimzier, is HUGE, how to set save lora weights only? 