FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
I would like to know how fast generation is. Where can I find a setting to show tokens/s? Is there any chance this is an option, or can it be enabled through the logger settings? Awesome job...
Hi there! I have verified that flan-t5 training goes smoothly. Could you review the training file? I am rebasing the inference file and will update it later. A sample answer by the...
I use the following command to train on my own data. model_name_or_path is vicuna-7b-1.1, not LLaMA 7B. Is there any problem with training like this? Also, how do I load the checkpoint saved at step 200 (checkpoint-200)? Do...
Loading checkpoint shards: 100%|███████████████████████████████████████████████████████████████████| 41/41 [00:28
I launched the model worker with the 7B and 13B models, and they perform much worse on many questions. How about yours?
Hi, I used the training script in the README without changing the data or parameters, but my GPU still runs out of memory. Have you tested it on 4xA100 40GB?...
This is a draft PR that serves a more informational purpose for anyone wanting to interact with the model in an OpenAI-API-ish fashion. I will not implement additional features upon request...
Hi, can this model be trained with DeepSpeed support? If yes, could anyone provide a working DeepSpeed config file? Thanks. BTW, I have tried to use a simple config setting...