FastChat icon indicating copy to clipboard operation
FastChat copied to clipboard

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Results 741 FastChat issues
Sort by recently updated
recently updated
newest added

I would like to know how fast, I could locate and setting to see token/s any change this is an option or change info the logger settings ? Awesome job...

Hi there! I have verified flan-t5 training goes smoothly. Could you review the training file? I am rebasing inference file and will update later. ![image](https://user-images.githubusercontent.com/28548224/232235754-0183efe7-31d5-4f1f-808e-8a2a3c22dea2.png) A sample answer by the...

I use following command to train with my data, model_name_or_path is vicuna-7b-1.1 not llama 7B, Is there any problem with training like this? how to load save step checkpoint-200? Do...

Does this work on AMD cards? What are the GPU requirements for inference?

enhancement

Loading checkpoint shards: 100%|███████████████████████████████████████████████████████████████████| 41/41 [00:28

i launch the model worker 7B and 13B, it works much worse in many question, how about your's?

Hi, I used the training script in the readme, and didn't change the data and parameters, but my gpu memory still run out. Have you test it on 4xA100 40Gb?...

This is a draft PR that serves more informational purpose for anyone wanting to interact with the model in an OpenAI-API-ish fashion. I will not implement additional features upon request...

Hi: Can this model be trained with Deepspeed support? If yes, could anyone provide a workable Deepspeed config file? Thanks. BTW, I have tried to use a simple config setting...