FastChat
FastChat copied to clipboard
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
## Why are these changes needed? Add multi-lora support for vllm_worker, this feature has been supported in vllm v0.3.2. This PR enables this capability in vllm_worker. 1. Add a new...
I attempted a workaround, but the output from finetuning doesn't look quite right. Has anyone made a working fix for this issue?
Can support the new model about Llama3
The fix for the Chinese-Alpaca-2-7B model matches the wrong AlpacaAdapter instead of the correct Llama2ChineseAlpacaAdapter. ## Why are these changes needed?  ``` def match(self, model_path: str): - return "alpaca"...
The [Phi-3 Technical Report](https://arxiv.org/abs/2404.14219) was just published by the Microsoft team, in which they introduce a model family of 3 state-of-the-art models: - phi-3-mini (3.3B) - phi-3-small (7B) - phi-3-medium...
Hello, I am reaching out for clarification on the continuous batching mechanism within the VLLM worker, as described in the [VLLM Integration README](https://github.com/lm-sys/FastChat/blob/main/docs/vllm_integration.md). The documentation states that the system "offers...
why not have log in conversion history? hope add log info
Fix argument missing bug. TypeError: set_global_vars() missing 1 required positional argument: 'use_remote_storage_' ## Why are these changes needed? `python3 -m fastchat.serve.gradio_web_server` launch failed. info """ 2024-04-23 07:32:13 | ERROR |...
## Why are these changes needed? The previous format for ChatGLM3 is not correct, which will yield wrong output:  After changing:  ## Related issue number (if applicable) None...
How deepseekcoder6.7b supports streaming output results? 