FastChat issues

update queue_lens on generation ends.

When using the SHORTEST_QUEUE method for load balancing, an issue arises because the queue_length isn't updated when a generation task ends. This leads to inaccurate load balancing across worker processes....

lcw99

fix the bugs when some parameters are empty

## Why are these changes needed? Fix the bugs when `sampling_weights` is empty; Fix the bugs when `random_questions` is empty ## Related issue number (if applicable) ## Checks - [x]...

zjasper666

Request to add [HODACHI/EZO-Humanities-9B-gemma-2-it] and [HODACHI/EZO-Common-9B-gemma-2-it] to Chatbot Arena

If it is not appropriate to list it as an Issue, please point it out. To Arena, I would like you to add the following model. Model 9B gemma-2, but...

kazuya-hodatsu-336-1

`httpx.RemoteProtocolError: peer closed connection without sending complete message body (incomplete chunked read)`

## using `dumps_kwargs` keyword arguments are no longer supported. while deploying and testing qwen1.5-7b-chat through fastchat v0.2.36, i got an exception : ``` 2024-08-16 03:11:13 | INFO | stdout |...

DHaru85

removed a duplicate line

## Why are these changes needed? In the `fastchat/train/train.py` file, I found a repeated assignment of the tokenizer in the `LazySupervisedDataset` class. ```python class LazySupervisedDataset(Dataset): """Dataset for supervised fine-tuning.""" def...

gpgg

Add Support for Loading Models in 4-bit Quantized Version (Fixes #1798)

## Why are these changes needed? This pull request adds support for loading models in 4-bit quantized versions. This enhancement addresses the need for more efficient model loading and storage,...

02shanks

ERROR | stderr | ERROR: [Errno 99] error while attempting to bind on address ('::1', 21002, 0, 0): cannot assign requested address

代码： #!/bin/bash # 启动控制器 python -m fastchat.serve.controller --host 0.0.0.0 --port 2002 & # 为模型设置环境变量并启动 export CUDA_VISIBLE_DEVICES=1 python -m fastchat.serve.model_worker --model-path ./bge-large-zh-v1.5 --model-names gpt-4 --controller-address http://0.0.0.0:2002 & # 启动openai服务器 python -m...

LIUKAI0815

emmanuel-ferdman

FastChat
FastChat copied to clipboard

Metadata

update queue_lens on generation ends.

fix the bugs when some parameters are empty

Request to add [HODACHI/EZO-Humanities-9B-gemma-2-it] and [HODACHI/EZO-Common-9B-gemma-2-it] to Chatbot Arena

`httpx.RemoteProtocolError: peer closed connection without sending complete message body (incomplete chunked read)`

removed a duplicate line

Add Support for Loading Models in 4-bit Quantized Version (Fixes #1798)

ERROR | stderr | ERROR: [Errno 99] error while attempting to bind on address ('::1', 21002, 0, 0): cannot assign requested address

Llama3 Local Model

⚔️ Arena (side-by-side) sometimes show one model as "rate-exceeded" while the other model keeps generating things.

Fix image conversion state handling across formats

← Metadata

Owner

Metadata

FastChat FastChat copied to clipboard

Metadata

← Metadata

Owner

Metadata

FastChat
FastChat copied to clipboard