FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Hi, I deployed the model backend using a FastChat worker. I tested the throughput of the llama3-8b model and it reached >2500 tokens/second on an A100. However, when I started using the same...
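For reference, a standard FastChat serving stack consists of a controller, one or more model workers, and optionally an OpenAI-compatible API server. The commands below are a minimal sketch of such a deployment; the model path and ports are illustrative, and the report above does not show the exact commands used:

```bash
# Start the controller that registers workers and routes requests to them
python3 -m fastchat.serve.controller --host 0.0.0.0 --port 21001

# Start a model worker for Llama 3 8B on one A100 (model path illustrative)
python3 -m fastchat.serve.model_worker \
    --model-path meta-llama/Meta-Llama-3-8B-Instruct \
    --controller-address http://localhost:21001

# Optionally expose an OpenAI-compatible REST API for load testing
python3 -m fastchat.serve.openai_api_server --host 0.0.0.0 --port 8000
```

FastChat also ships a vLLM-based worker (`python3 -m fastchat.serve.vllm_worker`), which typically gives much higher batched throughput than the default worker; the report does not say which worker was used.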
## Why are these changes needed? - We need to know why people come to the arena, and what frustrates them about it, so we can make it better...
## Why are these changes needed? This PR addresses the issue of duplicate `GeminiAdapter` class definitions found in the codebase. The changes aim to merge the two identical class definitions,...
I've noticed that there are two identical class definitions for `GeminiAdapter` in the same file. This appears to be an unintended duplication. - File: `fastchat/model/model_adapter.py` - Lines: 1202-1212 and 2193-2205...
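For illustration, the duplication pattern looks roughly like the sketch below (the method bodies are simplified, not the file's exact contents). Because the second definition shadows the first, removing either copy should not change behavior:

```python
# Simplified sketch of fastchat/model/model_adapter.py -- not the exact file contents.

class GeminiAdapter(BaseModelAdapter):
    """First definition (around line 1202)."""

    def match(self, model_path: str):
        return "gemini" in model_path.lower()


# ... roughly a thousand lines later ...


class GeminiAdapter(BaseModelAdapter):  # noqa: F811 -- redefinition
    """Second, identical definition (around line 2193).

    Python binds the name to this later class, so the first definition
    is dead code; deleting either copy (and any duplicate
    register_model_adapter(GeminiAdapter) call) resolves the issue.
    """

    def match(self, model_path: str):
        return "gemini" in model_path.lower()
```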
Ascend NPU: how to use multiple NPUs
npu: 910B * 8, model: baichuan-13B, torch: 2.1.0, torch_npu: 2.1.0, fastchat: 0.2.36, transformers: 4.43.3. I use the command `python3 -m fastchat.serve.cli --model-path baichuan-13b/ --device npu` to run the FastChat CLI,...
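A possible multi-NPU invocation is sketched below. The CLI's `--num-gpus` flag shards a model across devices on CUDA; whether the same flag also shards across Ascend NPUs with the torch_npu backend is an assumption here and should be verified:

```bash
# Untested sketch: --num-gpus is the CLI's model-parallel flag for GPUs;
# its behavior with --device npu depends on the torch_npu backend.
python3 -m fastchat.serve.cli \
    --model-path baichuan-13b/ \
    --device npu \
    --num-gpus 8
```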
We are a GPU company and would like to add support for our GPU backend to FastChat, so we would like to submit a PR (pull request), but we do not...
I've manually added a conversation template and a model adapter for Codestral:22b-v0.1, but when I call the model, it sometimes fails to stop responding and performs poorly, but...
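One common cause of responses that never stop is a conversation template registered without `stop_str`/`stop_token_ids`. The sketch below shows how such a template is registered in FastChat; the template name, roles, separator style, and EOS token are assumptions for illustration, not a verified Codestral template (check the model's tokenizer config for the real EOS token):

```python
# Hypothetical Codestral template -- roles, sep_style, and EOS are assumptions.
from fastchat.conversation import (
    Conversation,
    SeparatorStyle,
    register_conv_template,
)

register_conv_template(
    Conversation(
        name="codestral",                 # hypothetical template name
        system_message="",
        roles=("[INST]", "[/INST]"),      # Mistral-style roles, assumed
        sep_style=SeparatorStyle.LLAMA2,  # assumed separator style
        sep=" ",
        sep2=" </s><s>",                  # assumed turn separator
        stop_str="</s>",                  # without a stop string, generation can run on
        stop_token_ids=[2],               # assumed EOS id; verify against the tokenizer
    )
)
```

If the stop token is set correctly but quality is still poor, comparing the rendered prompt against the model's official chat format is usually the next step.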