FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
…issue #2994) ## Why are these changes needed? 1. Add support for Pydantic v2 and resolve the incompatibility with vLLM 0.3.0 [issue #2994](https://github.com/lm-sys/FastChat/issues/2994). 2. Fix an error when fine-tuning is loading by...
Could you add a chat template for CohereForAI/c4ai-command-r-plus?
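A minimal sketch of what such a template could look like, using FastChat's `register_conv_template` / `Conversation` API. The template name, the system message, and the exact turn/role tokens are assumptions copied from the model card and should be verified against the model's own chat template before use.

```python
# Sketch only: register a Command R+-style conversation template in FastChat.
# The role/turn tokens below are assumptions and must be checked against the
# tokenizer's chat template for CohereForAI/c4ai-command-r-plus.
from fastchat.conversation import (
    Conversation,
    SeparatorStyle,
    register_conv_template,
    get_conv_template,
)

register_conv_template(
    Conversation(
        name="c4ai-command-r-plus",  # hypothetical template name
        system_template="<|START_OF_TURN_TOKEN|><|SYSTEM_TOKEN|>{system_message}<|END_OF_TURN_TOKEN|>",
        system_message="You are Command R+, a helpful assistant.",  # placeholder
        roles=(
            "<|START_OF_TURN_TOKEN|><|USER_TOKEN|>",
            "<|START_OF_TURN_TOKEN|><|CHATBOT_TOKEN|>",
        ),
        sep_style=SeparatorStyle.NO_COLON_SINGLE,
        sep="<|END_OF_TURN_TOKEN|>",
        stop_str="<|END_OF_TURN_TOKEN|>",
    )
)

# Quick check of the rendered prompt.
conv = get_conv_template("c4ai-command-r-plus")
conv.append_message(conv.roles[0], "Hello!")
conv.append_message(conv.roles[1], None)
print(conv.get_prompt())
```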
Is this project only adapted to the Ascend NPU 910B chip? I am trying to run FastChat with vicuna-7b-v1.5 on an Ascend NPU 910A chip, but the inference speed is very, very...
I was using `gpt-4` and found that its template has `sep_style=None`, so `get_prompt` raised `ValueError: Invalid style: None`.
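A hedged sketch of a likely workaround: API-backed templates such as `gpt-4` appear to set `sep_style=None` because they are not meant to be flattened into a single prompt string, so serializing the conversation with `to_openai_api_messages()` instead of `get_prompt()` sidesteps the error. The example content below is illustrative.

```python
# Sketch: use the message-list serialization for API-style templates instead of
# get_prompt(), which raises "Invalid style: None" when sep_style is None.
from fastchat.model.model_adapter import get_conversation_template

conv = get_conversation_template("gpt-4")
conv.append_message(conv.roles[0], "What is FastChat?")
conv.append_message(conv.roles[1], None)  # trailing slot for the assistant reply

messages = conv.to_openai_api_messages()
print(messages)  # e.g. [{"role": "system", ...}, {"role": "user", ...}]
```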
Hi, sorry, I'm a big newb at this. I love that FastChat uses multiple GPUs, and I've gotten the web service/worker set up, but I don't know how to configure SillyTavern to...
I've noticed that the current implementation supports logprobs (thanks a lot!), but it appears to differ from how OpenAI handles them. I was wondering if it would be possible...
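A small repro sketch for comparing the two shapes: query FastChat's OpenAI-compatible server and print whatever `logprobs` structure comes back, then compare it with the OpenAI completions format (`tokens` / `token_logprobs` / `top_logprobs` / `text_offset`). The base URL, API key, and model name below are placeholders for a local deployment.

```python
# Sketch: inspect the logprobs payload returned by the OpenAI-compatible endpoint.
import openai

client = openai.OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

completion = client.completions.create(
    model="vicuna-7b-v1.5",      # placeholder model name
    prompt="Once upon a time",
    max_tokens=8,
    logprobs=5,                  # ask for the top-5 alternatives per token
)

# Print the returned structure for a side-by-side comparison with OpenAI's.
print(completion.choices[0].logprobs)
```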
ChatGLM3-6B adopts a newly designed [Prompt format](https://github.com/THUDM/ChatGLM3/blob/main/PROMPT_en.md), supporting multi-turn dialogues as usual. It also natively supports [tool invocation](https://github.com/THUDM/ChatGLM3/blob/main/tools_using_demo/README_en.md) (Function Call), code execution (Code Interpreter), and Agent tasks in complex scenarios....
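A minimal, hedged sketch of the multi-turn layout described in the linked PROMPT_en.md: each turn is a `<|role|>` tag followed by the content, with roles `system` / `user` / `assistant` / `observation` (the last one carrying tool results). The helper below is hypothetical, and exact whitespace and metadata lines should be double-checked against the linked document.

```python
# Sketch of the ChatGLM3 multi-turn prompt layout (verify against PROMPT_en.md).
def build_chatglm3_prompt(turns: list[tuple[str, str]]) -> str:
    """turns: (role, content) pairs, role in {system, user, assistant, observation}."""
    parts = [f"<|{role}|>\n {content}" for role, content in turns]
    # Leave a trailing assistant tag so the model continues the conversation.
    return "".join(parts) + "<|assistant|>"

prompt = build_chatglm3_prompt([
    ("system", "You are a helpful assistant."),
    ("user", "What's the weather in Beijing?"),
])
print(prompt)
```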
FastChat uses FastAPI internally. Does it support high concurrency? And does it support Qwen1.5?
# HuggingFace https://huggingface.co/Qwen/Qwen1.5-72B-Chat
Looking forward to support being added for Qwen1.5, including Qwen1.5-7B-Chat, Qwen1.5-7B-Chat-GPTQ-Int8, and so on. Qwen1.5 is more powerful than Qwen. Thank you.
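For reference, a hedged sketch of the prompt format the Qwen1.5 chat checkpoints expect: the tokenizer ships a ChatML chat template, so `apply_chat_template` shows what a matching FastChat conversation template would need to produce. The smaller 7B checkpoint is used here only to keep the example light.

```python
# Sketch: render the ChatML prompt that Qwen1.5 chat models expect.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen1.5-7B-Chat")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)  # <|im_start|>role\n...<|im_end|> blocks, ending with an assistant header
```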