
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Results: 766 FastChat issues, sorted by recently updated

```
Some weights of the model checkpoint at CohereForAI/c4ai-command-r-plus-4bit were not used when initializing CohereForCausalLM: ['model.layers.0.self_attn.k_norm.weight', 'model.layers.0.self_attn.q_norm.weight', 'model.layers.1.self_attn.k_norm.weight', 'model.layers.1.self_attn.q_norm.weight', 'model.layers.10.self_attn.k_norm.weight', 'model.layers.10.self_attn.q_norm.weight', 'model.layers.11.self_attn.k_norm.weight', 'model.layers.11.self_attn.q_norm.weight', 'model.layers.12.self_attn.k_norm.weight', 'model.layers.12.self_attn.q_norm.weight', 'model.layers.13.self_attn.k_norm.weight', 'model.layers.13.self_attn.q_norm.weight', 'model.layers.14.self_attn.k_norm.weight', 'model.layers.14.self_attn.q_norm.weight', 'model.layers.15.self_attn.k_norm.weight', 'model.layers.15.self_attn.q_norm.weight',...
```

## Why are these changes needed? The [documentation](https://huggingface.co/Phind/Phind-CodeLlama-34B-v2) on HuggingFace shows that the Phind-CodeLlama models use newline separators in their conversation templates. As it is currently...
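As context for the fix, here is a minimal sketch of how a newline-separated template can be registered with `fastchat.conversation`, assuming a recent FastChat version; the template name, system prompt, and role strings below are illustrative assumptions, not the exact template this PR adds:

```python
# Sketch: register a conversation template that joins turns with newlines
# instead of colon-style separators. Name and role strings are hypothetical.
from fastchat.conversation import (
    Conversation,
    SeparatorStyle,
    get_conv_template,
    register_conv_template,
)

register_conv_template(
    Conversation(
        name="phind-newline-example",  # hypothetical name for illustration
        system_message="### System Prompt\nYou are an intelligent programming assistant.",
        roles=("### User Message", "### Assistant"),
        sep_style=SeparatorStyle.ADD_NEW_LINE_SINGLE,  # emits "role\nmessage" + sep
        sep="\n\n",
    )
)

conv = get_conv_template("phind-newline-example")
conv.append_message(conv.roles[0], "Implement a linked list in C++.")
conv.append_message(conv.roles[1], None)  # leave the assistant turn open
print(conv.get_prompt())
```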

Currently the [Reka](https://www.reka.ai/) Flash model can be compared in the Chatbot Arena, but the smaller Edge and larger Core cannot. It would be interesting to see how these two models...

Hi all, maybe there's an obvious reason why this can't be done, but it'd be really amazing to have access to the MMLU scores for the GPT-4-turbo models. I'm not...

I ran into a problem: after fine-tuning several series of models, I tried to deploy them, only to hit the same error each time. I want to know how to solve...

I just want to serve the `CohereForAI/c4ai-command-r-plus-4bit` model, but after installing `bitsandbytes` I get this error when running: ``` entrypoint: [ "python3.9", "-m", "fastchat.serve.model_worker", "--model-names", "command-r-plus-4bit", "--model-path", "CohereForAI/c4ai-command-r-plus-4bit", "--worker-address",...
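As a first debugging step, here is a minimal sketch that loads the checkpoint outside FastChat entirely; it assumes a `transformers` release that includes the Cohere architecture (the unused `q_norm`/`k_norm` weights in the warning above suggest an older version), plus `bitsandbytes` and `accelerate` installed:

```python
# Sketch: verify the pre-quantized checkpoint loads cleanly in transformers
# alone. The checkpoint appears to ship its own bitsandbytes quantization
# config, so no extra quantization arguments are passed here.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "CohereForAI/c4ai-command-r-plus-4bit"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",  # requires accelerate
)
# With Cohere support present, this should print "CohereForCausalLM" and
# emit no "weights were not used" warning.
print(type(model).__name__)
```

If this succeeds but the FastChat worker still fails, the problem is likely in the serving setup rather than the model environment.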

## Why are these changes needed? Adding support for two new IBM models: 1. Labradorite-13b: https://huggingface.co/ibm/labradorite-13b 2. Merlinite-7b: https://huggingface.co/ibm/merlinite-7b ## Checks - [X] I've run `format.sh` to lint the changes...

I have some local models and also an Azure OpenAI API subscription. I'm looking to access both in a consistent way via fastchat.serve.openai_api_server. I don't know if this is feasible,...
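For reference, here is a minimal sketch of the consistent access the issue asks about, using the `openai` client against FastChat's OpenAI-compatible server; the base URL, port, and model name are assumptions for illustration:

```python
# Sketch: query fastchat.serve.openai_api_server through the standard
# openai client. Point base_url at wherever the API server is running and
# use a model name actually registered with the controller.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # assumed default API server address
    api_key="EMPTY",  # FastChat does not check the key by default
)

response = client.chat.completions.create(
    model="vicuna-7b-v1.5",  # hypothetical model name for illustration
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```

The appeal of the OpenAI-compatible server is that the same client code can target either a local worker or a hosted endpoint just by changing the connection settings, which is the consistency being requested here.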

Is there any plan to provide a pre-built Docker image on Docker Hub or GitHub Packages? I think it would be very helpful and make the project more accessible to a lot of people.

```log
~/repo/FastChat$ python -m fastchat.serve.model_worker --model-path ~/repo/models/Qwen-14B-Chat-Int4 --gptq-wbits 4 --gptq-groupsize 128 --model-names gpt-3.5-turbo
2023-09-28 14:36:05 | INFO | model_worker | args: Namespace(host='localhost', port=21002, worker_address='http://localhost:21002', controller_address='http://localhost:21001', model_path='~/repo/models/Qwen-14B-Chat-Int4', revision='main', device='cuda', gpus=None, num_gpus=1,...
```