FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
## Why are these changes needed? The model worker (and vllm_worker) fails to load [Phi-3 models](https://huggingface.co/microsoft/Phi-3-mini-128k-instruct) such as "microsoft/Phi-3-mini-128k-instruct".  ## Checks - [x] I've run `format.sh` to lint...
Hi! Thank you for your excellent work on LLM evaluation! I'm inspired to create a French version of MT-Bench. Currently, I'm in the process of generating reference answers for...
Can we get a new release with llama3 support?
I would like to try 'top_k' in my scenario to see if the output differs, but the OpenAI invoke method does not seem to support this parameter. https://platform.openai.com/docs/api-reference/chat/create I...
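If you are calling FastChat's OpenAI-compatible server directly over HTTP, a minimal sketch of building a chat-completion request that carries the extra parameter might look like the following. Note the assumptions: `top_k` is not part of the official OpenAI API schema, the model name is illustrative, and whether the server honors the extra field should be verified against the FastChat API server you are running.

```python
import json

# Hypothetical chat-completion payload for a FastChat OpenAI-compatible
# server. `top_k` is NOT an official OpenAI API parameter; it is assumed
# here that the FastChat server accepts it as an extra sampling field.
payload = {
    "model": "vicuna-7b-v1.5",  # assumed model name for illustration
    "messages": [
        {"role": "user", "content": "Hello!"},
    ],
    "temperature": 0.7,
    "top_k": 40,  # the extra sampling parameter under discussion
}

# Serialize to JSON; this body would then be POSTed to
# /v1/chat/completions on the FastChat server.
body = json.dumps(payload)
print(body)
```

The same idea applies when using an SDK: the extra field has to reach the request body unchanged, which the stock OpenAI client does not do for unknown parameters.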
I am running FastChat on Kubernetes. I have a worker for the controller, one for the FastChat API, and a (GPU) worker for each of the models. When I pull this model...
What is the official system message for the Llama 3 model (`NousResearch/Meta-Llama-3-8B-Instruct`)? I used the following in my code, but the system message is missing here: ``` register_conv_template( Conversation( name="llama-3",...
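For reference, Meta's published Llama 3 chat format wraps the system message in header tokens. A minimal sketch of assembling a single-turn prompt by hand is below; the token layout follows Meta's model card, but treat the exact formatting as an assumption to verify against the "llama-3" template FastChat registers in `conversation.py`.

```python
def build_llama3_prompt(system: str, user: str) -> str:
    """Assemble a single-turn Llama 3 chat prompt.

    Token layout follows Meta's published chat format; verify it
    against the template FastChat registers under name="llama-3".
    """
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

prompt = build_llama3_prompt("You are a helpful assistant.", "Hi!")
print(prompt)
```

If the system message is dropped at inference time, comparing the prompt the worker actually sends against this layout is a quick way to see where it is lost.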
Please add this traditional Chinese model https://huggingface.co/taide
Hi, I want to use the `Vicuna v1.5 7b` model for RAG, or Retrieval-Augmented Generation (Q&A based on retrieved documents or context). I tried many prompts based on Llama 2...
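For context, Vicuna v1.5 uses a plain "USER: ... ASSISTANT:" turn format rather than Llama 2's `[INST]` tags, which is why Llama 2 prompts often underperform with it. A minimal sketch of a RAG-style prompt in the Vicuna format follows; the system preamble is the one commonly associated with Vicuna, while the way the retrieved context is injected into the user turn is an illustrative assumption, not an official recipe.

```python
def build_vicuna_rag_prompt(context: str, question: str) -> str:
    # Vicuna v1.5 expects a "USER: ... ASSISTANT:" turn format,
    # not Llama 2's [INST] style. How the retrieved context is
    # embedded in the user turn is an illustrative choice.
    system = (
        "A chat between a curious user and an artificial intelligence "
        "assistant. The assistant gives helpful, detailed, and polite "
        "answers to the user's questions."
    )
    user = (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    return f"{system} USER: {user} ASSISTANT:"

prompt = build_vicuna_rag_prompt(
    "Paris is the capital of France.",
    "What is the capital of France?",
)
print(prompt)
```

Verify the turn separators against the "vicuna_v1.1" template in FastChat's `conversation.py` before relying on this layout.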
https://chat.lmsys.org/?leaderboard mentions `GPT-3.5-Turbo-0314`, but I believe it should be `GPT-3.5-Turbo-0301`, as indicated in `model_adapter.py`: https://github.com/lm-sys/FastChat/blob/f2e6ca964af7ad0585cadcf16ab98e57297e2133/fastchat/model/model_adapter.py#L71