FastChat issues

How to register request style api to serve on web UI?

1

Hi all, I have a service called by `requests.post`, how to host it on the fastchat web ui?

Support Gemini Pro 1.5

[OpenRouter](https://openrouter.ai/models/google/gemini-pro-1.5) now supports Gemini Pro 1.5, so it should be accessible to the public.

NightMachinery

upload reference file for gpt-4-0125-preview as judge to mitigate wrong reference answers by gpt-4

7

## Why are these changes needed? We (two Applied Scientists with graduate degrees in Artificial Intelligence) recently manually vet through the reference answers for the default gpt-4 judge. To our...

Zhilin123

Question about gen_model_answer.py with gpt2-large

If I run ``` python gen_model_answer.py --model-path openai-community/gpt2-large --model-id gpt2-large ``` under the directory `FastChat/fastchat/llm_judge` I get this error ``` /opt/conda/conda-bld/pytorch_1711403380909/work/aten/src/ATen/native/cuda/Indexing.cu:1237: indexSelectSmallIndex: block: [8,0,0], thread: [86,0,0] Assertion `srcIndex < srcSelectDimSize`...

QiyaoWei

python -m fastchat.serve.model_worker --model-path /data1/workspaces/llama2/Llama-2-7b-chat-hf --host 0.0.0.0

6

![image](https://github.com/lm-sys/FastChat/assets/72801955/7ae9d8b6-3fbd-49c2-b665-50d832e2b475)

alf-wangzhi

[Feature request] Support loading GGUF and GGML model format

5

nghidinhit

good first issue

H2O-Danube template & minor fixes

## Why are these changes needed? This PR adds a new template for H2O-Danube such as: https://huggingface.co/h2oai/h2o-danube2-1.8b-chat Additionally, I have updated a few minor things: - Changed `conv = get_conversation_template(model_id)`...

psinger

Woker erorr under python3.8: AttributeError: module 'asyncio' has no attribute 'to_thread'

3

``` /workspace# python3 -m fastchat.serve.model_worker ... "POST /worker_generate HTTP/1.1" 500 Internal Server Error 2023-12-13 04:01:05 | ERROR | stderr | ERROR: Exception in ASGI application .... 2023-12-13 04:01:07 | ERROR...

YulunCai

Gradio UI 0.2.36 not launching

2

I am running the gradio web UI 0.2.36 in a air-gapped Kubernetes cluster. Gradio app cannot start. **Version 0.2.34 used to work without issues.** I tried the following: - setting...

stephanbertl

How to use multiple Ascend NPUs?

6

Why is the --device npu parameter fixed to support only one Ascend NPU in code instead of multiple NPUs? ` def generate_stream_gate(self, params):` ` if self.device == "npu":` ` import...

litmonk

FastChat
FastChat copied to clipboard

Metadata

How to register request style api to serve on web UI?

Support Gemini Pro 1.5

upload reference file for gpt-4-0125-preview as judge to mitigate wrong reference answers by gpt-4

Question about gen_model_answer.py with gpt2-large

python -m fastchat.serve.model_worker --model-path /data1/workspaces/llama2/Llama-2-7b-chat-hf --host 0.0.0.0

[Feature request] Support loading GGUF and GGML model format

H2O-Danube template & minor fixes

Woker erorr under python3.8: AttributeError: module 'asyncio' has no attribute 'to_thread'

Gradio UI 0.2.36 not launching

How to use multiple Ascend NPUs?

← Metadata

Owner

Metadata

FastChat FastChat copied to clipboard

Metadata

← Metadata

Owner

Metadata

FastChat
FastChat copied to clipboard