Cannot use /v1/embeddings api with BGE-M3 model
fastchat version 0.2.36
First, start the services:
python3 -m fastchat.serve.controller --port 21002 --host 0.0.0.0
python3 -m fastchat.serve.model_worker --port 21003 --model-names bge-m3 --model-path /root/proj/models/bge-m3 --device cpu --controller-address http://0.0.0.0:21002 --host 0.0.0.0
python3 -m fastchat.serve.openai_api_server --controller-address http://0.0.0.0:21002 --port 8001 --host 0.0.0.0
Then, post requests to the 8001 port:
import requests
url = "http://10.99.106.214:8001/v1/embeddings" headers = { "Content-Type": "application/json" } data = { "model": "bge-m3", "input": "Hello world!" }
response = requests.post(url, headers=headers, json=data)
查看响应内容
print(response.text)
and get the response:
{"object":"error","message":"Not Found","code":50001}
@chillizex , this error is from fastchat, so you can open a issue in fastcaht repo. We're sorry that we cannot address this issue.
@chillizex , this error is from fastchat, so you can open a issue in fastcaht repo. We're sorry that we cannot address this issue.
Thank you, btw, do you have some recommand deploy method that offers openai compatible api?
same question