worker-vllm OpenAI API: API errors have wrong HTTP code

OpenAI API: API errors have wrong HTTP code

Open lucasavila00 opened this issue 11 months ago • 3 comments

Using a model that does not exist returns HTTP status 200, but the error message is in the JSON

Mar 13 '24 00:03 lucasavila00

Are you using the endpoint regularly or through openai compatibility?

Mar 16 '24 01:03 alpayariyak

@lucasavila00

Mar 20 '24 19:03 alpayariyak

Open AI Compatibility:

$ curl -i https://api.runpod.ai/v2/yyyyy/openai/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer xxxxxx" -d '{
"model": "a model that does not exist",
"messages": [
  {
    "role": "user",
    "content": "Why is RunPod the best platform?"
  }
],
"temperature": 0,
"max_tokens": 100
}'


HTTP/2 200 
date: Thu, 21 Mar 2024 01:29:58 GMT
content-type: application/json; charset=utf-8
content-length: 133
cf-cache-status: DYNAMIC
set-cookie: __cflb=zzzzz; SameSite=None; Secure; path=/; expires=Fri, 22-Mar-24 00:29:58 GMT; HttpOnly
server: cloudflare
cf-ray: uuuu-GRU

{"code":404,"message":"The model `a model that does not exist` does not exist.","object":"error","param":null,"type":"NotFoundError"}

Notice the HTTP/2 200

Mar 21 '24 01:03 lucasavila00

Will fix the issue this week, thank you!

Apr 01 '24 02:04 alpayariyak

worker-vllm worker-vllm copied to clipboard

OpenAI API: API errors have wrong HTTP code

worker-vllm
worker-vllm copied to clipboard