text-embeddings-inference icon indicating copy to clipboard operation
text-embeddings-inference copied to clipboard

Cant not request

Open wanzhixiao opened this issue 11 months ago • 1 comments

System Info

CUDA Version: 12.0 , with A10 GPU CentOS: 7.9

Information

  • [X] Docker
  • [ ] The CLI directly

Tasks

  • [X] An officially supported command
  • [ ] My own modifications

Reproduction

I run the text-embedding-inference program by the following script

volume=/data/pretrain_model
model=/data/pretrain_model/bge-small-zh-v1.5
revision=refs/pr/5

docker run -d --restart=always --gpus all\
            -p 8070:80 \
            -v $volume:/data --pull always ghcr.io/huggingface/text-embeddings-inference:86-1.2 \
            --model-id $model \
            --tokenization-workers 2

and the docker has successfully run, however, when i try the following command for caculate embedding:

curl 127.0.0.1:8070/embed_sparse \
    -X POST \
    -d '{"inputs":"I like you."}' \
    -H 'Content-Type: application/json'

the error ocurred: curl: (56) Recv failure: Connection reset by peer, how can i solve it? image

docker logs image

Expected behavior

return the embedding

wanzhixiao avatar Mar 24 '24 11:03 wanzhixiao

You can refer to #207

CharleyXu avatar Mar 28 '24 02:03 CharleyXu