text-generation-inference
RuntimeError: weight encoder.embed_tokens.weight does not exist
After running:
docker run --gpus all --shm-size 1g -p 8080:80 -v $PWD/data:/data ghcr.io/huggingface/text-generation-inference:0.9 --model-id google/flan-t5-small --num-shard 1
I receive:
RuntimeError: weight encoder.embed_tokens.weight does not exist
I tried several small models, but every one raises the same error.
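In case it helps with debugging, here is a minimal sketch for listing the tensor names stored in a downloaded .safetensors file, to see whether encoder.embed_tokens.weight is really absent or just stored under another name. It assumes the standard safetensors layout (an 8-byte little-endian header length followed by a JSON header) and uses only the Python standard library. The helper names list_tensor_names and has_weight are my own for illustration and are not part of text-generation-inference.

```python
import json
import struct


def list_tensor_names(path):
    """Return the sorted tensor names recorded in a .safetensors file header."""
    with open(path, "rb") as f:
        # The first 8 bytes hold the JSON header length as a little-endian uint64.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len).decode("utf-8"))
    # "__metadata__" is an optional header entry that does not describe a tensor.
    return sorted(name for name in header if name != "__metadata__")


def has_weight(path, name="encoder.embed_tokens.weight"):
    """Check whether a tensor with the given name is stored in the file."""
    return name in list_tensor_names(path)
```

Calling has_weight on a .safetensors file found under the mounted $PWD/data directory (the exact path depends on how the container caches the model, so I am not guessing it here) should show whether the tensor named in the error is present in the file at all.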
Any tips?
Thanks