Rohan Pooniwala

3 comments by Rohan Pooniwala

I had a similar issue. It turns out I had to **open all the ports for my server in the network firewall**, as the client tries to connect to the assigned port on...
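A quick way to confirm whether a firewall is blocking a given port is to attempt a TCP connection from the client machine. The sketch below uses only the Python standard library; the helper name `is_port_open` and the host and port in the usage comment are hypothetical placeholders, not values from the original thread.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within `timeout` seconds."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


# Hypothetical usage: replace the host and port with your own server's values.
# print(is_port_open("my-server.example.com", 8080))
```

A `False` result only shows that the connection failed from where the script ran; it could be the firewall, or simply that nothing is listening on that port yet.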

Hey! I am able to deploy `lmsys/vicuna-13b-v1.5-16k` on 4 x NVIDIA A10Gs (g5.12xlarge) using the latest image. Here is the command I am using to run it: `docker run --gpus...

I am using v1.0.0 because, AFAIK, TGI supported RoPE scaling after releasing v1.0.0, and `lmsys/vicuna-13b-v1.5-16k` uses it. From their HF page -> ```Vicuna v1.5 (16k) is fine-tuned from Llama 2...
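For readers unfamiliar with RoPE scaling, here is a minimal sketch of one common variant, linear scaling (position interpolation), where positions are divided by a factor before the rotary angles are computed. This is an illustration only, not TGI's implementation; the function name `rope_angles`, the base of 10000, and the factor of 4 (stretching a 4k context to 16k) are assumptions made for the example.

```python
def rope_angles(position: int, dim: int, base: float = 10000.0, scale: float = 1.0) -> list[float]:
    """Rotary angles for one position; `scale` > 1 applies linear RoPE scaling (assumed variant)."""
    # Linear scaling compresses positions so a longer sequence maps into the trained range.
    scaled_position = position / scale
    # One angle per pair of dimensions, with frequency base**(-2i/dim).
    return [scaled_position * base ** (-2 * i / dim) for i in range(dim // 2)]


# With an assumed factor of 4, position 16000 gets the same angles as position 4000 unscaled.
same = rope_angles(16000, 8, scale=4.0) == rope_angles(4000, 8)
```

The point of the sketch is that positions beyond the original context length are mapped back into the range of angles the base model saw during training.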