Rajat Saxena

Results: 1 issue by Rajat Saxena

I am deploying text-generation-inference on EKS with each node having 1 NVIDIA A10G GPU. How should I create a group such that a model like llama-2-13b-chat is able to use...