private-gpt
Support for parallel users
I have private-gpt set up and running on an NVIDIA A100 GPU with very good response times. The server is configured and accessible, but two people cannot use it at the same time: one has to wait until the other's response has finished. Meanwhile most of the GPU's resources sit idle, with only 25 GB of GPU memory in use. Please assist or refer me to the right resource.
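In case it helps with reproducing this, here is a minimal sketch of how the waiting behaviour could be measured. It is not private-gpt code: `time_concurrent_calls` and `fake_serialized_request` are hypothetical names, and the fake request is a stand-in that simulates a server handling one request at a time. To test a real deployment, `send_request` would be replaced with a function that calls the server's own endpoint.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


def time_concurrent_calls(send_request, n_clients=2):
    """Call send_request from n_clients threads at once.

    Returns (per-call durations in seconds, total wall-clock seconds).
    """
    def timed(i):
        start = time.perf_counter()
        send_request(i)
        return time.perf_counter() - start

    wall_start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=n_clients) as pool:
        durations = list(pool.map(timed, range(n_clients)))
    return durations, time.perf_counter() - wall_start


# Hypothetical stand-in: a lock forces requests to run one after another,
# which imitates the behaviour described above.
_server_lock = threading.Lock()


def fake_serialized_request(i):
    with _server_lock:
        time.sleep(0.2)  # pretend the model takes 0.2 s to answer


if __name__ == "__main__":
    durations, wall = time_concurrent_calls(fake_serialized_request)
    print("per-call seconds:", [round(d, 2) for d in durations])
    print("total seconds:", round(wall, 2))
```

If the total time is close to the sum of the single-request times instead of close to one single-request time, the requests are being handled one after another.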
@imartinez