FastChat
FastChat copied to clipboard
Model worker with Nvidia NIM?
trafficstars
Nvidia has their own solution to deploy large language models.
Would it make sense to have an adapter?
https://developer.nvidia.com/blog/deploy-multilingual-llms-with-nvidia-nim