gateway [Provider] Support for Nvidia NeMo

[Provider] Support for Nvidia NeMo

Open vrushankportkey opened this issue 1 year ago • 1 comments

Aug 06 '24 08:08 vrushankportkey

Models hosted using Nvidia's NeMo servers expose Nvidia's Triton inference API's, we have a PR for that already (currently only for text completions, not chat completions) https://github.com/Portkey-AI/gateway/pull/445 https://docs.nvidia.com/nemo-framework/user-guide/latest/overview.html

Aug 21 '24 09:08 narengogi

gateway gateway copied to clipboard

[Provider] Support for Nvidia NeMo

gateway
gateway copied to clipboard