data-on-eks

NVIDIA Dynamo blueprint on EKS

Open vara-bonthu opened this issue 9 months ago • 4 comments

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or other comments that do not add relevant new information or questions; they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

What is the outcome that you are trying to reach?

  • A new NVIDIA Dynamo blueprint to showcase inference capabilities for large models

Describe the solution you would like

Describe alternatives you have considered

Additional context

vara-bonthu • Mar 21 '25 19:03

I propose replacing the Triton pattern, since NVIDIA already redirects Triton to Dynamo: https://developer.nvidia.com/triton-inference-server

hustshawn • Mar 24 '25 09:03

Agreed with @hustshawn: we should deprecate the existing Triton blueprint and replace it with Dynamo.

askulkarni2 • Mar 24 '25 16:03

I wouldn’t recommend replacing or immediately deprecating the Triton Server blueprint. We still have customers actively using this pattern who haven’t yet migrated to Dynamo.

A better approach would be to first create and publish the Dynamo blueprint. Once that's available, we can add a deprecation notice to the Triton blueprint, including clear guidance on how users can transition to the new Dynamo pattern before we fully remove it.

vara-bonthu • Mar 24 '25 18:03

This issue has been automatically marked as stale because it has been open for 30 days with no activity. Remove the stale label or comment, or this issue will be closed in 10 days.

github-actions[bot] • Apr 24 '25 00:04

Issue closed due to inactivity.

github-actions[bot] • May 04 '25 00:05