[Examples] Add TensorRT-LLM example (end-to-end)
@peterschmidt85 : May I take this up?
Only if you know how. It's not an easy one. TensorRT-LLM is one of the most complicated stacks. The example should show it end-to-end: how to build a model and how to serve it.
Thank you @peterschmidt85 for the heads-up. I will give it a try and keep you posted on how it goes. Could you point me to any dstack example that covers a similar set of tasks, such as:
- build the model
- serve it
I would invite you to explore further what dstack is and how it works, and of course what TensorRT-LLM is and how it works.
Thank you @peterschmidt85.
This issue is stale because it has been open for 30 days with no activity.
This issue was closed because it has been inactive for 14 days since being marked as stale. Please reopen the issue if it is still relevant.