dstack icon indicating copy to clipboard operation
dstack copied to clipboard

[Examples] Add vLLM example

Open peterschmidt85 opened this issue 1 year ago • 9 comments

peterschmidt85 avatar Aug 26 '24 11:08 peterschmidt85

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] avatar Sep 26 '24 01:09 github-actions[bot]

Curious to know how vllm is different from this issue?

bikash119 avatar Oct 01 '24 13:10 bikash119

Curious to know how vllm is different from this issue?

@bikash119 We'd like to have it as a separate example under the Deployment category (in addition to other categories such as Fine-tuning, Accelerators, etc.).

The example should feature most essential information around using vLLM with dstack.

Note, dstack allows to use vllm with both tasks and services. Should we show both, also explaining briefly the difference?

peterschmidt85 avatar Oct 01 '24 13:10 peterschmidt85

Apologies for being a noob here. Under Deployment category means, to provide vllm as a serverless inference service on dstack?

bikash119 avatar Oct 01 '24 14:10 bikash119

Apologies for being a noob here. Under Deployment category means, to provide vllm as a serverless inference service on dstack?

@bikash119 Currently, there are three categories on https://dstack.ai/examples/: Fine-tuning, Accelerators, and LLMs. Let's add Deployment - can be the first in the list.

peterschmidt85 avatar Oct 01 '24 14:10 peterschmidt85

Ok, so under Deployment , we should have 2 examples shown

  • How to use vLLM for service
  • How to use vLLM for task Does this sound ok?

bikash119 avatar Oct 01 '24 14:10 bikash119

Ok, so under Deployment , we should have 2 examples shown

  • How to use vLLM for service
  • How to use vLLM for task Does this sound ok?

Under Deployment, we are going to have one card "vLLM".

This will lead to https://dstack.ai/examples/deployment/vllm.

The sourcecode must be in examples/examples/deployment/vllm/README.md (it's copied to the docs when docs are built). See other cards on https://dstack.ai/examples/ to follow the structure and style.

Let me know if this helps

peterschmidt85 avatar Oct 01 '24 14:10 peterschmidt85

Thank you @peterschmidt85 for being patient with my questions.

bikash119 avatar Oct 01 '24 15:10 bikash119

@peterschmidt85 : I have made the required changes. Verifed by executing mkdocs serve

bikash119 avatar Oct 02 '24 02:10 bikash119

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] avatar Nov 02 '24 01:11 github-actions[bot]

This issue is stale because it has been open for 30 days with no activity.

github-actions[bot] avatar Dec 07 '24 02:12 github-actions[bot]

This issue was closed because it has been inactive for 14 days since being marked as stale. Please reopen the issue if it is still relevant.

github-actions[bot] avatar Dec 24 '24 01:12 github-actions[bot]