worker-vllm
Issue: Update VLLM to Version 0.5.0++, and a few suggestions
Description
- 🌟 Upgrade VLLM: We need to rocket VLLM to version 0.5.0++ or beyond! 🚀
- 🤖 Tensorize Awesomeness: The `tensorize` feature is like giving VLLM a turbo boost. 🏎️ Check out the Tensorize VLLM example for a sneak peek.
  - 🚀 It lets us load the model during download (but remember, the model needs a little conversion magic).
- 📦 Pip It Up: Why build VLLM from scratch when we can summon it with a pip package? Efficiency, my friend! 🧙‍♂️
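
If the worker does switch to the pip package, a small startup check could confirm that the installed VLLM really is 0.5.0 or newer. The snippet below is only a rough sketch of that idea, not anything taken from the worker-vllm codebase: the helper names (`parse_release`, `meets_minimum`, `installed_vllm_version`) and the `MIN_VLLM_VERSION` constant are made up for illustration, and the comparison looks only at the numeric release part of the version string.

```python
import re
from importlib import metadata
from typing import Optional, Tuple

# Assumption: 0.5.0 is the floor proposed in this issue.
MIN_VLLM_VERSION = "0.5.0"


def parse_release(version: str) -> Tuple[int, ...]:
    """Return the leading numeric release segment, e.g. '0.5.0.post1' -> (0, 5, 0)."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return tuple(int(part) for part in match.group().split("."))


def meets_minimum(installed: str, minimum: str = MIN_VLLM_VERSION) -> bool:
    """Compare numeric release segments only.

    Suffixes such as 'rc1', '.post1' or '+cu118' are ignored, so a
    pre-release like '0.5.0rc1' is treated the same as '0.5.0'.
    """
    have = parse_release(installed)
    want = parse_release(minimum)
    # Pad the shorter tuple with zeros so that '0.5' compares equal to '0.5.0'.
    width = max(len(have), len(want))
    have += (0,) * (width - len(have))
    want += (0,) * (width - len(want))
    return have >= want


def installed_vllm_version() -> Optional[str]:
    """Return the version of the pip-installed 'vllm' distribution, or None if absent."""
    try:
        return metadata.version("vllm")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_vllm_version()
    if found is None:
        print("vllm is not installed")
    elif meets_minimum(found):
        print(f"vllm {found} satisfies >= {MIN_VLLM_VERSION}")
    else:
        print(f"vllm {found} is older than {MIN_VLLM_VERSION}")
```

A check like this only reads package metadata, so it runs without importing VLLM or touching a GPU.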
Kudos to the stellar maintainer! 🌟🙌