Add Aphrodite Engine to Local Apps
This PR adds Aphrodite Engine to the list of local apps.
Aphrodite is a tensor-parallel LLM inference engine based on vLLM, with support for almost all transformers models and quantization formats. It currently supports:
- Hugging Face Transformers
- GGUF
- ExLlamaV2
- GPTQ
- AWQ
- bitsandbytes
- SmoothQuant+
- EETQ
- AQLM
- QuIP#
Deeplink support is not planned because Aphrodite is a CLI-only app. This is my first time writing TypeScript, so please let me know if I've made a mistake. Cheers!
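For reviewers who want a quick picture of what a CLI-only entry looks like without a deeplink, here is a rough sketch. It is not the code in this PR: the `LocalAppSketch` interface, the `aphroditeSketch` object, and the `LAUNCH_COMMAND` string are all hypothetical names made up for illustration, and the launch command itself is an assumption that should be checked against the Aphrodite docs.

```typescript
// Hypothetical shape for illustration only; the real local-app type
// in this repo may have different or additional fields.
interface LocalAppSketch {
  prettyLabel: string;
  // CLI-only apps return a shell snippet instead of a deeplink URL.
  snippet: (modelId: string) => string;
}

// Assumption: placeholder launch command, not verified against the Aphrodite CLI.
const LAUNCH_COMMAND = "aphrodite run";

const aphroditeSketch: LocalAppSketch = {
  prettyLabel: "Aphrodite Engine",
  snippet: (modelId: string): string =>
    [
      "# Start an Aphrodite server for the selected model:",
      `${LAUNCH_COMMAND} ${modelId}`,
    ].join("\n"),
};

// Example: render the snippet for a placeholder model id.
console.log(aphroditeSketch.snippet("org/model-name"));
```

The point of returning a string rather than a URL is that the UI can show a copyable command block for apps that have no deeplink handler.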
Here's the SVG, if needed.
Thanks for the contribution @AlpinDale - massive fan of your Hub work too! Sorry for the delay on this!
cc: @julien-c
I noticed there was a PR for vLLM that handles the quantization options much more cleanly. I'll probably update this PR to follow that approach.
ah i had missed that PR, thanks for pinging @Vaibhavs10!
aphrodite-engine looks cool 🔥
Thanks for reminding me, @Vaibhavs10! I'll work on this again tonight and hopefully we can finish it up.