AlpinDale


> So just an update chunked prefill works for 2xRTX 3090 24GB NVLink for Llama 3.1 8B FP16 at full 128K context. It outputs normal non-gibberish text. Yay! Thanks alpin....

@theobjectivedad thanks for the PR and sorry for the late review! I made some changes including rebasing it to the release candidate branch. Please let me know if this is...

Hi yes, they're supported. The easiest way is to use them through Docker. I will be updating the wiki in the next release with detailed instructions. Stay tuned!

I believe our p2p detection has improved as of v0.6.0. Can you try again?

Oh boy. I suppose we have to pin numpy versions too from now on. Thanks for reporting! Does torch not pin numpy already? Weird.

For anyone else who might be running into this issue, please downgrade numpy:

```sh
pip install numpy==1.26.4
```
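If it helps anyone debugging this, here is a minimal sketch of a startup guard that refuses to run against an incompatible numpy. It assumes the breakage comes from the numpy 2.x ABI change, which is why the 1.26.4 pin works; the function name and message are illustrative, not part of the project.

```python
def check_numpy(version: str) -> None:
    """Raise if the installed numpy is 2.x (assumed source of the breakage).

    Wheels compiled against the numpy 1.x ABI can fail at import time
    under numpy 2.x, hence the pin to 1.26.4 above.
    """
    major = int(version.split(".", 1)[0])
    if major >= 2:
        raise RuntimeError(
            f"numpy {version} detected; please run: pip install numpy==1.26.4"
        )


# Example: check the currently installed numpy before loading the engine.
# import numpy
# check_numpy(numpy.__version__)
```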

Looks like you're on Python 3.12. At the moment, we only have wheels for Python versions 3.8-3.11. Please downgrade. You can either use conda to do this, or a python...
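For reference, a quick way to see whether your interpreter falls inside the supported range is to compare `sys.version_info` against the 3.8-3.11 window mentioned above. This helper is just a sketch for self-diagnosis, not project code:

```python
import sys


def wheels_available(version_info=sys.version_info) -> bool:
    """Return True if prebuilt wheels cover this interpreter (3.8-3.11)."""
    return (3, 8) <= tuple(version_info[:2]) <= (3, 11)


# Example: print a hint before attempting an install.
if not wheels_available():
    print(f"Python {sys.version_info.major}.{sys.version_info.minor} "
          "has no prebuilt wheels; downgrade to 3.8-3.11.")
```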

Python 3.12 is now officially supported as of v0.6.0.

This should work perfectly fine as of v0.6.0. Feel free to re-open the issue if the problem persists.