elkay
> ```
> [pip3] torchvision==0.16.2+cu121
> [conda] torchvision 0.16.2+cu121 pypi_0 pypi
> ```
>
> Try uninstalling these versions first?

What would that accomplish? That's literally the package that I'm...
> > Built Torch 2.1.2 and TorchVision 2.1.2 from source
>
> What version of torchvision are you building from source, exactly? There's no torchvision 2.x. The latest stable version...
The box is shut down now, but I believe it was pyproject.toml that I had to update to point directly at my torch wheel, and the command I used was "python...
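For anyone trying the same thing, the change would look roughly like this; the path and wheel filename below are placeholders, not the ones I actually used:

```toml
# Hypothetical sketch only: point the torch dependency at a locally built wheel
# instead of the PyPI release. Adjust the file:// path to wherever your wheel lives.
[project]
dependencies = [
    "torch @ file:///home/user/wheels/torch-2.1.2-cp311-cp311-linux_x86_64.whl",
]
```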
Also interested in this, to run on AWS Graviton servers.
Same, but for a custom build to run on AWS Linux. `torch.__version__` = 2.3.0a0+git26431db. Everything else works; I just can't get exllamav2 to use flash_attn, even if simply installing...
Actually, I just noticed this PR: https://github.com/Dao-AILab/flash-attention/pull/757

It seems like exactly what I'm looking for. Hopefully the PR can be approved.
> Does either #757 or #724 work for you?

#757 did end up working for me.
Having the same issue. Those files aren't in the /tmp folder.
Figured it out: you need to `ls` the /tmp directory and find the string that the "*" in the path stands for. Replace "*" with that string and the scripts will run...
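Roughly like this; the directory and script names below are placeholders, since they will differ on each machine:

```sh
# List /tmp to see which directory the "*" in the documented path actually refers to.
ls /tmp
# Example only: if the instructions reference /tmp/*/install.sh and ls shows "build-abc123",
# run the script with the real name substituted in:
# bash /tmp/build-abc123/install.sh
```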
Some additional information: after testing a bit further, this error only seems to happen when using GPTQ-format models (my preference, due to speed). When I loaded the...