llama-cpp-python
arm64 builds for CUDA
Many people use this library on NVIDIA Jetson/Orin devices, but there are no prebuilt CUDA wheels available for the arm64 architecture.
Could support for automated builds of these wheels be added? Thanks!
@mcvella I think so. I don't want to expand the builds arbitrarily (that's a big issue at the moment), but it would be good to have wheels for popular devices if we can also limit the CUDA and Python versions. If we can collect some of that info here (device, CUDA version, Python version), I'll see how we can expand the builds.
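For anyone reporting their setup, here is a minimal sketch of a script that could gather those details. It is not part of llama-cpp-python: `collect_build_info` is a hypothetical helper name, and it assumes the CUDA toolkit's `nvcc` is on `PATH` (on some devices it is not, in which case the CUDA field is left empty) and that `nvcc --version` prints a `release X.Y` string.

```python
import platform
import re
import shutil
import subprocess


def collect_build_info():
    """Hypothetical helper: gather details relevant to choosing wheel targets."""
    info = {
        "machine": platform.machine(),        # CPU architecture as reported by the OS
        "python": platform.python_version(),  # interpreter version, e.g. "3.10.12"
        "cuda": None,                         # filled in only if nvcc can be queried
    }

    # Assumption: the CUDA toolkit is installed and nvcc is discoverable on PATH.
    nvcc = shutil.which("nvcc")
    if nvcc:
        try:
            result = subprocess.run(
                [nvcc, "--version"],
                capture_output=True,
                text=True,
                timeout=10,
            )
            # Assumption: the output contains a "release <major>.<minor>" string.
            match = re.search(r"release (\d+\.\d+)", result.stdout)
            if match:
                info["cuda"] = match.group(1)
        except (OSError, subprocess.SubprocessError):
            pass  # leave "cuda" as None if nvcc cannot be run

    return info


if __name__ == "__main__":
    for key, value in collect_build_info().items():
        print(f"{key}: {value}")
```

Posting the three printed lines along with the device model should be enough to see which CUDA / Python combinations are most common.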