Baye Dieng
Baye Dieng
**Describe the bug** Something seems to be wrong with the implementation of maxpool2d's backends. When testing against torch tensors I am getting different shapes. The burn backends aren't outputting the...
This project would have a significant speed boost on mac if the demucs model were to use mps instead of cpu in a similar way that the windows binary uses...
Ran Phi 2 model with the metal features enabled and seems to hang with about 7% of GPU usage from Activity monitor. This seems to be recent as it ran...
This PR adds support for Llama.cpp and closes #167.
This PR is meant to add LLama CPP Support using the ggml inference engine. For the sake of simplicity, this PR will take Q8(8-bit) format GGUF Files, implementing the LLAMA...