Junbum Lee
Junbum Lee
Currently I'm investigating on it. plan to make it compatible with Bitsandbytes. Since I'm not familiar with CUDA or C++, it'll takes some time, so currently I could not guarantee...
> you way want to check out https://github.com/IST-DASLab/qmoe. They created some custom cuda functions for sub 1-bit weights. Thanks for information! I'm looking on it 😉
Thanks for notice! Could you provide the colab code you tried to run? it would be helpful to check the issue😄
No specific reason, if the result is same, you could use it with hf causal experimental as default. I'm currently investing if the results are same or not.