Liger-Kernel
Liger-Kernel copied to clipboard
[feat] Add jamba support
🚀 The feature, motivation and pitch
model code here -- https://github.com/huggingface/transformers/blob/main/src/transformers/models/jamba/modeling_jamba.py
might be interesting to see how is a triton implementation of mixer forward compared to existing cuda forward too 🤔
Alternatives
No response
Additional context
No response