mistral.rs
mistral.rs copied to clipboard
Add topk scalings, topk softmax scalings for X-LoRA
This is currently pending on some way to do topk in Candle.
Refs https://github.com/huggingface/candle/pull/2132