Eric Buehler
Hello all and thanks for your great work here! I am integrating support for [pyo3_special_method_derive](https://crates.io/crates/pyo3_special_method_derive) into mistral.rs, but there is a linking error. The linking error seems to only occur...
- [x] Loader and model
- [ ] ISQ
- [ ] AnyMoE
- [ ] Device Mapping
- [ ] X-LoRA/LoRA
- [ ] Adapter activation
This is a tracking issue for the development of AnyMoE, which will be broken up into several PRs.

- [x] Core functionality, plain models, all APIs: #476
- [x] Support...
This PR adds GPTQ quantization ([paper here](https://arxiv.org/abs/2210.17323)) support. Refs: #418, #448.
Because #492 ensures that ISQ now happens on the CPU, we can try to run ISQ in parallel for Metal, too. Tagging: @Luddo183, this refs #393.
```rust
use mistralrs::{device, load_normal_model};

fn main() -> anyhow::Result<()> {
    let dev = Device::cuda_if_available(0)?;
    let runner = load_normal_model!(
        id = "mistralai/Mistral-7B-Instruct-v0.1".to_string(),
        kind = Mistral,
        device = dev,
        use_flash_attn = false
    );
    ...
```