mistral.rs
mistral.rs copied to clipboard
mistral does not support NVIDIA V100 (compute_cap <= 800)
Describe the bug it does not support some old hardware.
Can it just convert bfloat16 to float16 before loading model. just like vllm is doing?