Nicolas PASCAL

Results 34 issues of Nicolas PASCAL

This ensures broader accessibility to Mistral.rs across different hardware configurations by enabling its use even on systems with a CUDA compute capability less than 8.0.

The previous implementation of `to_dtype` used a large `match` statement. This commit replaces the `match` with calls to specialized `as_` methods, which contain highly optimized conversion logic. This significantly improves...

tested with PR https://github.com/EricLBuehler/candle/pull/57 and works with: ``` nvidia-smi --query-gpu="compute_cap" --format=csv compute_cap 6.1 ``` ``` cargo run -F "cuda cudnn" -r -- --throughput -i plain -m meta-llama/Llama-3.2-1B-Instruct --dtype bf16 cargo...

tested and works with: ``` nvidia-smi --query-gpu="compute_cap" --format=csv compute_cap 6.1 ``` ``` cargo run -F "cuda,cudnn" --example llama --release -- --model-id meta-llama/Llama-3.2-1B-Instruct --temperature 0.1 --which v32-1b-instruct --seed 42 --dtype bf16...