luke comments

Results 64 comments of


                                            luke

Candle vs. PyTorch performance

> I added a simple matrix multiplication and addition calculation: https://github.com/passchaos/ml-bench/blob/main/rust-ml/src/main.rs#L44, but it takes much longer than PyTorch. I wonder if it's a problem with my usage. > > framework...

Fix LayerNorm gradient flow issue

great. it would be awesome to have more training code examples and workflows with candle

[DRAFT] Add `candle-utils` helper crate for common patterns

this sorta thing is great. i wonder if there is enough people with merge authority to handle this inside of the candle repo...

Short context length on Qwen quantized examples.

I thought it might be quantized thing. So running the same test on the Non-Quantized Qwen 3 8B i get the same error.

Windows Cuda Compatibility?

compilation errors are sometimes because missing libraries in the path. do you have the proper coda paths set.

Windows Cuda Compatibility?

i have these in debian 12 ``` export CUDA_ROOT=/usr/local/cuda export CUDA_PATH=/usr/local/cuda export CUDA_TOOLKIT_ROOT_DIR=/usr/local/cuda export CUDA_HOME=/usr/local/cuda export LD_LIBRARY_PATH=/usr/lib/x86_64-linux-gnu/nvidia/current ```

Windows Cuda Compatibility?

adding the bin to to the cudnn helped me run the program but the problem is that compiled app still needs the cudnn to dynamically link... isnt there someway like...

Windows Cuda Compatibility?

for deployment i needed to bundle cuda and cudnn with my app.

Implement the Gemma 3 vision models!

when i run the 4b and 12b like ``` cargo run --example gemma_vision --release --features cuda,cudnn -- --prompt "describe this image" --which v3_4b --image /home/pro/Desktop/sd_final.png ``` it downloads the models...

Implement the Gemma 3 vision models!

is this working? its been sitting here for a while would be great to have Gemma vision models in candle especially if the work is done..?