candle icon indicating copy to clipboard operation
candle copied to clipboard

Minimalist ML framework for Rust

Results 407 candle issues
Sort by recently updated
recently updated
newest added

Has anyone considered adapting `llama_multiprocess` to run on multiple machines instead of multiple processes? I've started by using the `SystemCommunicator` from `rsmpi` library to replace `nccl::Comm`, but the debugging seems...

Hello, In flash-attn, the logsumexp of the softmax is not output. But it would be nice if it could be output too as it is necessary to compute long context...

In _candle-examples/src/lib.rs_ the function _load_image_ returns (channels, height, width), but the function _load_image_and_resize_ returns (channels, width, height): ```rust let data = Tensor::from_vec(data, (height, width, 3), &Device::Cpu)?.permute((2, 0, 1))?; ``` and...

I ended up on [the documentation page for Linear](https://docs.rs/candle-nn/latest/candle_nn/linear/struct.Linear.html) which surprisingly didn't have any documentation on what that layer does. Only after reading the code did I see that the...

Hi, I have been reviewing DINOv2 Candle code and I noticed most likely a bug (unless I misunderstood the code). As far as I understand, the function **interpolate_pos_encoding()** is used...

Hello! Are there any plans on implementing quantized-t5 models on CUDA devices? I'm looking for a couple of days to find the solution or implement a CUDA support for https://github.com/huggingface/candle/blob/main/candle-examples/examples/quantized-t5/main.rs...

Running this code multiplying a 784x100 matrix times a 100x10 matrix seems to leak memory. The memory usage gradually increases to more than 5 gigabytes when running with the metal...

We are working with the stable diffusion example. How do we select which GPU device on our system to use for the rendering? thanks.

Running the command: ```cargo run --example mamba --release --features metal -- --prompt "Tell me a joke please" --dtype f16``` does not work. The problem seems to lie in code: ```rs...

I had great success using MKL for some models, especially BERT-likes with huge improvements in speed (up to x25). Here the speedup is only of about 1 second. However, I'm...