candle issues

MetaVoice-1B: fix degradation compared to Python version

7

The MetaVoice-1B model has significant degradation compared to the Python version. I believe one of the main causes is using a 64x smaller decoder model (instead of multiband diffusion and...

vatsalaggarwal

support for json (or other?) grammar?

2

llama.cpp now supports grammars: https://til.simonwillison.net/llms/llama-cpp-python-grammars Is that something that will come to candle? It sounds like the approach taken in this python library would be straight forward: https://github.com/1rgs/jsonformer/blob/main/jsonformer/main.py Basically, since...

kurtbuilds

[help] Run gemma example using the command line on my mac, got some runtime errors.

2

When I run gemma example using the command line on my mac, I get the following error. It looks like this is a compilation error with some package. runtime errors:...

bofen97

Issue Labelling for Good First Issues

It would be awesome to have some issue labelling for new contributors to get a start.

PatStiles

question: what GPU can run the mixtral example?

12

I found the mixtral example in this repo, and try to run it on A100 80GB, but the default Mixtral-8x7B-v0.1 runs out of memory. I was curious what GPU can...

zwpaper

Flash Attention not working on CUDA 12.1

12

I'm trying to use Flash Attention on an environment with CUDA 12.1 but it fails to compile. Is it expected? Reproducing: 1. Start a Docker container with CUDA version 12.1.1...

hugoabonizio

Can't loop over model implementation based off examples more than N times (7-20+ it ends up breaking)

12

Hi I have these errors and my code tries to even retry but there seems to be an issue when I try to continuously use the gemma and mistral examples...

groovybits

Extend `GradStore` public functionality

5

This PR allows user to merge two `GradStore`s together (and also to create an empty one). It is helpful for collecting gradients from multiple different backward passes (e.g. with different...

agerasev

Tensor copy from noncontiguous tensor still make noncontiguous tensor

2

while pytoch's clone return contiguous tensor candle ```rust #[test] fn test_copy1() -> candle::Result { let device = candle::Device::Cpu; let test = Tensor::zeros((2, 10), DType::U32, &device)?; let test1 = test.narrow(1, 0,...

yinqiwen

bert attention mask

2

lz1998

candle
candle copied to clipboard

Metadata

MetaVoice-1B: fix degradation compared to Python version

support for json (or other?) grammar?

[help] Run gemma example using the command line on my mac, got some runtime errors.

Issue Labelling for Good First Issues

question: what GPU can run the mixtral example?

Flash Attention not working on CUDA 12.1

Can't loop over model implementation based off examples more than N times (7-20+ it ends up breaking)

Extend `GradStore` public functionality

Tensor copy from noncontiguous tensor still make noncontiguous tensor

bert attention mask

← Metadata

Owner

Metadata

candle candle copied to clipboard

Metadata

← Metadata

Owner

Metadata

candle
candle copied to clipboard