Andrei-Aksionov issues

Results: 32 issues of Andrei-Aksionov

Support for CUDA kernels

## 🚀 Feature Hi there 👋 From the main readme file I noticed that `Thunder` accepts custom kernels, but only the ones that are written in Triton. Is there a...

enhancement

help wanted

Gemma: A study of the effect of the new issues

[Daniel Han](https://twitter.com/danielhanchen) in his [blogpost](https://unsloth.ai/blog/gemma-bugs) shared his discoveries of what is ~also~ wrong with the Gemma implementation. Some of them are only applicable to Keras, some of them affect PyTorch...

Drop interleave placement in QKV matrix

Hi there 👋 This PR changes the placement of `query`, `key` and `value` weights in the combined QKV matrix. Intuitively one might assume that weights in the QKV matrix are combined...

breaking change
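
As a rough illustration of what "interleaved" versus non-interleaved placement can mean for a combined QKV matrix, here is a minimal sketch. It is not code from the PR: the function names are hypothetical, each string stands for one head's block of weight rows, and one key/value head per query head is assumed.

```python
def interleaved_layout(n_heads: int) -> list[str]:
    """Blocks ordered per head: q0, k0, v0, q1, k1, v1, ... (assumed layout)."""
    return [f"{name}{h}" for h in range(n_heads) for name in ("q", "k", "v")]


def grouped_layout(n_heads: int) -> list[str]:
    """Blocks ordered per projection: all queries, then all keys, then all values."""
    return [f"{name}{h}" for name in ("q", "k", "v") for h in range(n_heads)]


def interleaved_to_grouped(blocks: list) -> list:
    """Reorder blocks from the interleaved layout into the grouped one."""
    # In the interleaved layout every third block, starting at offset 0,
    # is a query; offset 1 is a key; offset 2 is a value.
    return blocks[0::3] + blocks[1::3] + blocks[2::3]


if __name__ == "__main__":
    print(interleaved_layout(2))
    print(grouped_layout(2))
```

A reordering like `interleaved_to_grouped` is the kind of one-off conversion that checkpoints saved in the old layout would need, which is presumably why such a change is labelled as breaking.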

Half-Quadratic Quantization `HQQ`

Hi there 👋 I came across a new quantization technique called Half-Quadratic Quantization. Blog: https://mobiusml.github.io/hqq_blog/ GitHub: https://github.com/mobiusml/hqq/tree/master This approach performs quantization to [1|2|3|4|8]bit precision, but doesn't require a calibration process (like...

question

quantization

AutoGPTQ integration

LoRA: dequantize model

LoRA: `zero_pad` speed improvements

Specify `UTF-8` encoding for every `open` function

Kata machine in Python

Format logging output in a form of a table