dfdx issues

Add QoL Error when dimensions between layers don't match

36

If, by accident, you mistype the dimensions of layers in your `model`, the compiler throws you an error at `model.forward_mut()` which does not hint at all to the actual error...

kstavro

Padding type for Conv2d

2

PyTorch supports an optional parameter for 2D convolutions called `padding_mode`, which defaults to `zeros`, but can also be `reflect`, `replicate` or `circular`. This changes what values it uses for the...

Cifram

Add OpenCL device

10

As brought up in reddit thread, a OpenCL device would be useful to support folks with AMD gpus. Here are roughly the tasks that need to happen: 1. [ ]...

coreylowman

new feature

help wanted

gpu

expert

CUDA Graphs

2

We should consider whether it is possible and desired to automatically combine kernels into CUDA graphs to reduce overhead of calling individual kernels. Here is the relevant documentation: - https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#cuda-graphs...

ViliamVadocz

gpu

optimization

Rotate and Mirror Operations

2

For data augmentation, it's very useful to have operations to mirror and rotate a tensor. See PyTorch's `flip` and `rot90` methods.

Cifram

new feature

expert

Attention mask in TransformerDecoderBlock?

11

Hi! I've been trying to porting nanoGPT to Rust with dfdx. The `transformer` module is awesome! but it seems an important trick is missing, which is the attention mask in...

ifsheldon

new feature

Support for 4 bit Quantization

3

Language model progress has been rapid recently and with the llama weights being released, so much progress is being made on the c++ side https://github.com/ggerganov/llama.cpp I see that fp16 is...

vikigenius

ideation

Benchmarking

1

For the purposed of effectively reducing the size of real time models, finding inefficiencies in the library, and testing modifications to the library, there should be a type of "benchmarking"...

opfromthestart

ideation

Ability to one_hot_encode 2d arrays/vecs of usizes

3

Currently only support 1d arrays/vecs

coreylowman

new feature

Learning Rate Scheduler

5

It would be really useful to have a learning rate scheduler trait to implement various schedules. The trait could be as general as something like: ```rust trait OptimScheduler { fn...

jafioti

dfdx
dfdx copied to clipboard

Metadata

Add QoL Error when dimensions between layers don't match

Padding type for Conv2d

Add OpenCL device

CUDA Graphs

Rotate and Mirror Operations

Attention mask in TransformerDecoderBlock?

Support for 4 bit Quantization

Benchmarking

Ability to one_hot_encode 2d arrays/vecs of usizes

Learning Rate Scheduler

← Metadata

Owner

Metadata

dfdx dfdx copied to clipboard

Metadata

← Metadata

Owner

Metadata

dfdx
dfdx copied to clipboard