horde-ad issues

Merge `ranked` and `shaped` (as in `ADReady`) into `tensor` indexed by `[Maybe Nat]`

13

This is @awf's idea that would improve the end user API. The `Nothing` would signify that the shape may differ at runtime. Cognitive load would be reduced (no more `ranked`...

Mikolaj

help wanted

Think about parallel execution of our mapAccums and their derivatives (associative operations?)

When the implementation of fold, scan, mapAccum and their derivatives settles down, let's think about parallel execution. A GPU backend to actually benchmark the result wouldn't hurt either. Tom provides...

Mikolaj

help wanted

postpone

Unpack r in Delta datatypes via backpack, class or otherwise

1

This is low priority, because it's going to be a few (dozen) percent speedup (and a slight space saving and improved storage locality, which may be more important in real...

Mikolaj

help wanted

performance

postpone

Get rid of AllowAmbiguousTypes

1

Edsko says: "I think AllowAmbiguousTypes is almost never the right solution; the main problem is that it gives the user no indication at all as to which type variables might...

Mikolaj

postpone

Generate random Tensor (and Boolean and Integral) programs

1

@tomsmeding said: > I wonder if what you need here is generating random programs :p And indeed I need it. Dimensions only 1, 2 or 3, ranks up to 5,...

Mikolaj

help wanted

Investigate "‘p0’ is untouchable" that derails type reconstruction

Here ```hs nestedGather :: forall r. ADReady r => TensorOf 2 r -> TensorOf 2 r nestedGather t = tgather (2 :$ 2 :$ ZS) (tgather (2 :$ 3 :$...

Mikolaj

help wanted

Voice recognition (fixes #58)

4

Implements #58. An attempt to recognize to which person a voice belongs in a given window of a sound file. Uses RNN. The tests to run are `cabal test extremelyLongTest...

Mikolaj

Implement checkpointing

Try to implement checkpointing (inserting recomputation to trade-off computation vs memory use) and then automatic checkpointing, which is what pytorch/JAX users now reportedly need and can't get. We have an...

Mikolaj

postpone

Try equality saturation for our simplifcation/vectorization rewriting system

11

This is the hammer and it may just succeed: https://github.com/alt-romes/hegg

Mikolaj

help wanted

[Feature] I can write code with large tensors, and the derivative runs as fast as PyTorch on GPU

11

Desiderata: - [ ] MNIST example with MatMul only - [ ] MNIST example with convolutions - [ ] ... - [ ] GPT-3 on 64 GPUs Supposedly, this can...

Mikolaj

enhancement

help wanted

horde-ad
horde-ad copied to clipboard

Metadata

Merge `ranked` and `shaped` (as in `ADReady`) into `tensor` indexed by `[Maybe Nat]`

Think about parallel execution of our mapAccums and their derivatives (associative operations?)

Unpack r in Delta datatypes via backpack, class or otherwise

Get rid of AllowAmbiguousTypes

Generate random Tensor (and Boolean and Integral) programs

Investigate "‘p0’ is untouchable" that derails type reconstruction

Voice recognition (fixes #58)

Implement checkpointing

Try equality saturation for our simplifcation/vectorization rewriting system

[Feature] I can write code with large tensors, and the derivative runs as fast as PyTorch on GPU

← Metadata

Owner

Metadata

horde-ad horde-ad copied to clipboard

Metadata

← Metadata

Owner

Metadata

horde-ad
horde-ad copied to clipboard