ChainRules.jl Make `OneElement` more GPU friendly

Open ToucheSir opened this issue 1 year ago • 0 comments

Ref. https://github.com/FluxML/Flux.jl/pull/2368. I see a couple of possibly complementary ways to go about this. The easiest would be to define an Adapt rule for `OneElement` so that it is materialized, or substituted with some GPU-friendly equivalent, when run through `CUDA.cu`. The other would be to define overloads for certain functions, such as `mul!`, that can take advantage of the sparsity.
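To make the two options concrete, here is a rough sketch using only the `LinearAlgebra` standard library. `SingleEntryMatrix` and `materialize` are hypothetical names invented for illustration; `SingleEntryMatrix` is a simplified 2-D stand-in and not the actual `OneElement` type from ChainRules, whose fields and constructor may differ. An actual Adapt rule would presumably do something like `materialize` inside an `Adapt.adapt_structure` method, but that wiring is an assumption and is not shown here.

```julia
using LinearAlgebra

# Hypothetical stand-in for `OneElement`: a matrix that is zero everywhere
# except for the value `val` at position (i, j).
struct SingleEntryMatrix{T} <: AbstractMatrix{T}
    val::T
    i::Int
    j::Int
    dims::Tuple{Int,Int}
end

Base.size(A::SingleEntryMatrix) = A.dims
Base.getindex(A::SingleEntryMatrix{T}, r::Int, c::Int) where {T} =
    (r == A.i && c == A.j) ? A.val : zero(T)

# Option 1 (sketch): materialize into a dense array. The dense result is an
# ordinary `Array`, which avoids scalar indexing into the lazy wrapper once
# it has been moved to the device.
function materialize(A::SingleEntryMatrix{T}) where {T}
    out = zeros(T, A.dims)
    out[A.i, A.j] = A.val
    return out
end

# Option 2 (sketch): exploit the sparsity in `mul!`. For C = A * B, only
# row `i` of C is nonzero, and it equals `val` times row `j` of B.
function LinearAlgebra.mul!(C::AbstractMatrix, A::SingleEntryMatrix, B::AbstractMatrix)
    fill!(C, zero(eltype(C)))
    @views C[A.i, :] .= A.val .* B[A.j, :]
    return C
end
```

Both paths rely only on `fill!`, views, and broadcasting, which are the kind of whole-array operations that GPU array types generally support. Whether materializing or overloading is preferable likely depends on how often the array is reused after being moved to the device.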

Jan 08 '24 04:01 ToucheSir
