jumerckx
jumerckx
When I move a BitArray to the gpu from within a gradient call, I get the following error. ```julia using CUDA, Zygote function f(a, b) sum(a .* b) end a...
As suggested by @MikeInnes in #69, I'm making a pull request with a seq2seq tutorial in literate.jl format. I'm hoping someone could have a look at this model since performance...
This code fails when using `CUDADevice()`. When using `CPU()` however, everything works just fine. Michael Abbott identified this to be a problem with the broacasting, replacing these with `map` resolves...
Running this code gives an error, when setting `CUDA.allowscalar(false)` it does show scalar operations are being done. On cpu everything works as expected. ```julia using CUDA, KernelAbstractions, Tullio, Flux f(x)...
Running the [matmul example from the KernelGradients package](https://github.com/JuliaGPU/KernelAbstractions.jl/blob/master/lib/KernelGradients/test/matmul.jl) using CUDA throws an error. As far as I can tell it's nothing because of the specific kernel as a simple copy-kernel...
The [specialized backward-pass for softmax](https://github.com/FluxML/NNlibCUDA.jl/blob/06ccd9f5b0fa6d3bfc9c9d52dbf865a78d76a576/src/cudnn/softmax.jl#L77) takes a lot longer than the generic implementation from NNlib.jl. The effect seems especially pronounced when the batch-dimension is larger. [Here's](https://gist.github.com/jumerckx/082db491760052afb2f376153097e4ba) the code to reproduce...
*edit: I added a smaller MWE in the [comment below](https://github.com/llvm/Polygeist/issues/368#issuecomment-1834714323).* Hi, I'm banging my head against the wall trying to figure out why the second `memref.load` and `memref.store` in this...
**EDIT:** I reduced this error further to: ```julia using CassetteOverlay @MethodTable MyTable mypass = @overlaypass MyTable mypass() do UnionAll(TypeVar(:T,Integer), Array{TypeVar(:T,Integer)}) end ``` ``` ERROR: `ccall` requires the compiler Stacktrace: [1]...
Following code throws an error: ```jl using CassetteOverlay @MethodTable MyTable mypass = @overlaypass MyTable mypass() do print(Ptr{Nothing}(0)) end ``` ``` ERROR: type DataType has no field body Stacktrace: [1] ##MyTable#218...
fixes https://github.com/JuliaLang/julia/issues/56193 I imagine this maybe needs a test but not sure how to properly unit test this without creating a full end-to-end test using `Base.code_ircode` and so on.