denizyuret
denizyuret
Compiling our kernels into a shared library and using them with ccall's may not be the most efficient method. Check out packages under JuliaGPU (CUDANative.jl etc.) for more kernel launching....
This does not work: ``` function relu(x::T; max_value=Inf, negative_slope=0, threshold=0) where T (x >= max_value ? oftype(x, max_value) : x >= threshold ? x : negative_slope == 0 ? zero(T)...
See https://github.com/denizyuret/Knet.jl/issues/410
- [x] broadcast of user defined functions not supported: #101 - [x] Solve outstanding bugs and issues. - [ ] Review and merge pull requests. #54 #57 - [ ]...
@CarloLucibello I am responding to your comments here to make the discussion easier: > can we avoid exporting 2 letters names to avoid conflicts and improve code readability? I need...
``` julia> x 0.22919536f0 julia> besseli(2,x) 0.006595105469050567 ```
There are two mean pooling operators in cudnn, one that includes and one that excludes the padded values. The current meanpool operation only supports including the padded values, we should...
I did not find this in other issues, if not elsewhere should add it to our todo list. For a description of grouped convolutions see: https://towardsdatascience.com/a-comprehensive-introduction-to-different-types-of-convolutions-in-deep-learning-669281e58215
This took me a while to figure out today, while debugging the new CUSPARSE package: ``` ERROR: LoadError: MethodError: `convert` has no method matching convert(::Type{CUDArt.CudaArray{T,N}}, ::Type{Int32}, ::Tuple{Int32}) This may have...
Is there any progress on making gc sensitive to remaining gpu memory? The following example still fails with an out-of-memory error. It works if you uncomment the manual gc() line....