Julian Samaroo issues

Results 172 issues of


                                            Julian Samaroo

Remove GCN lower_throw_extra! pass

gcn

[WIP] Add support for ROCm

This is nowhere near ready to go yet, but I wanted to get this posted since things are progressing well for AMDGPU support overall :slightly_smiling_face: TODO: - [x] Add synchronization...

After https://github.com/JuliaParallel/Dagger.jl/pull/223 gets merged, Dagger's eager API (`Dagger.@spawn`) should be suitable for use in packages. I would recommend we use it for non-lazy computations in FileTrees so that we can...

enhancement

Don't assume the model is on a CUDA device

Currently, `DaggerChain` communicates to Dagger that the wrapped model is located on a CUDA GPU, which is not necessarily true (and shouldn't be a requirement anyway). We should provide functions...

bug

Parallelize tests

This uses Distributed to parallelize the tests, in the hopes of having CI jobs which don't take >3hrs to run. Todo: - [ ] Pass output through tmpfile to remove...

tests

tests: Execute GPUArrays testsuite later

Moves the GPUArrays testsuite to run after we've tested our ROCm libraries (rocBLAS et. al). In the absence of a parallel test runner, and with total test time being over...

arrays

tests

Mem.alloc: Allow using hipMalloc to service allocations

Some libraries, like rocSPARSE, call HIP functions which expect to be passed allocations generated from `hipMalloc` and friends. Because `hipMalloc` just ends up calling HSA allocation functions, we should be...

bug

enhancement

hip

needs docs

logging

Julian Samaroo

Remove GCN lower_throw_extra! pass

[WIP] Add support for ROCm

Use Dagger's eager API

Don't assume the model is on a CUDA device

Parallelize tests

tests: Execute GPUArrays testsuite later

Mem.alloc: Allow using hipMalloc to service allocations

`getinfo` should determine the `Ref` output container automatically

Improve exception reporting

Add timespan logging via TimespanLogging.jl