libcudacxx
libcudacxx copied to clipboard
`cuda::atomic::fetch_add/sub` should support float/double
The cuda::atomic<T> extension type should support atomic addition/subtraction on floating point types.
What's the ETA for this?
This has been done some time ago.