Graham Markall

Results 776 comments of Graham Markall

In the long term the changes will be incorporated into Numba's CUDA target, so that pynvjitlink is not needed for patching the Numba linker.

@ed-o-saurus Sorry I forgot to follow up earlier - the functionality is available in numba-cuda, but changes will not be backported to the built-in target - there is some explanation...

I'd been meaning to look into Enzyme, as it also seems to support CUDA-specific techniques. @adam-urbanczyk have you any thoughts or experiments in the direction of AD support for Numba...

I think this is an area none of the Numba maintainers have any significant amount of experience with, so it'll be difficult for us to push forward on this without...

> @gmarkall do I understand it correctly that Numba does not use CUDA-Streams, or CUDA-Graphs for its CUDA support? Apologies for the delay in my reply here. It doesn't use...

> @gmarkall coming back to the original question, is the description good enough? Do you need more details? I guess so - as my experience in this area is extremely...

> Hi :) Giving an updated talk next week at EuroAD, will gladly post the slides after the talk Slides appear to be here: https://www.autodiff.org/Docs/euroad/25th%20EuroAd%20Workshop%20-%20Ludger%20Paehler%20-%20Numba-Enzyme:%20Differentiable%20JIT%208217%20d%20Python.pdf

I'm afraid there is no update at the moment.

A belated thanks for this PR - rather than attempting to go our own way in Numba CUDA (to avoid duplication of effort / minimise the amount of additional bespoke...

After merging master into this branch there is a small conflict which is trivial to resolve, and the added tests of `np.tile` pass - @AndrewEckart would you like to merge...