Leo Fang comments

Results: 1175 comments by Leo Fang

EPIC: Path finder for CUDA components

cc @cryos for vis (since you're also working on wheels)

EPIC: Path finder for CUDA components

Tracking more relevant links to code that we want to offer an alternative for:
- https://github.com/NVIDIA/nvmath-python/blob/073b168ac0688fa3b84caaa8bb65948bf7db7eae/nvmath/bindings/_internal/cusparse_windows.pyx#L295-L324
- https://pypi.org/project/cupti-python/

EPIC: Path finder for CUDA components

> The consequence is that any prior art that relies on checking if a certain piece exists (e.g. nvcc, cuda.h, nvvm, ...) and generalizing it to assume the whole Toolkit...

EPIC: Path finder for CUDA components

The bulk of the work is done now. Let me close this issue; the remaining tasks can be tracked individually with the `cuda.pathfinder` label.

[BUG] Provide CUDA implementation of FFT

Hi, CUDA Python tech lead here 👋 cuFFT has an official Python solution, which is exposed as part of [nvmath-python](https://github.com/nvidia/nvmath-python), the home for NVIDIA CPU & GPU math libraries. It should...

[BUG] Provide CUDA implementation of FFT

Hi @Maalvi14, glad to know! I just checked: Google Colab still offers free T4 GPUs (it is not the default runner, which is CPU-only, so you need to do...

[BUG] Provide CUDA implementation of FFT

brev.nvidia.com has a wide variety of GPU instances, including T4, but I am not sure if we offer free credits. If Colab does not work for you, I can ask...

[BUG] Declaring a named cpdef enum also populates the module namespace

We've also moved away from `cpdef enum` for the same reason. The pollution of the module namespace is too much from a tab-completion perspective.
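To make the namespace concern concrete, here is a minimal pure-Python sketch of the effect. It is an analogy only, not Cython's actual implementation: `fake_ext`, `Color`, and its members are hypothetical names, and the `setattr` loop merely mimics the behavior the issue title describes.

```python
import enum
import types

# Hypothetical stand-in for a compiled extension module.
mod = types.ModuleType("fake_ext")


class Color(enum.IntEnum):
    RED = 0
    GREEN = 1


# Scoped style: only the enum class itself is exported.
mod.Color = Color
scoped_names = sorted(n for n in vars(mod) if not n.startswith("_"))

# Polluting style (assumed analogy to a named cpdef enum): every member
# is additionally copied to module level, so each one shows up in
# dir(module) and in tab completion.
for member in Color:
    setattr(mod, member.name, member)
polluted_names = sorted(n for n in vars(mod) if not n.startswith("_"))

print(scoped_names)
print(polluted_names)
```

With a real binding module that wraps hundreds of enum values, the second listing grows by one public name per member, which is what makes tab completion noisy.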

Nested Functions for jit.rawkernel

IIRC this is about writing a device function inside a kernel body, something like

```python
@jit.rawkernel()
def my_kernel(...):
    def my_dev_func(...):
        ...
    ...
    my_dev_func(...)
    ...
```

Is this what you need?...

Make `eig/eigvals` always return complex eigenvalues

Returning complex values is the right move. Instead of viewing this discussion as introducing a breaking change, I argue this is a NumPy bug (value-dependent casting) that should have been...
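As an illustration of what "value-dependent casting" means here, the following is a pure-Python sketch for 2x2 matrices. Both helper functions are hypothetical and are not NumPy's implementation; they only contrast a return type that depends on the input values with one that is always complex.

```python
import cmath
import math


def eigvals_2x2_value_dependent(a, b, c, d):
    """Eigenvalues of [[a, b], [c, d]]; the return type depends on the values."""
    tr = a + d
    det = a * d - b * c
    disc = tr * tr - 4 * det
    if disc >= 0:
        # Real spectrum: plain floats come back.
        r = math.sqrt(disc)
        return [(tr - r) / 2, (tr + r) / 2]
    # Complex spectrum: the same call now returns complex numbers.
    r = cmath.sqrt(disc)
    return [(tr - r) / 2, (tr + r) / 2]


def eigvals_2x2_always_complex(a, b, c, d):
    """Same eigenvalues, but the return type never depends on the values."""
    tr = a + d
    det = a * d - b * c
    r = cmath.sqrt(tr * tr - 4 * det)
    return [complex((tr - r) / 2), complex((tr + r) / 2)]


diag = (2.0, 0.0, 0.0, 3.0)       # real eigenvalues 2 and 3
rotation = (0.0, -1.0, 1.0, 0.0)  # eigenvalues -1j and +1j

print([type(v).__name__ for v in eigvals_2x2_value_dependent(*diag)])
print([type(v).__name__ for v in eigvals_2x2_value_dependent(*rotation)])
print([type(v).__name__ for v in eigvals_2x2_always_complex(*diag)])
```

In the value-dependent variant, a caller cannot know the result type from the input types alone, so a small perturbation of the matrix entries can change the type that downstream code receives.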