
CUDA integration for Python, plus shiny features

Results: 94 pycuda issues, sorted by recently updated

**Is your feature request related to a problem? Please describe.** We recently identified a (GPU) memory leak in a routine that creates a new cuda stream on a given context...

enhancement
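For context, here is a minimal sketch of the kind of routine the report describes, assuming an explicitly managed context with a stream created inside it; the loop count and the cleanup calls are illustrative, not taken from the report:

```python
import pycuda.driver as cuda

cuda.init()
ctx = cuda.Device(0).make_context()   # pushes a new context onto this thread
try:
    for _ in range(100):
        stream = cuda.Stream()        # created in the currently active context
        # ... enqueue asynchronous work on `stream` ...
        stream.synchronize()
        del stream                    # drop the reference so the handle can be released
finally:
    ctx.pop()                         # unbind before the context object goes away
    ctx.detach()                      # release our reference to the context
```

Dropping the Python reference to the stream and popping/detaching the context is the usual cleanup path; the issue is about streams created this way still holding GPU memory.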

Here's an MWE:

```python
>>> import numpy as np
>>> import pycuda.autoinit
>>> import pycuda.gpuarray as gpuarray
>>> a = np.array(True)
>>> b = np.array(False)
>>> a_gpu = gpuarray.to_gpu(a)
>>> ...
```

bug

Hello, in order to test Python 3.11 with pycuda, I have just installed the latest pycuda version: 2022.1. I use CUDA 11.6.2 on a Windows 11 laptop. When I try...

bug

Here's an MWE:

```python
>>> import pycuda.autoinit
>>> import pycuda.gpuarray as cu_np
>>> a = cu_np.zeros(10, dtype="int32") + 1
>>> b = cu_np.zeros(10, dtype="int32") + 2
>>> a / b
...
```

bug

Hi there! I wanted to experiment with CUDA Graphs a bit to get a feel for the performance differences between blocking, async and graph execution. See: * https://developer.nvidia.com/blog/cuda-graphs/ * https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#cuda-graphs...
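For reference, a minimal async-stream baseline of the sort such a comparison would start from; the kernel, sizes, and names below are illustrative, and the graph-capture path itself is omitted because its availability depends on the installed PyCUDA/CUDA versions:

```python
import numpy as np
import pycuda.autoinit
import pycuda.driver as drv
from pycuda.compiler import SourceModule

# Illustrative kernel: scale a float array in place.
mod = SourceModule("""
__global__ void scale(float *x, float s, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) x[i] *= s;
}
""")
scale = mod.get_function("scale")

n = 1 << 20
host = drv.pagelocked_empty(n, dtype=np.float32)  # pinned memory for async copies
host[:] = 1.0
dev = drv.mem_alloc(host.nbytes)

stream = drv.Stream()
drv.memcpy_htod_async(dev, host, stream)          # H2D copy on the stream
scale(dev, np.float32(2.0), np.int32(n),
      block=(256, 1, 1), grid=((n + 255) // 256, 1),
      stream=stream)                              # kernel launch on the same stream
drv.memcpy_dtoh_async(host, dev, stream)          # D2H copy on the stream
stream.synchronize()                              # single sync point at the end
```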

Basically, I want to achieve concurrent work with multithreading; my current inference code is pycuda + TensorRT. **Why I want to do so:** I'm trying to optimize the inference...
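A common pattern for this kind of setup is to create one context and have each worker thread push it before doing GPU work and pop it afterwards. The sketch below shows that pattern only; `run_inference` is a placeholder, and the actual TensorRT integration from the issue is not shown:

```python
import threading
import pycuda.driver as cuda

cuda.init()
ctx = cuda.Device(0).make_context()   # created (and pushed) on the main thread
ctx.pop()                             # leave it unbound so workers can push it

def worker(run_inference, inputs):
    ctx.push()                        # bind the shared context to this thread
    try:
        run_inference(inputs)         # placeholder for the pycuda/TensorRT work
    finally:
        ctx.pop()                     # always unbind, even on error

threads = [threading.Thread(target=worker, args=(print, i)) for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

ctx.detach()                          # release the context when all work is done
```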

Solves https://github.com/inducer/pycuda/issues/319. I'm not exactly sure my work is proper, since I'm more of a Python person than a C++ one. Can you share your thoughts, @inducer, please?

People like me always forget to pop the context, which causes errors. By implementing these two methods, Python will handle the context automatically. See https://book.pythontips.com/en/latest/context_managers.html. This also helps when there...
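To illustrate the idea (a sketch of the concept, not the actual patch): a small wrapper whose `__enter__`/`__exit__` methods push and pop the context, so forgetting to pop becomes impossible inside a `with` block.

```python
import pycuda.driver as cuda

class ManagedContext:
    """Illustrative wrapper: enters by making a context, exits by popping it."""

    def __init__(self, device_ordinal=0):
        self._device_ordinal = device_ordinal
        self._ctx = None

    def __enter__(self):
        cuda.init()
        self._ctx = cuda.Device(self._device_ordinal).make_context()
        return self._ctx

    def __exit__(self, exc_type, exc_value, traceback):
        self._ctx.pop()      # the pop happens even if the body raised
        self._ctx.detach()
        return False         # do not swallow exceptions

# Usage:
# with ManagedContext(0) as ctx:
#     ...  # allocate memory, launch kernels
```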

I have created a Streamlit app as a demo of a project on Multilingual Text Classification using mBERT in PyTorch. When I run the app with the command `python...

**Describe the bug** I want to initialize as many cuda contexts as possible in a multi-threaded environment, but when cuda.Device(0).make_context() throws an exception, the GPU memory allocated by make_context cannot...

bug
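The usual defensive pattern around this looks roughly like the sketch below: any context that was successfully created is always popped and detached, even on error. This does not address the case the report is about, where make_context() itself fails partway; the device count, thread count, and error handling here are illustrative.

```python
import threading
import pycuda.driver as cuda

cuda.init()

def make_context_worker(results, index):
    ctx = None
    try:
        ctx = cuda.Device(0).make_context()   # may raise, e.g. on out-of-memory
        results[index] = "ok"
    except cuda.Error as err:
        results[index] = repr(err)
    finally:
        if ctx is not None:
            ctx.pop()
            ctx.detach()                      # release whatever was created

results = {}
threads = [threading.Thread(target=make_context_worker, args=(results, i))
           for i in range(8)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(results)
```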