numba-dpex issues

[testing] Implement cli tool to generate llvm IR for kernels without running them

1

We want something like `dpexcli compile -m -n -o ` to generate the LLVM IR of a function without running it. TODO: - think about how to pass argument types

ZzEeKkAa

enhancement
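The command line proposed in the CLI issue above could be prototyped with `argparse`. This is only a sketch under stated assumptions: the issue does not say what `-m`, `-n`, and `-o` stand for, so reading them as module, function name, and output path is a guess, and `build_parser` is a hypothetical helper. It parses arguments only and does not compile anything.

```python
import argparse


def build_parser():
    """Build a hypothetical parser for `dpexcli compile -m ... -n ... -o ...`."""
    parser = argparse.ArgumentParser(prog="dpexcli")
    subcommands = parser.add_subparsers(dest="command", required=True)

    compile_cmd = subcommands.add_parser(
        "compile", help="emit LLVM IR for a kernel without running it"
    )
    # ASSUMPTION: the meanings of -m, -n and -o are not given in the issue.
    compile_cmd.add_argument("-m", dest="module", required=True,
                             help="module that defines the kernel (assumed)")
    compile_cmd.add_argument("-n", dest="name", required=True,
                             help="name of the kernel function (assumed)")
    compile_cmd.add_argument("-o", dest="output", required=True,
                             help="file to write the LLVM IR to (assumed)")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.command, args.module, args.name, args.output)
```

How argument types would be passed is the open TODO in the issue and is deliberately left out of this sketch.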

Support `@guvectorize`

Support for `@guvectorize` is missing. Features:
- [ ] Passing intra-device arrays
- [ ] Launch asynchronous
- [ ] Calling Device Functions
- [ ] Explicitly control the maximum...

PokhodenkoSA

enhancement

`@reduce` decorator

1

We currently do not have anything similar to `@cuda.reduce` and the output of this step should be a design to support a similar `@reduce` decorator for `numba-dppy`. Features to implement:...

PokhodenkoSA

enhancement

Implement support for types complex, bool, None, tuple on the kernel

Built-in types:
- [x] complex
- [ ] bool
- [ ] None
- [ ] tuple

1e-to

enhancement

Support for Calling Device Functions from Ufuncs

3

dpex device functions cannot be called from dpex ufunc kernels. Example: ```python @dppy.func def a_device_function(a): return a + 1 @vectorize(nopython=True) def ufunc_kernel(x, y): return a_device_function(x) + y def test_ufunc():...

PokhodenkoSA

enhancement

Investigating IGC generated kernel binary and dump info

3

When `IGC_ShaderDumpEnable` is set to “1”, IGC writes a number of dumps into /tmp/IntelIGC.
```bash
$ export IGC_ShaderDumpEnable=1
```
To read the DWARF of a kernel, we first need a copy of...

1e-to

debug
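The IGC dump workflow in the issue above can also be driven from Python. The sketch below assumes only what the issue states (the `IGC_ShaderDumpEnable` variable and the `/tmp/IntelIGC` directory); the helper names `igc_dump_env` and `list_dump_files` are hypothetical, and the contents of the dump directory are not described in the issue.

```python
import os
from pathlib import Path


def igc_dump_env(base_env=None):
    """Return a copy of the environment with IGC shader dumps enabled."""
    env = dict(os.environ if base_env is None else base_env)
    env["IGC_ShaderDumpEnable"] = "1"
    return env


def list_dump_files(dump_dir="/tmp/IntelIGC"):
    """Return every file under the dump directory, sorted by path."""
    root = Path(dump_dir)
    if not root.is_dir():
        return []  # nothing has been dumped yet
    return sorted(p for p in root.rglob("*") if p.is_file())
```

The returned environment could be passed as `env=` to `subprocess.run` when launching the script that compiles the kernel, after which `list_dump_files()` shows what was written.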

Is the --spirv-debug-info-version=ocl-100 needed for debugging.

1

@akharche @reazulhoque The "--spirv-debug-info-version=ocl-100" flag in `spirv_generator.generate` is not used anywhere. Is the flag essential for us to support GDB? If not, the dead code should be removed. _Originally posted...

diptorupd

debug

Cannot write into a dpctl.tensor view with a dpex.kernel

The following does not work: ```python import numba_dpex as dpex from numba import float32 import dpctl import dpnp import numpy as np @dpex.kernel def kernel(array_i): i = dpex.get_global_id(0) array[i] =...

fcharras

user

On loop unrolling

3

The following snippet highlights differences regarding loop unrolling between `numba` and `numba_dpex`. Regular `numba` [will try to unroll loops](https://numba.pydata.org/numba-doc/latest/user/faq.html#why-my-loop-is-not-vectorized) but the same behavior is not seen with `numba_dpex`, as it...

fcharras

enhancement

user

numpy sum operator (axis) isn't supported

1

I ran into this issue when implementing l2_norm in dpebench. Here is the code:
```
@nb.njit(parallel=False, fastmath=True)
def l2_norm(a, d):
    sq = np.square(a)
    sum = sq.sum(axis=1)
    d[:] = np.sqrt(sum)
```
Here...

mingjie-intel

enhancement
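One way to sidestep the unsupported `sum(axis=1)` call in the `l2_norm` issue above is to write the row-wise reduction as explicit loops. The sketch below is plain Python so that it runs without numba; the name `l2_norm_rows` is hypothetical, and whether this form compiles under numba-dpex is an assumption that has not been checked here.

```python
import math


def l2_norm_rows(a, d):
    """Write the L2 norm of each row of `a` into the preallocated `d`."""
    n_rows = len(a)
    n_cols = len(a[0]) if n_rows else 0
    for i in range(n_rows):
        acc = 0.0
        for j in range(n_cols):
            acc += a[i][j] * a[i][j]  # accumulate squares along axis 1
        d[i] = math.sqrt(acc)


if __name__ == "__main__":
    out = [0.0, 0.0]
    l2_norm_rows([[3.0, 4.0], [6.0, 8.0]], out)
    print(out)  # [5.0, 10.0]
```

The indexing `a[i][j]` works for nested lists and for 2-D NumPy arrays alike, so the same function can be compared against the original `np.sqrt(np.square(a).sum(axis=1))` on the host.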
