SYCLomatic
SYCLomatic supports only some of the functions in https://docs.nvidia.com/cuda/cuda-math-api/group__CUDA__MATH__INTRINSIC__SIMD.html. The unsupported functions are listed below. Thanks.
```
DPCT1007:0: Migration of __viaddmax_s16x2 is not supported.
DPCT1007:1: Migration of __viaddmax_s16x2_relu is not supported....
```
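A manual workaround sketch for one of these intrinsics, assuming the packed s16x2 layout described in the CUDA Math API page above; the helper name `viaddmax_s16x2` is my own and the sketch does not handle overflow of the per-lane addition:
```
#include <algorithm>
#include <cstdint>

// Hypothetical helper emulating __viaddmax_s16x2: for each of the two signed
// 16-bit lanes packed in a 32-bit word, compute max(a + b, c).
inline uint32_t viaddmax_s16x2(uint32_t a, uint32_t b, uint32_t c) {
  uint32_t r = 0;
  for (int lane = 0; lane < 2; ++lane) {
    int av = static_cast<int16_t>(a >> (16 * lane));
    int bv = static_cast<int16_t>(b >> (16 * lane));
    int cv = static_cast<int16_t>(c >> (16 * lane));
    int m  = std::max(av + bv, cv);
    r |= (static_cast<uint32_t>(m) & 0xFFFFu) << (16 * lane);
  }
  return r;
}
```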
Consider the following case in a DeepSpeed kernel: a `__global__` function template with a parameter pack.
```
template <typename T, typename U, typename... ArgTypes>
__global__ void multi_tensor_apply_kernel(
    int chunk_size, volatile int* noop_flag, T tl, U callable, ArgTypes... args) {...
```
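One possible way to express such a kernel in SYCL, as a sketch only (this is not SYCLomatic's actual output): the pack is captured by value and forwarded into the `parallel_for` lambda. The wrapper name, the range sizes, the plain `int*` flag, and the extra `nd_item` argument passed to `callable` are all assumptions of this sketch.
```
#include <sycl/sycl.hpp>

template <typename T, typename U, typename... ArgTypes>
void multi_tensor_apply(sycl::queue &q, int chunk_size, int *noop_flag,
                        T tl, U callable, ArgTypes... args) {
  // Forward the parameter pack into the kernel lambda; 320 work-groups of
  // 512 work-items are illustrative values.
  q.parallel_for(sycl::nd_range<1>(sycl::range<1>(320 * 512),
                                   sycl::range<1>(512)),
                 [=](sycl::nd_item<1> item) {
                   callable(chunk_size, noop_flag, tl, item, args...);
                 });
}
```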
Add Migration of CUB Block Exchange API. cc @yihanwg
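For context, a sketch of the CUDA-side pattern this request covers; the block size, items per thread, and kernel name are illustrative:
```
#include <cub/cub.cuh>

// Striped-to-blocked exchange across a thread block using cub::BlockExchange.
__global__ void exchange_kernel(int *d_data) {
  using BlockExchange = cub::BlockExchange<int, 128, 4>;
  __shared__ typename BlockExchange::TempStorage temp_storage;

  int thread_data[4];
  cub::LoadDirectStriped<128>(threadIdx.x, d_data, thread_data);
  BlockExchange(temp_storage).StripedToBlocked(thread_data, thread_data);
  cub::StoreDirectBlocked(threadIdx.x, d_data, thread_data);
}
```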
The bfloat16 class has been non-experimental for a while now, supporting all backends: https://github.com/oneapi-src/SYCLomatic/pull/1286 However, SYCLomatic appears not to be using it, and instead always casts to float,...
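A minimal sketch of using the class directly, assuming DPC++'s `sycl::ext::oneapi::bfloat16` and USM device pointers; the function and parameter names are mine:
```
#include <sycl/sycl.hpp>
#include <cstddef>

using bf16 = sycl::ext::oneapi::bfloat16;

// Scale a bfloat16 array in place, using the extension's operator* instead
// of converting every element to float in source.
void scale(sycl::queue &q, bf16 *data, std::size_t n, float factor) {
  bf16 f{factor};
  q.parallel_for(sycl::range<1>(n), [=](sycl::id<1> i) {
    data[i] = data[i] * f;
  }).wait();
}
```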
Add Migration of cub::store API. Linked to #1819. cc @yihanwg, @zhimingwang36
The prerequisites section of the README for the SYCLomatic repo doesn't call out the CUDA headers dependency or the supported versions (https://github.com/oneapi-src/SYCLomatic#prerequisites). The post-migration SYCL can be targeted for non-Intel...
Migrating a function from CUDA to SYCL with DPCT produces an incomplete result (e.g. the kernel name is missing). Please see the following code snippets from the program (https://github.com/zjin-lcf/HeCBench/blob/master/ssim-cuda/utils.h). Could...
Please see the example https://docs.nvidia.com/deeplearning/nccl/user-guide/docs/examples.html#example-1-single-process-single-thread-multiple-devices
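Condensed from that example, the CUDA/NCCL pattern whose migration is being requested: one process drives several GPUs and issues an allreduce across them inside a group call. The fixed-size communicator array and the lack of error checking are simplifications of this sketch.
```
#include <nccl.h>
#include <cuda_runtime.h>

void allreduce_multi_device(int nDev, float **sendbuff, float **recvbuff,
                            size_t count, cudaStream_t *streams) {
  ncclComm_t comms[8];                    // assumes nDev <= 8
  ncclCommInitAll(comms, nDev, nullptr);  // nullptr: use devices 0..nDev-1

  ncclGroupStart();
  for (int i = 0; i < nDev; ++i)
    ncclAllReduce(sendbuff[i], recvbuff[i], count, ncclFloat, ncclSum,
                  comms[i], streams[i]);
  ncclGroupEnd();

  for (int i = 0; i < nDev; ++i) {
    cudaSetDevice(i);
    cudaStreamSynchronize(streams[i]);
    ncclCommDestroy(comms[i]);
  }
}
```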
**Is your feature request related to a problem? Please describe** SYCLomatic translates memory marked with `__constant__` in CUDA into just a standard read-only SYCL buffer/accessor. This means that...
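For reference, the kind of CUDA pattern in question; the symbol name and sizes are illustrative:
```
#include <cuda_runtime.h>

// Small lookup table in __constant__ memory, filled from the host with
// cudaMemcpyToSymbol and read by every thread.
__constant__ float coeffs[16];

__global__ void apply(const float *in, float *out, int n) {
  int i = blockIdx.x * blockDim.x + threadIdx.x;
  if (i < n) out[i] = in[i] * coeffs[i % 16];
}

void upload(const float *host_coeffs) {
  cudaMemcpyToSymbol(coeffs, host_coeffs, 16 * sizeof(float));
}
```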