jinge90 comments

Results 30 comments of


                                            jinge90

[SYCL] Add __imf_rcp64h to intel math libdevice

Hi, @tfzhu Could you help check whether the pre-ci failure in Jenkins/Precommit is a infrastructure issue? Thanks very much.

[SYCL] Add __imf_rcp64h to intel math libdevice

> Hello @jinge90 > > ```cuda-c++ > #include > #include > #include > #include > > extern "C" > __device__ > double __nv_rcp64h(double a); > > __device__ > void calculate(const...

[SYCL] Add __imf_rcp64h to intel math libdevice

> @jinge90 I believe it flushes denormals to zero in source and destination and utilizes a slightly different table. > > 1-bit differences are unavoidable: > > ``` > >...

[SYCL] Add __imf_rcp64h to intel math libdevice

> @jinge90 rcp64h provides initial value for division algorithm to work. Sometimes such algorithms are implemented as table-lookup. > > ``` > 0x1.08d3e00000000p-1 > > > real math infinite prec....

[SYCL] Add __imf_rcp64h to intel math libdevice

> I suggest to even out with FTZ/DAZ behavior. I wrote this quickly to test new behavior. > > ```c++ > void emulate(const double* a, double* y) { > uint64_t...

[SYCL] Add __imf_rcp64h to intel math libdevice

Hi, @intel/dpcpp-tools-reviewers , @intel/llvm-reviewers-runtime and @aelovikov-intel Could you help review this patch? Thanks very much.

[SYCL] Add __imf_rcp64h to intel math libdevice

Hi, @intel/dpcpp-tools-reviewers Could you help review this patch? Thanks very much.

[SYCL] Add __imf_rcp64h to intel math libdevice

Hi, @intel/dpcpp-tools-reviewers Kind ping~. Thanks very much.

[SYCL] Add __imf_rcp64h to intel math libdevice

Hi, @intel/dpcpp-tools-reviewers Kind ping~. Thanks very much.

[SYCL] Remove aspect-ext_oneapi_bfloat16_math_functions

Hi, @steffenlarsen and @JackAKirk Yes, for "non-cuda" targets, we just use generic fp32 math functions to implement these bf16 functions, they can run on any device. Thanks very much.