jinge90

Results 7 issues of jinge90

Signed-off-by: gejin

follow through

Some deep learning frameworks use '__nv_rcp64h' in their CUDA backends. We need to provide equivalent functionality in the DPC++ compiler.
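
For context, here is a rough host-side sketch of what such an equivalent might compute. It assumes that '__nv_rcp64h' follows the PTX description of rcp.approx.ftz.f64: only the upper 32 bits of the input are used, and the lower 32 bits of the result are zero. The name rcp64h_sketch is hypothetical and is not the DPC++ implementation. The sketch uses an exact division rather than the hardware approximation and does not flush subnormals to zero, so it will not match the CUDA result bit for bit.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper (not part of DPC++ or CUDA): a coarse reciprocal
// that keeps only the upper 32 bits of both the input and the result.
double rcp64h_sketch(double x) {
    const std::uint64_t kHighMask = 0xFFFFFFFF00000000ULL;

    // Reinterpret the double as raw bits and drop the lower 32 bits.
    std::uint64_t bits;
    std::memcpy(&bits, &x, sizeof bits);
    bits &= kHighMask;

    double truncated;
    std::memcpy(&truncated, &bits, sizeof truncated);

    // Assumption: an exact division stands in for the hardware approximation.
    double r = 1.0 / truncated;

    // Zero the lower 32 bits of the result.
    std::memcpy(&bits, &r, sizeof bits);
    bits &= kHighMask;
    std::memcpy(&r, &bits, sizeof r);
    return r;
}
```

A result like this is typically used as the starting guess for Newton-Raphson refinement in a full-precision division or reciprocal.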

Adds new SIMD emulation functions.