DeepSpeed
DeepSpeed copied to clipboard
[ROCm] Hip headers fix
This PR is to
- Hipify cg header in apply_rotary_pos_emb.cu
- Exclude cuda_bf16.h on ROCm
cc: @jithunnair-amd