DylanChen-NV
Results
2
issues of
DylanChen-NV
Fix the issue that kv_head_num becomes 0 when cp_size * tp_size > kv_head_num for MQA. Refine Ulysses code in AttentionOp
Add support for fp8 kv cache on blackwell