cccl icon indicating copy to clipboard operation
cccl copied to clipboard

[EPIC]: Add Hopper features to `cuda::ptx`

Open ahendriksen opened this issue 1 year ago • 1 comments

Is this a duplicate?

  • [X] I confirmed there appear to be no duplicate issues for this request and that I agree to the Code of Conduct

Area

libcu++

Is your feature request related to a problem? Please describe.

Many Hopper features are not yet exposed through cuda::ptx.

Describe the solution you'd like

Add PTX instructions to cuda::ptx.

Describe alternatives you've considered

No response

Additional context

No response

### Tasks
- [ ] https://github.com/NVIDIA/cccl/issues/673
- [ ] https://github.com/NVIDIA/cccl/issues/1079
- [ ] https://github.com/NVIDIA/cccl/issues/1077
- [ ] https://github.com/NVIDIA/cccl/issues/659
- [ ] https://github.com/NVIDIA/cccl/issues/1339
- [ ] https://github.com/NVIDIA/cccl/issues/1343
- [ ] https://github.com/NVIDIA/cccl/issues/1344
- [ ] https://github.com/NVIDIA/cccl/issues/1350
- [ ] https://github.com/NVIDIA/cccl/issues/1398
- [ ] https://github.com/NVIDIA/cccl/issues/1399
- [ ] https://github.com/NVIDIA/cccl/issues/1400
- [ ] https://github.com/NVIDIA/cccl/issues/1401
- [ ] https://github.com/NVIDIA/cccl/issues/1402
- [ ] https://github.com/NVIDIA/cccl/issues/1365
- [ ] https://github.com/NVIDIA/cccl/issues/1439
- [ ] https://github.com/NVIDIA/cccl/issues/1440
- [ ] https://github.com/NVIDIA/cccl/issues/1536
- [ ] https://github.com/NVIDIA/cccl/issues/1444
- [ ] [FEA]: Add `cuda::ptx::setmaxnreg`
- [ ] [FEA]: Add `cuda::ptx::cp_async_bulk_prefetch`
- [ ] [FEA]: Add `cuda::ptx::cp_async_bulk_prefetch_tensor`
- [ ] [FEA]: Add `cuda::ptx::multimem_*`
- [ ] [FEA]: Add `cuda::ptx::prefetch_tensormap`
- [ ] [FEA]: Add `cuda::ptx::griddepcontrol`
- [ ] [FEA]: Add `cuda::ptx::mbarrier_complete_tx`
- [ ] [FEA]: Add `cuda::ptx::mbarrier_expect_tx`
- [ ] [FEA]: Add `cuda::ptx::stmatrix`
- [ ] [FEA]: Add `cuda::ptx::atom` (cluster scope)
- [ ] [FEA]: Add `cuda::ptx::red` (cluster scope)
- [ ] https://github.com/NVIDIA/cccl/issues/1490

ahendriksen avatar Feb 06 '24 11:02 ahendriksen

Removing cuda::ptx::mapa again due to bug. See #1442 #1414

ahendriksen avatar Feb 27 '24 14:02 ahendriksen