[Docs/PTX] Add device tensor map init example
Description
closes https://github.com/NVIDIA/cccl/issues/1982
Adds documentation for modifying and initializing a tensor map on device. Also improves the navigation and table of contents of the cuda::ptx docs.
Checklist
- [x] New or existing tests cover these changes.
- [x] The documentation is up to date with these changes.
Please don't merge yet. I still have to incorporate some internal feedback.
In the meantime, the code example for on-device tensor map modification has made it into the CUDA programming guide. Instead of duplicating the documentation and code sample, I have linked to the relevant section in the programming guide. The improvements to the table of contents and layout are still worth keeping in this PR.
I have copied over the fixed links from Bryan van de Ven's PR.
This pull request requires additional validation before any workflows can run on NVIDIA's runners.
/ok to test