Peter Park
Peter Park
AMD Instinct MI300-series accelerators and third-party GPUs such as the Matrox G200 have an issue impacting video output. The issue was reproduced on a Dell server model PowerEdge XE9680. Servers...
Processor affinity is a critical setting to ensure that ROCm helper threads run on the correct cores. By default, ROCm helper threads are spawned on all available cores, ignoring the...
ROCm provides an optional package – `rocm-llvm-alt` – that provides a closed-source compiler for users interested in additional closed-source CPU optimizations. This feature is not functional in the ROCm 6.2.0...
- [x] Quick find and replace in `docs/` - [ ] Proofread - [ ] Verify names in UI and commands in docs Related: #428
This PR: - Updates conf.py to non-legacy rocm-docs-core setup - Injects Omnitrace's version number into the download command for omnitrace-installer.py (quick-start.rst) - Fixes an issue causing the download url (`https://github.com/ROCm/omnitrace/releases/latest/download/omnitrace-install.py`)...
On systems running Linux kernel 6.8.0, such as Ubuntu 24.04, Direct Memory Access (DMA) transfers between the GPU and NIC are disabled and impacts multi-node RCCL performance. This issue was...
Perfetto can no longer open [Omnitrace](https://rocm.docs.amd.com/projects/omnitrace/en/docs-6.2.1) proto files. Loading Perfetto trace output `.proto` files in the latest version of [ui.perfetto.dev](ui.perfetto.dev) can result in a dialog with the message, "Oops, something...
This PR updates the documentation with info about the MI300 series ## Performance model ### L1 - Update L1 cache line size to 128B for MI300 (https://advanced-micro-devices-demo--446.com.readthedocs.build/projects/rocprofiler-compute/en/446/conceptual/vector-l1-cache.html#l1-cache-line-size) (64B for MI200)....