MIOpen icon indicating copy to clipboard operation
MIOpen copied to clipboard

AMD's Machine Intelligence Library

Results 202 MIOpen issues
Sort by recently updated
recently updated
newest added

[Issue] For some cluster environments, $HOME are accessible to log-in nodes but not compute nodes [Proposal] - [ ] 1. emit warning messages when $HOME/.cache/miopen is not accessible - [x]...

enhancement
value_middle
complexity_low
urgency_normal
documentation

I guess it's time to find out if these W/A are still necessary in 4.3.1 (which we currently run CI on) and disable them. This is the key point for...

value_high
quality

l can't find the corresponding API to `cudnnTransformTensor()` in miopen library, can anyone please help me?

quality
urgency_low
documentation

Originated from https://github.com/ROCmSoftwarePlatform/MIOpen/pull/1230#discussion_r737878197 (see the whole thread). Synopsis: > As far as I see, for PerformanceConfigAsmImplicitGemmGTCFwdXdlopsNHWC we only need to store table index and `gemm_k_global_split`. The rest of data can...

value_high
quality
urgency_normal

- rocm 4.0.0 - gfx906 ```bash export MIOPEN_FIND_MODE=1 export MIOPEN_LOG_LEVEL=7 export MIOPEN_DEBUG_CONV_DIRECT=1 export MIOPEN_DEBUG_CONV_WINOGRAD=0 export MIOPEN_DEBUG_AMD_FUSED_WINOGRAD=0 export MIOPEN_DEBUG_CONV_FFT=0 export MIOPEN_DEBUG_CONV_GEMM=0 export MIOPEN_DEBUG_CONV_SCGEMM=0 export MIOPEN_DEBUG_CONV_IMPLICIT_GEMM=0 ./MIOpenDriver CBAInferfp16 -W 4 -H 4...

bug
urgency_normal
correctness

Bf16 and Fp16 Full tests fail on gfx90a when library is built with COMGR. This prevents from switching fully to COMGR builds by default. Some tests PASS with hip-clang, FAIL...

non-miopen-bug
workaround
urgency_low

There were times when HIP compilation didn't work in embedded MIOpen, because precompiled HIP headers weren't available. https://github.com/ROCmSoftwarePlatform/MIOpen/blob/5cb2e548131d777d589917db8a0d8669b3ae8c5d/CMakeLists.txt#L132-L135 This workaround leads to performance drop of embedded MIOpen. We should remove...

urgency_unknown
performance
workaround

As reported in [SWDEV-304151](https://ontrack-internal.amd.com/browse/SWDEV-304151) when tuning using MIGraphX as the driver framework, a large number (more than 10k) of file handles are created and never released. One of the possible...

bug
value_middle
non-miopen-bug
quality
complexity_high
urgency_normal

If the user doesn't have the MIOpen kernels package installed, they see a message such as below: `MIOpen(HIP): Warning [SQLiteBase] Unable to read system database file:gfx906_60.kdb Performance may degrade` This...

documentation

https://github.com/ROCmSoftwarePlatform/MIOpen/blob/4e61a3ebdfe2b07b4d331cc46832eb4a6b49941c/src/solver.cpp#L264 Can we introduce some dedicated primitive (e.g. `Fused`) for fused convolutions and use it in the registry? This would help to avoid hacks like this: https://github.com/ROCmSoftwarePlatform/MIOpen/pull/1212/files/db05b058f77cc838f5bbafc93fe08fd8e20c713d..f26346d12ba5f2ee472d26d7e6c207325bbe6233#r723662796

enhancement
value_middle
quality