AMDMIGraphX
AMD's graph optimization engine.
We are going to enable f16 for reduce in this rocMLIR PR: https://github.com/ROCm/rocMLIR/pull/1722. However, the code here assumes bf16 and fp8 are supported, which is not the case.
Fixes #2991
This PR adds the `MIGRAPHX_ENABLE_TUNING_WARNINGS` environment flag. Based on this flag, it warns or errors out if the saved tuning information has a different version and architecture setup compared to the machine on...
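A rough sketch of the kind of check described above, not the PR's actual code. The flag values (`"0"` silent, `"1"` warn, `"2"` error), the `version` and `arch` keys, and the `check_tuning_info` helper are all hypothetical and chosen only for illustration.

```python
import os

# Hypothetical metadata keys compared between saved tuning data and the host.
CHECKED_KEYS = ("version", "arch")


def check_tuning_info(saved, current, environ=os.environ):
    """Return mismatch messages between saved and current tuning metadata.

    Assumed policy (not taken from the PR): flag unset or "0" ignores
    mismatches, "2" raises an error, any other value reports warnings.
    """
    level = environ.get("MIGRAPHX_ENABLE_TUNING_WARNINGS", "0")
    if level == "0":
        return []

    messages = [
        f"tuning {key} mismatch: saved={saved.get(key)!r}, current={current.get(key)!r}"
        for key in CHECKED_KEYS
        if saved.get(key) != current.get(key)
    ]
    if messages and level == "2":
        raise RuntimeError("; ".join(messages))
    return messages
```

A caller would log the returned messages as warnings. Passing `environ` explicitly keeps the check easy to exercise without touching the real process environment.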
Related PR: https://github.com/ROCm/AMDMIGraphX/pull/3595
There's a push to have Docker images pushed to and pulled from a private repo for CI. Today, GitHub Actions can build a Docker image to be used for builds, but that is published...
Generalize the work that efficiently supports w4a16 to also support w8a16 (weights in an 8-bit format, e.g. int8, eventually fp8; activations in fp16, eventually also bf16). "Efficiently" here means that...
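To make the w8a16 naming concrete, here is a minimal sketch of weight-only quantization: weights are stored as int8 with a single scale, and activations stay in fp16. It assumes symmetric per-tensor quantization and is not how MIGraphX implements it; the helper names are hypothetical.

```python
import struct


def to_fp16(x):
    """Round a Python float to the nearest IEEE half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def quantize_int8(weights):
    """Symmetric per-tensor quantization: returns (int8 values, scale)."""
    max_abs = max(abs(w) for w in weights) or 1.0  # avoid a zero scale
    scale = max_abs / 127.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def w8a16_dot(q_weights, scale, activations):
    """Dot product with int8 weights dequantized on the fly and fp16 activations."""
    acc = 0.0
    for q, a in zip(q_weights, activations):
        acc += (q * scale) * to_fp16(a)
    return to_fp16(acc)


q, scale = quantize_int8([0.5, -1.0, 0.25])
result = w8a16_dot(q, scale, [1.0, 2.0, 4.0])  # close to the exact value -0.5
```

The result differs slightly from the full-precision dot product because of the int8 rounding; a real kernel would typically use per-channel or per-group scales to reduce that error.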
`hipblaslt_ext::getAllAlgos` returns algos, which can then be validated using `hipblaslt_ext::matmulIsAlgoSupported`. For some cases, all the algos returned are invalid. This seems to be an issue that the hipBLASLt team needs to...
I followed the steps from https://rocm.docs.amd.com/projects/radeon/en/latest/docs/install/native_linux/install-migraphx.html and installed Radeon software for WSL with ROCm and PyTorch, strictly following those steps. Everything works fine, and I started trying to...