AMDMIGraphX
AMDMIGraphX copied to clipboard
AMD's graph optimization engine.
Fixes #3172.
Fixes https://github.com/ROCm/AMDMIGraphX/issues/2556 RMSNorm is used in LLMs like Llama2. Currently the fp16 version can overflow during normalization. This change try to addess it byconvert normalization to fp32.
amdclang++/amdclang can be invoked without providing absolute path, since update-alternatives command has been run for these binaries This will also help to remove the hard coded paths in cmake target...
Changes to MatMul parser to handle the Microsoft Contrib operator MatMulintegarToFloat Since we have the scale and zero points in our operands we can just perform a multiplied after int8...
Needed to be commented out by #3340 - [ ] Uncomment and investigate - [ ] Fix code that causes this if able - [ ] determine if test stops...
Disabled tidy check in #3340 - [ ] Uncomment and investigate - [ ] Fix code if able - [ ] reneable check
Investigate and cleanup tidy errors that arose via #3340 with bugprone-empty-catch - [ ] Uncomment this in CmakeLists.txt - [ ] determine if false positive or requires fix - [...