AMDMIGraphX
AMDMIGraphX copied to clipboard
AMD's graph optimization engine.
This PR fixes these two issues when using the default stream on the GPU: i.e. via `MIGRAPHX_ENABLE_NULL_STREAM=1`: 1. Trying to `verify` on a model, (resnet50v2 here) gives this exception: `context.hpp:365:...
Target is 6.4.1 May require an additional changes to support accuracy
New headers hipamd/include/hip/amd_detail/amd_hip_bf16.h and hipamd/include/hip/amd_detail/amd_hip_fp8.h should be in the latest ROCm release? Will we be incorporating these?
Motivation: https://github.com/ROCm/MIOpen/pull/3515
Implemented an example for https://huggingface.co/amd/Llama-2-7b-chat-hf-awq-int4-asym-gs128-onnx/tree/main Llama2 7b model with MIGraphX. Details about the example and description for running is available in README (https://github.com/ROCm/AMDMIGraphX/tree/htec/mgx-llama2-7b-example/examples/transformers/mgx_llama2)
The old logic stepped through the args list in order, then assumed the first float-type input was the "Scales". This led to an error in an obscure case where the...