AMDMIGraphX issues

Add TurnkeyML Hooks for MIGraphX

Follow-up to the scoping work in #2584: add support for MIGraphX to TurnkeyML. Leverage the existing TensorRT route and allow us to run an inference with MIGraphX through...

TedThemistokleous

onnxruntime

TurnkeyML

Icebox

Unsupported operator: com.microsoft.FusedMatMul

- MIGraphX commit: [352dcea](https://github.com/ROCm/AMDMIGraphX/commit/352dcea2c6a03c495a6ba8667e19811bc5d1399b)
- GPU: MI210 (gfx90a)
- ROCm version: 5.7.0

Model(s) to reproduce:

- https://github.com/gyulaz-htec/models/blob/migraphx_testing/text/machine_comprehension/bert-squad/model/bertsquad-12-int8.tar.gz

Fails at: `/code/AMDMIGraphX/src/onnx/onnx_parser.cpp:419: parse_graph: Unknown operator: FusedMatMul`

Note: More info at [ContribOperators.md/com.microsoft.FusedMatMul](https://github.com/microsoft/onnxruntime/blob/main/docs/ContribOperators.md#com.microsoft.FusedMatMul)
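For orientation, the missing operator can be sketched as a plain reference function. This is a minimal illustration, not MIGraphX parser code: it assumes the semantics described in the linked ContribOperators page (a matrix product scaled by `alpha`, with optional transposes of `A` and `B`), covers only the 2-D case, and ignores the `transBatchA` and `transBatchB` attributes. The names `fused_matmul` and `transpose` are hypothetical.

```python
def transpose(m):
    """Return the transpose of a 2-D list-of-lists matrix."""
    return [list(row) for row in zip(*m)]


def fused_matmul(a, b, alpha=1.0, trans_a=False, trans_b=False):
    """Reference sketch: alpha * (op(a) @ op(b)) for 2-D inputs only.

    Assumption: op(x) is x transposed when the matching flag is set,
    mirroring the transA/transB attributes. Batched inputs and the
    transBatchA/transBatchB attributes are not handled here.
    """
    if trans_a:
        a = transpose(a)
    if trans_b:
        b = transpose(b)
    if len(a[0]) != len(b):
        raise ValueError("inner dimensions do not match")
    b_cols = transpose(b)  # iterate over columns of b as rows
    return [
        [alpha * sum(x * y for x, y in zip(row, col)) for col in b_cols]
        for row in a
    ]
```

A parser-side fix would presumably map the operator onto existing transpose, dot, and scale operations in the same order; that mapping is an assumption and is not confirmed by the issue.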

gyulaz-htec

Drop is seen in torch_migx_quantized's model due to volatility in scores for multiple runs

ahsan-ca

Performance drops of ~6% to ~23% are observed in AMDMIGraphX models

ahsan-ca

Use dynamic shapes for kv-cache

There are two parts to this:

1. Use dynamic shapes to handle the past sequence length.
2. Similar to dynamic batching, we can make two different submodules for each GQA...

pfultz2

enhancement

split multiple of the same dynamic dimension for dynamic batch

An example case is a square image, where the height and width are the same but dynamic.

CharlieL7

enhancement

Error out of the migraphx-driver when --batch is wrongly used

`--batch` should only be used when there is only one input and it is the first dimension. Otherwise, error out with a message directing the user to use input dims. The...
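The requested check can be sketched as follows. This is a hypothetical illustration in Python, not the driver's actual C++ code: the function name `apply_batch`, the dictionary representation of parameter shapes, and the error text are all assumptions, and "it is the first dimension" is read here as "the batch size replaces the first dimension of the single input".

```python
def apply_batch(param_shapes, batch):
    """Apply a batch size to a model's only input, or fail loudly.

    param_shapes: dict mapping input name -> list of dimensions
                  (hypothetical representation of the model parameters).
    batch:        requested batch size.
    """
    if len(param_shapes) != 1:
        raise ValueError(
            "--batch is only valid for models with exactly one input; "
            "set the input dims explicitly instead"
        )
    (name, dims), = param_shapes.items()
    if not dims:
        raise ValueError(
            f"input '{name}' has no dimensions to batch; "
            "set the input dims explicitly instead"
        )
    # Replace only the leading dimension; the rest are left untouched.
    return {name: [batch] + list(dims[1:])}
```

For example, `apply_batch({"x": [1, 3, 224, 224]}, 8)` returns `{"x": [8, 3, 224, 224]}`, while a model with two inputs raises an error that points the user to explicit input dims.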

amd-mwu10004

Update `pass_manager` to handle updating the cached shape when submodule shapes change

* Extend code in `pass_manager.cpp:run_passes(prog, root_mod, passes, trace)` to handle updating the cached shape from `op.compute_shape(shape_inputs, mod_inputs)` when `mod_inputs` shape changes.
* Currently, the cached shape is not updated when...

CharlieL7

enhancement

Cleanup

Make python API examples for dynamic batch

We currently only have a C++ API example for using dynamic batch. We need to make an example using the Python API and dynamic batch.

CharlieL7

C/C++ API and Python API dynamic batch with offload_copy = false

* Both APIs need to be updated to allow for dynamic batch to be used with user-allocated device memory
* Related to #1720 and #1734

CharlieL7

enhancement
