AMDMIGraphX issues

Flash decoding round 1

3

First round in addressing #4334 ## Motivation rocMLIR has implemented split kv and GQA, which enables us to implement flash decoding. Now we need to add this to the migraphx...

bdevorem

Create op. builders (3.) (MatMul)

4

The whole purpose of the op-builders is to factor out the main functionalities behind the current onnx parsers and put them in a separate set of files (under the folder...

gchinora

Disable fp32 geg

3

## Motivation MLIR doesnt support fp32 GEG fusion on navi. ## Technical Details Disable GEG fusion for fp32, and enable GEG in jenkins CI. ## Changelog Category - - [...

pfultz2

## Motivation * Part of https://github.com/ROCm/AMDMIGraphX-internal/issues/149 ## Technical Details * Requires https://github.com/ROCm/rocMLIR/tree/packFp4 from rocMLIR to work on MI350. * This will pass CI since CI doesn't run on a MI350...

CharlieL7

roadmap

disable matching for dynamic shapes

4

## Motivation disable matchers by default for dynamic shape graphs ## Technical Details Updates based on comments in #4347 Changes need to be applied on top of #4316 ## Changelog...

shivadbhavsar

Implement flash decoding

2

Implement flash decoding as described here: https://pytorch.org/blog/flash-decoding/ We have attention operators grouped like this: ``` Q -> [B, M, k] K -> [B, k, N] V -> [B, N, D]...

pfultz2

migraphx triage guide

2

## Motivation ## Technical Details ## Changelog Category - - [ ] Added: New functionality. - - [ ] Changed: Changes to existing functionality. - - [ ] Removed: Functionality...

aarushjain29

Refactor Jenkinsfile to use declarative pipeline syntax

2

Updated image build process, and remove deprecated code. Added stages for checking and building Docker images, and organized test stages for various configurations. ## Motivation Testing to determine if this...

causten

clamping the scale

3

## Motivation The scale values could underflow or overflow. So, to avoid those cases clamping on both sides. ## Technical Details ## Changelog Category - - [ ] Added: New...

aarushjain29

bugfix

`generic_float` for Float8E8M0

1

## Motivation * Introduce Float8E8M0 type within MIGraphX for better MXFP4 optimizations and to use hipblaslt mxfp4 kernels. ## Technical Details ## Changelog Category - - [ ] Added: New...

CharlieL7

AMDMIGraphX
AMDMIGraphX copied to clipboard

Metadata

Flash decoding round 1

Create op. builders (3.) (MatMul)

Disable fp32 geg

MXFP4 - rocMLIR integration

disable matching for dynamic shapes

Implement flash decoding

migraphx triage guide

Refactor Jenkinsfile to use declarative pipeline syntax

clamping the scale

`generic_float` for Float8E8M0

← Metadata

Owner

Metadata

AMDMIGraphX AMDMIGraphX copied to clipboard

Metadata

← Metadata

Owner

Metadata

AMDMIGraphX
AMDMIGraphX copied to clipboard