iree issues

Propagate reshapes through generics with reduction iterators

Closes https://github.com/iree-org/iree/issues/18854

Improve FuseMultiUseElementwiseProducersPass

The helper function `isHorizontalToGroup` relies on `getBackwardSlice` which doesn't include operations defined above which are used in the body of an operation (vs as an operand) https://github.com/iree-org/iree/blob/114a1427810f3da0234f98c22f58390773b0489a/compiler/src/iree/compiler/DispatchCreation/FusionUtils.cpp#L104 To be conservative,...

IanWood1

good first issue 🌱

performance ⚡

Llama dispatch perf tracking issue

### Tracking issue for llama related `DispatchCreation` changes/improvements #### Reshape related changes: - [ ] Fold `tensor.cast`s with `tensor.expand_shape` ops https://github.com/llvm/llvm-project/pull/112265 - [ ] Bubble `tensor.expand_shape` ops through `tensor.collapse_shape` ops...

IanWood1

performance ⚡

[Attention] Only clamp attention for low precision types

Post-softmax, the range of output is between 0, 1. For low-precision types (like fp8), we scale the output range to be between 0, fpMax, so we can use more of...

Groverkss

Operand #1 does not dominate this use

2

### What happened? For the given IR ```mlir module { func.func @main_graph(%arg0: !torch.vtensor, %arg1: !torch.vtensor , %arg2:!torch.vtensor , %arg3: !torch.vtensor,%arg5: !torch.vtensor , %arg6: !torch.vtensor) -> !torch.vtensor attributes {torch.onnx_meta.ir_version = 8...

pdhirajkumarprasad

bug 🐞

iree
iree copied to clipboard

Metadata

Propagate reshapes through generics with reduction iterators

Improve FuseMultiUseElementwiseProducersPass

Llama dispatch perf tracking issue

[Attention] Only clamp attention for low precision types

Operand #1 does not dominate this use

Move reshape ops through `linalg.generic` with reduction iterators

GPUMaterializeEncoding: expand-to-subgroups in both M and N dimensions

GPUMaterializeEncoding: tune for narrow cases

failed to legalize operation 'hal.interface.constant.load'

← Metadata

Owner

Metadata

iree iree copied to clipboard

Metadata

← Metadata

Owner

Metadata

iree
iree copied to clipboard