Ean Garvey comments

Results: 87 comments of Ean Garvey

[rocm] Wan2.1 layer norm dispatch causes runtime semaphore abort

It seems to be an issue with a transpose on the norm input -- the abort also goes away when I increase --iree-opt-level to O2 or higher. Perhaps we are...

[rocm] Wan2.1 layer norm dispatch causes runtime semaphore abort

@AWoloszyn if you have cycles for it, your context on the HIP runtime would probably help the most here. PTAL

[rocm] Wan2.1 layer norm dispatch causes runtime semaphore abort

Output with AMD_LOG_LEVEL=3: [Azure](https://sharkpublic.blob.core.windows.net/sharkpublic/ean/wan/semabrt_log_level_3.txt)

[rocm] Wan2.1 layer norm dispatch causes runtime semaphore abort

Thank you @AWoloszyn, reducing the sequence length (in this model that means restricting height and width of output) to below 65535 does circumvent the error, though it is obviously not...

[ROCM][Tracker] Wan2.1 Autoencoder3d performance - MI300x

Exports of the original post's MLIR can be reproduced by following the instructions on the shark-ai feature branch @ wan_exports: https://github.com/nod-ai/shark-ai/blob/wan_exports/sharktank/sharktank/torch_exports/wan/README.md
This is the nn.module we are exporting through iree-turbine...

[ROCM][Tracker] Wan2.1 Autoencoder3d performance - MI300x

The next step here, if I understand correctly, is to improve code generation for the attention shape with head dim 384 used in this VAE decode. I can help with...

[ROCM][Tracker] Wan2.1 Autoencoder3d performance - MI300x

@Groverkss I am reimplementing this model in sharktank to get a better export, but I suspect we will still need to support these cases. Has there been any change since...

SHARK API

@gpetters-amd FWIW the existing api file in apps/shark_studio/api/sd.py can be converted to apps/shark_studio/api/turbine-models-sd.py (that's sort of what you have going at the moment, I think) and refactored to match an...

(TorchToLinalg) Support for lowering torch.aten.as_strided

By all means, I'd take a look at any PRs submitted for this. It is, however, not an entry-level task and I don't recommend it for a first-time contributor. @gmalasan

(TorchToLinalg) Support for lowering torch.aten.as_strided

It may help to enumerate the cases where as_strided is used in PyTorch lowerings and decompositions. Chunk (example in OP) is just one of many sources and is likely one...
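
For readers unfamiliar with why chunk can surface as as_strided, here is a minimal sketch in plain Python. It models only the (size, stride, storage_offset) indexing semantics of `torch.as_strided` over a flat buffer. The helper name `as_strided_view` is hypothetical and is not part of PyTorch or torch-mlir, and the 2x6 example tensor is an assumption chosen for illustration, not the shape from the OP.

```python
def as_strided_view(storage, size, stride, storage_offset=0):
    """Materialize a strided view of a flat list as nested lists.

    Hypothetical helper: element [i0, i1, ...] of the view reads
    storage[storage_offset + i0*stride[0] + i1*stride[1] + ...].
    """
    def build(dim, base):
        if dim == len(size):
            return storage[base]
        return [build(dim + 1, base + i * stride[dim]) for i in range(size[dim])]

    return build(0, storage_offset)


# A contiguous 2x6 row-major tensor stored as a flat buffer (assumed example).
storage = list(range(12))

# Chunking along the last dim into two 2x3 pieces keeps the parent strides (6, 1)
# and only changes the size and the storage offset.
first_chunk = as_strided_view(storage, size=(2, 3), stride=(6, 1), storage_offset=0)
second_chunk = as_strided_view(storage, size=(2, 3), stride=(6, 1), storage_offset=3)

# A transpose is the same buffer with the strides swapped.
transposed = as_strided_view(storage, size=(6, 2), stride=(1, 6))

print(first_chunk)
print(second_chunk)
```

In this sketch the chunk case reduces to a slice with unchanged strides, which suggests that at least some as_strided uses could be recognized as simpler ops; whether that holds for the other sources is exactly what the enumeration would need to establish.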