Andres Lugo
Implements forward automatic differentiation support for miopen_batch_norm and unskips the associated unit tests. Also fixes a class of functorch-related unit tests that fail due to failing a...
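A minimal sketch (not taken from the PR) of the path these tests exercise: forward-mode AD through a batch-norm layer, assuming a ROCm build where BatchNorm2d dispatches to miopen_batch_norm.

```python
import torch
import torch.autograd.forward_ad as fwAD

bn = torch.nn.BatchNorm2d(8).to("cuda")        # on ROCm this dispatches to miopen_batch_norm
x = torch.randn(4, 8, 16, 16, device="cuda")
tangent = torch.randn_like(x)                  # direction for the forward-mode derivative

with fwAD.dual_level():
    dual_x = fwAD.make_dual(x, tangent)        # pair the primal input with its tangent
    dual_out = bn(dual_x)
    primal_out, jvp = fwAD.unpack_dual(dual_out)  # jvp is the forward-mode derivative of bn at x
```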
Reverts the AMAX workaround now that hipblasLT supports AMAX. hipblasLT does not accept a nullptr for scale D, so we create a dummy scalar tensor with the value 1.0...
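Illustrative sketch of the idea only (the actual change lives in the ATen C++ hipblasLt wrapper, not in Python): rather than handing hipblasLT a null D-scale pointer, allocate a one-element tensor holding 1.0 and pass its data pointer instead.

```python
import torch

# Dummy scale-D value of 1.0; in the C++ path its data pointer is what gets
# passed to the hipblasLT GEMM call in place of nullptr.
scale_d = torch.ones((), dtype=torch.float32, device="cuda")
```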
Pushing to our internal fork. Already merged upstream here: https://github.com/pytorch/pytorch/pull/123275
Hey @jithunnair-amd, this PR is the change to fix the ldl_factor tests regarding the "hermitian" flag. I know we wanted to wait until hipsolver was enabled by default (hence the...
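For reference, a minimal sketch (not from the PR) of the call these tests cover: torch.linalg.ldl_factor with hermitian=True on a complex matrix, assuming a ROCm device.

```python
import torch

A = torch.randn(4, 4, dtype=torch.complex64, device="cuda")
A = A + A.mH                                    # make A Hermitian so the flag is meaningful
LD, pivots = torch.linalg.ldl_factor(A, hermitian=True)
```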
Porting recent ck gemm backend changes to ROCm
Fixes #ISSUE_NUMBER
Initial prototype for the sdpa ck backend. Does not support an odd number of attention heads.
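A minimal sketch (not from the PR) of an SDPA call this backend would serve; note the even head count, since odd numbers of attention heads are unsupported in this prototype.

```python
import torch
import torch.nn.functional as F

# [batch, heads, seq_len, head_dim] with 8 heads (even)
q = torch.randn(2, 8, 128, 64, device="cuda", dtype=torch.float16)
k = torch.randn_like(q)
v = torch.randn_like(q)

out = F.scaled_dot_product_attention(q, k, v)
```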
Fixes #ISSUE_NUMBER
Creating this so I can trivially see all sdpa ck tile changes in one place
Replaces https://github.com/ROCm/pytorch/pull/1592 with an updated implementation of the CK gemm backend; the previous PR can be closed. This PR will generate the CK kernels necessary for flash attention. Currently they will be generated in pytorch/aten/src/ATen/native/transformers/hip/flash_attn/. The...