eqy
For #138340. We might consider more sophisticated logic here, but the corresponding logic in other backends doesn't seem to do anything fancy for the non-BSHD/BHSD cases: https://github.com/pytorch/pytorch/blob/ea8ea2f33fc65b33dc562f4b0430f8c79eb81d8d/aten/src/ATen/native/transformers/cuda/attention.cu#L1145 cc @csarofeen @ptrblck...
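To make the "nothing fancy" case concrete, here is a minimal sketch (a hypothetical helper, not the actual check in `attention.cu`) of accepting only the two stride patterns that correspond to a contiguous BHSD tensor and a BSHD tensor viewed via `.transpose(1, 2)`; everything else would fall through to a contiguous copy:

```python
import torch

def is_bshd_or_bhsd(t: torch.Tensor) -> bool:
    """Hypothetical layout check for a 4-D attention input of shape
    [B, H, S, D]: accept plain contiguous (BHSD) or a transpose(1, 2)
    view of a contiguous [B, S, H, D] tensor (BSHD)."""
    if t.dim() != 4 or t.stride(-1) != 1:
        return False
    _, h, s, d = t.shape
    bhsd = t.stride() == (h * s * d, s * d, d, 1)  # contiguous [B, H, S, D]
    bshd = t.stride() == (h * s * d, d, h * d, 1)  # transpose(1, 2) view
    return bhsd or bshd

q = torch.randn(2, 8, 128, 64)                    # BHSD, contiguous
q_t = torch.randn(2, 128, 8, 64).transpose(1, 2)  # BSHD view
print(is_bshd_or_bhsd(q), is_bshd_or_bhsd(q_t))   # True True
```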
The `float16` tolerance was previously set to `1e-5`, which seemed far too strict for half precision.
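For context (a generic sketch, not the test from this PR): `float16` has eps ≈ 9.8e-4, so an absolute tolerance of `1e-5` is below the format's resolution near 1.0 and would flag ordinary rounding differences:

```python
import torch

q = torch.randn(1, 2, 16, 8, dtype=torch.float16)
ref = q.float().softmax(dim=-1).half()  # reference computed in fp32, then cast
out = q.softmax(dim=-1)                 # computed directly in fp16

# atol=1e-5 would reject plain fp16 rounding noise; a bound on the order
# of fp16 eps (~9.8e-4) is the realistic scale for half precision.
torch.testing.assert_close(out, ref, atol=2e-3, rtol=1e-3)
```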
Disabled by default for now behind `TORCH_CUDNN_SDPA_NESTED_TENSOR_ENABLED=1`. Just wanted to get this out before starting a series of SDPA cleanup PRs---the biggest thing is we don't need the boilerplate around...
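Opting in would look roughly like the following (a sketch, assuming a CUDA build with cuDNN SDPA support; the env-var name is from this PR, the rest is the standard nested-tensor SDPA pattern):

```python
import os
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel

# Opt-in gate from this PR; it is read by the C++ backend selection, so it
# must be set before the SDPA call (set here only for illustration).
os.environ["TORCH_CUDNN_SDPA_NESTED_TENSOR_ENABLED"] = "1"

# Jagged (B, S*, H, D) nested tensor, transposed to (B, H, S*, D) for SDPA.
qs = [torch.randn(s, 8, 64, device="cuda", dtype=torch.float16) for s in (16, 12)]
q = torch.nested.nested_tensor(qs, layout=torch.jagged).transpose(1, 2)

# Request the cuDNN backend explicitly; without the env var set, backend
# selection skips cuDNN for nested inputs and falls back to other backends.
with sdpa_kernel(SDPBackend.CUDNN_ATTENTION):
    out = F.scaled_dot_product_attention(q, q, q)
```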