Birch-san
Birch-san
As there wasn't a torch 2.4.0 wheel, I tried building NATTEN myself. It didn't go as smoothly as usual. Most problems were due to cmake giving misleading/incomplete error messages. These...
_Downstream PyTorch issue:_ https://github.com/pytorch/pytorch/issues/133780 **Describe the bug** cuDNN frontend rejects batch_size=0 input with `CUDNN_STATUS_BAD_PARAM` **Expected behavior** cuDNN should return to me a tensor [0, num_head, sequence_length, dims_per_head] something like that,...
## Problem If I trace my model like so (weird shape then nice shape), everything is fine: ```python model(randn(1, 4, 150, 157)) model(randn(1, 4, 64, 64)) ``` whereas if I...
thanks for the great work on stable-fast. it compiles quickly and boosts speed a lot. is it possible to support two different compilation graphs? for example swapping the SDXL UNet's...