transformer_nuggets
FlexAttention currently evaluates all values in partial blocks, which can lead to an illegal memory access (IMA).
See https://github.com/pytorch/pytorch/issues/147551#issuecomment-2683700299
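
The note above can be illustrated with a small plain-Python sketch. It is not FlexAttention's kernel code; it only assumes that a partial block is evaluated at every position, including positions past the end of the sequence, and that the mask function reads a per-position lookup table. The names `BLOCK_SIZE`, `SEQ_LEN`, `doc_ids`, `unsafe_mask`, `safe_mask`, and `evaluate_block` are hypothetical. In plain Python the out-of-range read raises `IndexError`; on a GPU the analogous read would be the illegal memory access.

```python
# Hypothetical illustration in plain Python; not FlexAttention's actual kernel code.
BLOCK_SIZE = 4  # assumed block size for this sketch
SEQ_LEN = 6     # not a multiple of BLOCK_SIZE, so the last block is partial

# Per-position lookup table with exactly SEQ_LEN entries (for example, document ids).
doc_ids = [0, 0, 0, 1, 1, 1]


def unsafe_mask(q_idx, kv_idx):
    # Reads the table directly, so any index >= SEQ_LEN is out of bounds.
    return doc_ids[q_idx] == doc_ids[kv_idx]


def safe_mask(q_idx, kv_idx):
    # Clamp indices before the lookup so the read always stays in bounds,
    # then mask out positions that were past the end of the sequence.
    in_bounds = q_idx < SEQ_LEN and kv_idx < SEQ_LEN
    q = min(q_idx, SEQ_LEN - 1)
    kv = min(kv_idx, SEQ_LEN - 1)
    match = doc_ids[q] == doc_ids[kv]
    return in_bounds and match


def evaluate_block(mask_fn, block_row, block_col):
    # Evaluate every position of one block, including positions past SEQ_LEN.
    rows = range(block_row * BLOCK_SIZE, (block_row + 1) * BLOCK_SIZE)
    cols = range(block_col * BLOCK_SIZE, (block_col + 1) * BLOCK_SIZE)
    return [[mask_fn(q, kv) for kv in cols] for q in rows]
```

Calling `evaluate_block(safe_mask, 1, 1)` covers positions 4 through 7 on both axes without an out-of-range read, whereas `evaluate_block(unsafe_mask, 1, 1)` fails as soon as it touches position 6. Clamping plus an explicit bounds check is one possible guard under these assumptions, not necessarily the fix discussed in the linked issue.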