saint
saint copied to clipboard
Update saint_i.py
Inter sample attention now inherits from nn.MultiheadAttention which includes optimizations such as flash attention