macsim
Results
2
comments of
macsim
thank you @Mddct comments, But I got a one more question.. As you say "pytorch will automatically select flash attention or memory efficenet attention", We don't need 'use_sdpa=False' like flag...
Thank you very much @Mddct I understand your answer