macsim

Results 2 comments of macsim

thank you @Mddct comments, But I got a one more question.. As you say "pytorch will automatically select flash attention or memory efficenet attention", We don't need 'use_sdpa=False' like flag...

Thank you very much @Mddct I understand your answer