chamuyaye
Results
1
comments of
chamuyaye
trafficstars
> you can replace flash_atten with the normal attention in pytorch. Things will still work, despite that the training speed will be very slow. Can you give detailed steps, thank...