Guanzheng Chen
Results
1
issues of
Guanzheng Chen
Hi, Thanks for your awesome work. In my test on 8xA800, why using USP with ulysses_degree=8 and ring_degree=1 would take more GPU memory than naive Ulysses?