Yongchang Hao
Yongchang Hao
Hi @hiyouga, I am trying out GaLore with this repo. However, I am experiencing a very low throughput on an A6000. How did you manage to make it >1it/s? In...
@hiyouga Thanks for the update. I feel the current data make more sense. For future readers' reference, my preliminary experience aligns well with the data reported in in https://github.com/jiaweizzhao/GaLore/issues/3#issuecomment-1985411364
Thank you. May I know the reason why the matrix A is set to 0 here, unlike other parts (and the paper) where B is 0?