LoRA layer param.grad=None after loss.backward() using lora for mamba 2.8B
I'm not sure which community to turn to for help, so I'm posting in all of them; I would appreciate it if someone could answer my question. After applying LoRA to Mamba 2.8B, the LoRA layers' `param.grad` is still `None` after calling `loss.backward()`.
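For reference, here is a minimal sketch of the kind of setup and diagnostic I mean. It assumes LoRA is applied with the `peft` library on top of `mamba_ssm`'s `MambaLMHeadModel`; the checkpoint name and the `target_modules=["in_proj", "out_proj"]` choice are assumptions, not necessarily the original configuration.

```python
# Sketch: wrap Mamba 2.8B with LoRA via peft, run one backward pass,
# and check whether the LoRA parameters actually receive gradients.
# Assumptions: peft + mamba_ssm installed, CUDA available,
# target module names "in_proj"/"out_proj" are illustrative guesses.
import torch
from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel
from peft import LoraConfig, get_peft_model

model = MambaLMHeadModel.from_pretrained("state-spaces/mamba-2.8b").cuda()

lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    target_modules=["in_proj", "out_proj"],  # hypothetical target layers
    lora_dropout=0.05,
    bias="none",
)
model = get_peft_model(model, lora_config)

# Dummy forward/backward pass on random token ids.
input_ids = torch.randint(0, 1000, (1, 32), device="cuda")
logits = model(input_ids).logits
loss = logits.float().mean()
loss.backward()

# If LoRA is wired in correctly, every "lora_" parameter should have
# requires_grad=True and a non-None .grad after backward().
for name, param in model.named_parameters():
    if "lora_" in name:
        print(name,
              "requires_grad =", param.requires_grad,
              "| grad is None:", param.grad is None)
```

In my case, the printout at the end shows `grad is None: True` for the LoRA parameters, which is the problem I'm trying to diagnose.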