Zhihang Lin
Results
2
issues of
Zhihang Lin
I write code in keras trying to build a QA system,And I use a lstm layer to compute a representation for question,and Evidenve lstm for analyzing evidence. At last I...
**Describe the bug** When I use zero optimization(stage=3),It's spend lots of time on loading model. I'm trying to finetune OPT-66B on 2 node,each node contain 8*NVIDIA A100-SXM(80G),1TB RAM. I have...
bug