Zhihang Lin

Results 2 issues of Zhihang Lin

I am writing code in Keras to build a QA system. I use an LSTM layer to compute a representation of the question, and an evidence LSTM to analyze the evidence. At last I...

**Describe the bug** When I use ZeRO optimization (stage=3), loading the model takes a very long time. I'm trying to fine-tune OPT-66B on 2 nodes, each with 8 NVIDIA A100-SXM (80 GB) GPUs and 1 TB of RAM. I have...

bug