hsb1995

Results 38 comments of hsb1995

Have you solved this problem? I have encountered the same problem and am still solving it. I would like to consult with you on how to resolve the issue? I...

Have you also tried the Qlora option command ? ---- Replied Message ---- | From | ***@***.***> | | Date | 4/9/2024 09:58 | | To | ***@***.***> | |...

![image](https://github.com/AnswerDotAI/fsdp_qlora/assets/149936473/10a34d2a-5931-453f-a5ea-ca5a19a1be50) I just tested that files with small weights can be computed in parallel, but files with large weights cannot. This indicates that it is not an issue with our...

> Hi @hsb1995 , > > Yes, 7B is working fine without an issue with the parallel process. > > Have you also tried the Qlora option command? => Yes,...

![image](https://github.com/AnswerDotAI/fsdp_qlora/assets/149936473/f3ff9f2e-5e17-4dd6-8742-46423ef06509) ![image](https://github.com/AnswerDotAI/fsdp_qlora/assets/149936473/115ddbda-0596-47e4-b5a8-05c068cf8fe6) The command he mentioned in the article requires 128G-CPU, which is currently the case with me. Is it related to this? Or can you take a look at...

Is this a bit awkward for me? Is it because of this reason that the operation did not succeed? @sanipanwala

I feel that my failure was caused by the CPU, and I tried other commands but still couldn't succeed. @johnowhitaker But it's strange why you didn't succeed either. Because all...

![image](https://github.com/AnswerDotAI/fsdp_qlora/assets/149936473/226020b9-5da3-4b85-8bec-90189900eea7) ![image](https://github.com/AnswerDotAI/fsdp_qlora/assets/149936473/0bf27e09-5d20-4b5c-9d92-1d561138ac76) Why does the pre training weight after running decrease by a lot of files? How can I use the trained files for downstream tasks? Do you know? I...