Rucha Apte

Results 10 comments of Rucha Apte

> Hi @esko22 - I tried to make the following change - device_map='auto' , however, I am still getting the same error. I am using NVIDIA A100-SXM4-40GB. Are you running...

> No - single GPU A100 on Colab @esko22 - Thank you for your reply. Were you using a token size of 1024 ?

Thank you so much for your response @loubnabnl and also pointing to the data-preparation link. I had an allied question regarding the data preparation. I am fine-runing the star-coder on...

I got a similar error - TypeError: expected str, bytes or os.PathLike object, not NoneType. It seemed to output these thing son console - Starting main loop Training... {'loss': 0.6581,...

Hi @Kushalamummigatti , I faced similar issue, and realized that my assignment of validation set was incorrect. When the length of my validation set was 0 , I got a...

Hi @ArmelRandy , thank you for your response ! Sure following are the details - This is the command I used in order to fine-tune my model - ` CUDA_VISIBLE_DEVICES=1...

Thank you so much for your prompt response @ArmelRandy Let me try with the settings you mentioned. :) I also wanted to know approximately how much examples should I target...

Got it , thank you so much for your response. I really appreciate your input. Also , if I use the same data and convert it to ChatML format to...

Thanks. That is a really interesting suggestion ! Thank you so much for all your help ! This has really helped me clear my doubts

@ArmelRandy - Hi Armel , as per your suggestion, I tried to modify the parameters for the Trainer class and the command looks like this - ` CUDA_VISIBLE_DEVICES=1 nohup python...