Rucha Apte comments

Results 10 comments of


                                            Rucha Apte

torch.cuda.OutOfMemoryError: CUDA out of memory When Trying to Save the Model

> Hi @esko22 - I tried to make the following change - device_map='auto' , however, I am still getting the same error. I am using NVIDIA A100-SXM4-40GB. Are you running...

torch.cuda.OutOfMemoryError: CUDA out of memory When Trying to Save the Model

> No - single GPU A100 on Colab @esko22 - Thank you for your reply. Were you using a token size of 1024 ?

Dataset Preparation for Fine Tuning

Thank you so much for your response @loubnabnl and also pointing to the data-preparation link. I had an allied question regarding the data preparation. I am fine-runing the star-coder on...

TypeError: expected str, bytes or os.PathLike object, not NoneType

I got a similar error - TypeError: expected str, bytes or os.PathLike object, not NoneType. It seemed to output these thing son console - Starting main loop Training... {'loss': 0.6581,...

RuntimeError: RuntimeError: IndexError: list index out of range - multiple GPUs

Hi @Kushalamummigatti , I faced similar issue, and realized that my assignment of validation set was incorrect. When the length of my validation set was 0 , I got a...

Training Loss vs Evaluation Loss during Fine Tuning Star Coder

Hi @ArmelRandy , thank you for your response ! Sure following are the details - This is the command I used in order to fine-tune my model - ` CUDA_VISIBLE_DEVICES=1...

Training Loss vs Evaluation Loss during Fine Tuning Star Coder

Thank you so much for your prompt response @ArmelRandy Let me try with the settings you mentioned. :) I also wanted to know approximately how much examples should I target...

Training Loss vs Evaluation Loss during Fine Tuning Star Coder

Got it , thank you so much for your response. I really appreciate your input. Also , if I use the same data and convert it to ChatML format to...

Training Loss vs Evaluation Loss during Fine Tuning Star Coder

Thanks. That is a really interesting suggestion ! Thank you so much for all your help ! This has really helped me clear my doubts

Training Loss vs Evaluation Loss during Fine Tuning Star Coder

@ArmelRandy - Hi Armel , as per your suggestion, I tried to modify the parameters for the Trainer class and the command looks like this - ` CUDA_VISIBLE_DEVICES=1 nohup python...