वेदांत

Results 25 comments of वेदांत

can i try solving this ? , i think i should be able to fix it

it could be ``` cleanup_distributed() ``` is called after model saving, and NCCL's cleanup process requires additional GPU memory that wasn't available ? if this is the case we can...

@chamini2 hey fixed the linting please can you review both the PRs : )

hey @chamini2 , sorry for the inconvenience linting is fixed now

thanks for the review : ) , @chamini2 @efiop just let me know what changes need to be done

i think this is also related : [pytorch/torch/nn/utils/clip_grad.py_L166-L168](https://github.com/pytorch/pytorch/blob/bfc873d02ec413344717493e4175a902921359fd/torch/nn/utils/clip_grad.py#L166-L168)

hey thanks for reporting this , please can you try after Removing `type: chat_template` in dataset also if this doesn't work please consider sharing your dataset

Hey, thanks for sharing the dataset! the issue is that GRPO requires a custom transform function to modify the labels . You have the reward functions set up (`ged.runtime.rewards.*`), but...

thanks for reporting the issue , can you try comparing the loss `For Config 1: Look at step 1 ` with `For Config 2: Look at step 16 after full...