petals
petals copied to clipboard
Prompt Tuning met NaN error
I tried to run the code for prompt tuning example, but got the NaN error after some iterations.
Does anybody know the reason for such error? Thanks :)
The dots after the training curve mean NaN.
Hi @xinghua-qu! We made a fix for this in https://github.com/bigscience-workshop/petals/pull/343 which should likely resolve your issue. Can you try running the SST-2 prompt tuning notebook from main?