Pankayaraj

Results 2 issues of Pankayaraj

While finetuning Llama from an SFT model trained with lora config I get this type of behavior where both the rewards stay at 0 and the loss never goes down...

Hi I'm unable to access the word2vec matrix or other files from the https://www.rocq.inria.fr/cluster-willow/ cluster wget https://www.rocq.inria.fr/cluster-willow/amiech/word2vec.zip