Pankayaraj
Results
2
issues of
Pankayaraj
While finetuning Llama from an SFT model trained with lora config I get this type of behavior where both the rewards stay at 0 and the loss never goes down...
Hi I'm unable to access the word2vec matrix or other files from the https://www.rocq.inria.fr/cluster-willow/ cluster wget https://www.rocq.inria.fr/cluster-willow/amiech/word2vec.zip