tuned-lens icon indicating copy to clipboard operation
tuned-lens copied to clipboard

All layers translators with nan loss while training the lens

Open AmeenAli opened this issue 1 year ago • 2 comments

Hello!

Thanks for sharing this amazing work!

I am trying to train the lens over a new dataset HF Dataset (note that the original "the pile" dataset was removed from the internet because of a DMCA claim - as i understood it. as follows :

tuned-lens train --model.name EleutherAI/pythia-160m-deduped --output ./output/ --per_gpu_batch_size=6 --data.name HuggingFaceH4/ultrachat_200k --split train_gen --text_column prompt --wandb test

However, once I look into wandb logs i notice that loss/translator_1 - 12 are all NaNs also the weight_norms_1-12. I have also tried different datasets, but the issue still exists. Any idea what is off?

Thank you!

AmeenAli avatar Dec 23 '23 15:12 AmeenAli