OLMo
OLMo copied to clipboard
Process get stuck at buling training data loader during Stage 2 training.
❓ The question
As titled. I'm able to run stage 1 pretraining smoothly, but when running stage 2, my code is stuck at running this function for over 30 mins.
Hi, thanks for the question!
Could you share some additional details so we can help troubleshoot the issue? It would be particularly helpful if you could share your config, hardware you're using, and any relevant logs / failures before this function? Are you using a custom dataset? Thank you!