Qihan Ren
Qihan Ren
I think it should be `criterion = metrics.ELBO(batch_size).to(device)` since the ELBO is calculated based on each mini-batch. Is it correct? @kumar-shridhar
Is the problem solved now? I come across the same problem with zero 3 on mix-modality training. The training always hangs at the make_experience stage, and the progress is always...
@ccp123456 @clclclaiggg Hi guys, I think there's one way to circumvent this issue. You could set the default cache dir of huggingface hub to the path where you store your...
> > @ccp123456 @clclclaiggg Hi guys, I think there's one way to circumvent this issue. You could set the default cache dir of huggingface hub to the path where you...
@hamind Hi, I think there's a few things you may try. 1. Rename the folder as `models--gpt2` 2. Use **huggingface hub** (e.g., see [link](https://huggingface.co/docs/huggingface_hub/guides/download#download-an-entire-repository)) to download the files in a...
@hamind That's a good practice. I've also seen your proposal to add a function for loading local files. I believe this is a feature worth to be added.