Srinivasan Iyer

5 comments by Srinivasan Iyer

Could you share the motivation and the command? Thanks!

@Charles-ux-bit We are working on uploading it to Hugging Face, and it should be available soon, hopefully in a few days. An alternative that works right now is to use the...

@xiangjjj To unblock you, I need the exact command you are trying to run and what you are trying to achieve.

> You cannot send your model to `accelerator.prepare` if using `device_map="auto"` (as the model will be split across GPUs already). Looks like load_state_dict is trying to put the entire model...

@sgugger `from_pretrained` works fine, so if I had saved my checkpoint using `save_pretrained`, everything would be great. Unfortunately, I saved the `state_dict`, so when I load the `state_dict`, it OOMs...