Sylvain Gugger
Sylvain Gugger
The PR linked above should fix the two last issues if you want to give it a try.
The result of the tokenizer calls can already interact with the `to` method (note that batch_encode_plus will be deprecated sometime soon) but I agree it could be interesting to look...
Mmmm there is no reason for the script to give different results for different GPUs, especially since removing the device_map="auto" gives the same results. I also can't reproduce on my...
You need to make sure to execute the cell `notebook_login()` at the beginning and pass it your token (it provides a direct link to your token pages on hf.co)
Your code example doesn't define multiple objects, so I can't really tell what's wrong. Please give us a minimal reproducer we can execute.
Could you also print us `trainer.state`? The error comes from the fact it is not JSON-serializable so it would help to know which object in it is not serializable. Thanks!
So your metrics are not floats, but one ends up being a whole scikit-learn module, this is why you have the issue. The code you pasted is actually super weird:...
You just need to add the proper attribute to `GLMPreTrainedModel` so that it knows which layers should not be split across GPUs and then test it works properly. Since this...
Does it work without the load_in_8bit part? Also what is your version of Accelerate?