norikazu99
norikazu99
### What happened + What you expected to happen Hello, I have a Windows machine and it seems that there are warnings regarding the sync/checkpoint path due to windows paths...
Hello, when attempting to load my gpu trained RPPO I get the following error. (Note: I only want to use the model's predictions I don't necessarily want to resume training.)...
Hello and thank you for sharing the repo. I'd like to know if muP would work out of the box with Mamba model or I would have to rescale some...
Hello, Your paper seems to have covered linear layers, convs, and transformers but not rnns. Was it just to reduce the number of experiments or is their a more fundamental...