philharmonikerzzy issues

Repositories
Issues
Comments

Results 2 issues of


                                            philharmonikerzzy

Entropy continually increases throughout the training

Hi, using the current implementation of the PPO using the `PPOTrainer`, im seeing that the entropy of the actively updated model continues to increase as the training proceeds. It seems...

What to use for `model_id` when loading from a model trained locally and not published?

Do we have to push the model to hugginface to be able to load a model trained locally?