Michele Milesi
Michele Milesi
Hi @Andrei-Aksionov, thanks for your support. We used the commit with id: 1e5afd6fb5653eddc15aafcae8c20f5222e4e1e3. The only two things we have done are: 1. Comment the line that calls the `merge_lora_weights()` function...
Hi @anthony0727, 1. The player has the stochastic state that is flattened, as you can see here: https://github.com/Eclectic-Sheep/sheeprl/blob/40035066a55b76fd9f9dc4d92ee5a749e079e6b1/sheeprl/algos/dreamer_v3/agent.py#L655 or https://github.com/Eclectic-Sheep/sheeprl/blob/40035066a55b76fd9f9dc4d92ee5a749e079e6b1/sheeprl/algos/dreamer_v3/agent.py#L686 2. You are right, the reconstructed observation should be incremented...
Hi @anthony0727, we created a branch for fixing this issue, can you check if it works? (https://github.com/Eclectic-Sheep/sheeprl/tree/fix/dv3-imagination-notebook) Thanks
Hi there, @belerico, yes, we can start with something similar to the two examples you mentioned. For the environment part, I think we can try to recycle [this](https://lightning.ai/or-bix-srl/studios/sheeprl-how-to-integrate-super-mario-bros-enviroment?view=public§ion=tutorials). Or are...
Hi @verityw, we are fixing a few problems we found with half-precision training. After this, we will move on to pure python examples. Thank you for your patience.
Hi @geranim0, yes, the observation space must have the shape attribute. I suggest to use the `gymnasium.spaces.Box` space to augment the observations of the environment. I prepared a branch with...
Great, I'm glad it works. I do not understand why you added the input buffer and how you used it. Can you show me which modification you made? Thanks
Understood, thanks
Hi @geranim0, can you share the error you encountered and which environment you are using? Thanks
I should have fixed the problem, could you check with the multidiscrete action space? Thanks