Federico Belotti
Federico Belotti
Hi, you're technically right, but as you can see in Hafner implementation [here](https://github.com/danijar/dreamerv2/blob/07d906e9c4322c6fc2cd6ed23e247ccd6b7c8c41/dreamerv2/expl.py#L61) you see that he sets the targets as the following: ```python target = { 'embed': context['embed'], 'stoch':...
Hi @nshoman, I've just heard the news yesterday! I would love to implement it. Do you already have some ideas on how to do it? Especially since it's an offline...
Moreover, I would definitely update the implementation of dv3 to the latest one also
Hi everyone, we've just released our new repo for RLHF: https://github.com/Eclectic-Sheep/sheeprlhf. Right now it works with HF models only, but we're working to integrate also lit-gpt :zap: We have a...
Hi @anthony0727, have you tried training with different seeds?
Thanks for the reference @zippeurfou! I will open a or in the next few days with the fixes
Hi @otaj, yeah you're right: after I've dived into more I have to agree with you. The main problem here is that the app state it has to be session_id...
> Option 2 is tricky, because if there would be two users coming at similar time, one would have to _at least_ wait for the works of the other to...