wang88256187

Results 5 comments of wang88256187

I want to generate trajectory by my own model, how can I get the original obs (not the running state which may help train)? Thank you!

hi, I meet similar problem, my results is always bad in the GAIL. Can you share your experiences on this problem in detail? Thank you very much!

> RL optimizes for the reward you specify. If the benefit of getting high bitrate overwhelms the penalty from rebuffering, it will just choose high bitrate (it's doing its job)....