wang88256187
I want to generate trajectories with my own model. How can I get the original observations (not the running state, which may help training)? Thank you!
Hi, I ran into a similar problem: my results with GAIL are always bad. Could you share your experience with this problem in detail? Thank you very much!
> RL optimizes for the reward you specify. If the benefit of getting high bitrate overwhelms the penalty from rebuffering, it will just choose high bitrate (it's doing its job)....
I'm currently using version 1.1, and the table references are still buggy.
I compiled it directly on Overleaf.