min-decision-transformer icon indicating copy to clipboard operation
min-decision-transformer copied to clipboard

Training is ok, but failed to eval.

Open 353055619 opened this issue 2 years ago • 2 comments

Hello👋,

Thank you for open-sourcing the code for the min decision transformer. Your code has been tremendously helpful in helping me understand DT.

However, I am currently facing an issue. During the training process, the action loss is indeed steadily decreasing, but the test results have consistently been subpar, to the point of having no discernible impact. I've been grappling with this problem for a while now and can't seem to figure out why this is happening. 截屏2023-10-09 16 12

By the way, I haven't tested it on the three environments, namely halfcheetah, hopper, and walker2d, mainly because I've been struggling with configuring d4rl. I'm using the upgraded version of d4rl provided by Farama, specifically on the pointmaze offline dataset.

If you could spare some time to assist me with this, I would be immensely grateful!

353055619 avatar Oct 09 '23 08:10 353055619

The Decision Transformer paper does not provide results for pointmaze environment, it is a difficult env and I would not expect DT to work well on it out of the box.

If you are struggling to install d4rl, you can refer to the first couple of blocks on the colab notebook in the repo that installs the required libraries on google colab.

nikhilbarhate99 avatar Oct 16 '23 21:10 nikhilbarhate99

Anyway, Thank for your time! 发件人: ***@***.*** ***@***.***> 代表 Nikhil Barhate ***@***.***>日期: 星期二, 2023年10月17日 05:02收件人: nikhilbarhate99/min-decision-transformer ***@***.***>抄送: Godw ***@***.***>, Author ***@***.***>主题: Re: [nikhilbarhate99/min-decision-transformer] Training is ok, but failed to eval. (Issue #5)The Decision Transformer paper does not provide results for pointmaze environment, it is a difficult env and I would not expect DT to work well on it out of the box.—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you authored the thread.Message ID: ***@***.***>

May DT failed to do path planning. I have tried it on the Minari dataset. It seems like it can only work in the Umaze environment.

WangJinCheng1998 avatar Oct 26 '23 14:10 WangJinCheng1998

Hello👋,

Thank you for open-sourcing the code for the min decision transformer. Your code has been tremendously helpful in helping me understand DT.

However, I am currently facing an issue. During the training process, the action loss is indeed steadily decreasing, but the test results have consistently been subpar, to the point of having no discernible impact. I've been grappling with this problem for a while now and can't seem to figure out why this is happening. 截屏2023-10-09 16 12

By the way, I haven't tested it on the three environments, namely halfcheetah, hopper, and walker2d, mainly because I've been struggling with configuring d4rl. I'm using the upgraded version of d4rl provided by Farama, specifically on the pointmaze offline dataset.

If you could spare some time to assist me with this, I would be immensely grateful!

@353055619 Hi, I found the value of state_mean and state_std differed in train.py and test.py recently. Maybe It's this bug that caused this issue.

Jordan-Haidee avatar Dec 01 '24 14:12 Jordan-Haidee

I will completely consider this later! And offer you a full insight into that.

WangJinCheng1998 avatar Dec 01 '24 16:12 WangJinCheng1998