HiT-MAC

Why does the reward curve look random?

Open zzhixin opened this issue 1 year ago • 0 comments

I followed the instructions in your project to train the executor, using this command:

python main.py --env Pose-v1 --model multi-att-shap --workers 6

Here are the TensorBoard results:

image

The reward curve during training differs from the result reported in the paper, shown below.

image

Am I doing something wrong? How can I reproduce the reward curve from your paper?

zzhixin, Nov 15 '23 14:11