dexfrost89
dexfrost89
If I got it correct: first I record demo file with an oscillator, second - behavior cloning for 10M-15M steps with extrinsic reward signal 0.1, GAIL reward signal 1.0 and...
I got this behavior after 100M steps on your data https://drive.google.com/file/d/1VmbZDDP_3KzUhMaQe0Sa6N4kzfjFPcp7/view?usp=sharing
I tried several lengths for behavior cloning. Unfortunately, this behavior is the best what I got after 40M steps of learning. https://drive.google.com/file/d/1unkTrmxDzP9MTFeIVh6fltZ0FL_HDYlh/view?usp=sharing Is there any way to contact you somewhere...
Hello! I have the same problem when I try to run spot_ars.py. spot_ars_eval.py works fine.