Joe Booth

Results 54 comments of Joe Booth

thanks @dkmisra - I was able to get cliclab version running on mac. Looking forward to getting this version working!

this is pretty much done but i need to push the right code

Hi @maystroh - I'm glad you figured it out. It would be good to get your feedback as at some point, it would be good to fold the baselines capabilities...

I have not tried visual observations yet - so I'm interested to know how this works out!! Yes, I can confirm that CPU was fast than GPU in my tests....

@maystroh - Sure! I believe I will be able to target and build Linux but I may need your help testing. I've got a revision paper deadline over the weekend...

@maystroh -Great. I've built and uploaded the Linux version - see https://github.com/Sohojoe/MarathonEnvsBaselines/releases/tag/v1.0.0 - download them and put them into a folder named 'env' from the root of MarathonEnvsBaselines code pull...

Sure - This is for Hopper: does it help? velocity - the main reward signal for positive movement to the right uprightBonus - reward signal for keeping the pelvis upright...

@araffin - that is great to hear. I will merge with the latest and re-run the tests

@araffin I got it training using the same hyperparams that I used with openai.baselines The good news is that hopper trains well: * Score ~870 (openai.baselines.ppo2 scores: 700, ml-agents.ppo scores:...

hmm - very strange; I thought it could be normalization but see that you are using that. Maybe I'm doing something dumb - I'll try again by building a script...