sensenet
sensenet copied to clipboard
reward should exploit exploring an object when touching
need to figure out how to model this.
any info from here we can use: https://medium.com/mlreview/our-nips-2017-learning-to-run-approach-b80a295d3bb5