act and learn input ranges

Open • salamanders opened this issue 10 years ago • 1 comment

Should all state inputs to act be in the range 0 <= stateX < 1?
Should all reward inputs be in the range 0 <= reward < 1?
Is there any way to say "nope, that wasn't a good reply, I want a second opinion!" and get the second-place answer, etc.?

Oct 21 '15 17:10 salamanders

I have been having some success with categorical state inputs. My categories are represented as strings, and I quickly learned that state inputs need to be numeric, so I ended up implementing a string hashcode for them. My string hashcodes are integers in the thousands and they seem to work with the library, so I think your suggested [0, 1) range for inputs is overly constrained.
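For illustration, a minimal sketch of turning string categories into numeric state inputs. The comment does not say which hash was used, so the Java-style 31-multiplier hash here is an assumption, and `string_hashcode` and `to_unit_interval` are hypothetical helper names rather than part of the library. The second helper shows one way to fold a hash into [0, 1) if the narrower range is wanted after all.

```python
def string_hashcode(s: str) -> int:
    """Java-style string hash (assumed; the thread does not name the hash).

    Accumulates 31 * h + code point and wraps to a signed 32-bit integer.
    Matches Java's String.hashCode only for characters in the Basic
    Multilingual Plane, since Python iterates code points, not UTF-16 units.
    """
    h = 0
    for ch in s:
        h = (31 * h + ord(ch)) & 0xFFFFFFFF
    # Reinterpret the unsigned 32-bit value as signed.
    return h - 0x100000000 if h >= 0x80000000 else h


def to_unit_interval(code: int, buckets: int = 1000) -> float:
    """Fold a hash code into [0, 1) using a fixed number of buckets.

    Distinct categories can collide once folded; `buckets` is an
    illustrative default, not a recommended value.
    """
    return (code % buckets) / buckets


state_input = to_unit_interval(string_hashcode("hello"))
```

Note that a hash gives categories an arbitrary numeric ordering, so two similar hash values do not imply similar categories.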
I have been providing integer rewards between -100 and 100, and these seem to work too. However, I have been encountering NaN in the net weights after calls to learn(), so maybe I should try a smaller range?
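A minimal sketch of trying a smaller reward range, assuming rewards are known to lie in [-100, 100]. `scale_reward` and `has_nan` are hypothetical helpers, not library functions, and nothing in this thread confirms that rescaling removes the NaN values; the check is only there to show when the weights first go bad.

```python
import math
from typing import Iterable


def scale_reward(reward: float, max_abs: float = 100.0) -> float:
    """Clip a reward to [-max_abs, max_abs], then scale it into [-1, 1]."""
    clipped = max(-max_abs, min(max_abs, reward))
    return clipped / max_abs


def has_nan(weights: Iterable[float]) -> bool:
    """Return True if any value in a flat sequence of weights is NaN."""
    return any(math.isnan(w) for w in weights)


# Rescale before handing the reward to learn(), and inspect the weights after.
scaled = scale_reward(-50)
weights_ok = not has_nan([0.12, -0.4, 0.9])
```

Calling something like `has_nan` after each call to learn() should narrow down which update introduces the first NaN.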

Mar 17 '19 09:03 gb96