large-scale-curiosity
large-scale-curiosity copied to clipboard
Does normalized rewards works with other Agents for Attari ?
Using the normalized reward (#6 ) with the other agent's, taking the example of A2C where the discounted rewards are used on the extrinsic reward.
- Now to which extent we need to normalize the intrinsic rewards as the forward loss (#3 ) tend to get low with the passage of time.
- Second, Does the scaling of rewards also requires advantage normalization. ? In the perspective of agents A2C, ACKTR etc