DeepRL-TensorFlow2
DeepRL-TensorFlow2 copied to clipboard
From_logit in A2C_discrete.py should be False
In the Actor net, It seems that from_logit should be set to False in tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True) as you added a softmax in the last layer:)