JCH

2 issues by JCH

Dear author: When I run train.py, after about 5000 iterations the neural network produces a NaN value, which causes the program to crash unexpectedly. How...

I applied the discrete SAC code to a custom discrete-action environment. During training, I found that the critic loss did not decrease but instead increased, and...