lamare3423

Results 4 comments of lamare3423

So how can we implement iterations for policy and value statements. For any idea mail me. [email protected]

@Souphis i want to understand something. If we imply dynamic reward function, our reward function can be more success. Is it true . for example how can we customize our...

@Souphis First of all thank u for all information. You said that , modify the reward function during learning in your agent. İ read lots of thing but i dont...

How? i want to save trained model . then i will use it for testing in different environment