Soft-Actor-Critic-and-Extensions
Soft-Actor-Critic-and-Extensions copied to clipboard
Use of sum trees
Hello, in the original PER paper I believe sum tree was used to speed up sampling, and I believe ERE also mentions using it in their PER implementation and PER + ERE implementation as well. It seems that your code uses simple np.random.choice to sample instead.
Have you tried implementing the tree data structure to see if that speeds up the code at all?
Thanks!