pfrl icon indicating copy to clipboard operation
pfrl copied to clipboard

Memory leak (?) when run without a GPU

Open tadashiK opened this issue 5 years ago • 2 comments

Disclaimer: I am not completely sure if this is a bug of PFRL.

When I ran SAC, and TD3 on my university's cluster without a GPU, I observed that memory usage gradually increased and finally reached to 24 GB, which is the amount of RAM assigned to jobs. I confirmed that this occurred on a local workstation as well. My collaborator also confirmed that this occurred on his environment too. He told me that this did not occur when he ran experiments with a GPU. Would you check if this memory leak (?) occurs too on your workstation or cluster? If this occurs in other environments too, this might be a bug of PFRL.

PyTorch version is 1.6.0+cpu, and PFRL is the latest one obtained by git clone .... The command I used is python3 examples/mujoco/reproduction/soft_actor_critic/train_soft_actor_critic.py --env Humanoid-v2 --gpu -1 --num-envs 3. (num-envs and env seem to be unrelated, though.)

I use singularity, and my collaborator use docker, so there is some possibility that this occurs only when PFRL is run in a container. However, I think it is unlikely.

tadashiK avatar Sep 06 '20 11:09 tadashiK

Sorry for the late reply. This is noted, and we will try to investigate this when we have the chance.

prabhatnagarajan avatar Sep 23 '20 05:09 prabhatnagarajan

Thanks! (And thank you for creating and maintaining this great repo!)

tadashiK avatar Sep 23 '20 05:09 tadashiK