LarryLiZimo
> Hello, > > > but only 64 G available in my k8s container. I ran a training script, which is estimated to require 30 G memory. > > if...
Decreasing the buffer_size is fine, but not being warned about a potential "Out of Memory" condition is bad.
By reading "/sys/fs/cgroup/memory.max" and "/sys/fs/cgroup/memory.current", I can manually check how much memory is free inside the container. On older Linux setups (cgroup v1), read "/sys/fs/cgroup/memory/memory.limit_in_bytes" instead. This method works fine on my Linux machine. I am...
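To make the manual check concrete, here is a rough sketch of how the cgroup files could be read from Python. The function names (`parse_cgroup_value`, `free_memory_from`, `container_free_memory`, `warn_if_low`) are hypothetical and not part of any library. The cgroup v1 usage file `memory.usage_in_bytes` is my assumption for the counterpart of `memory.limit_in_bytes`; it is not mentioned above. I have only reasoned about this on a typical container layout, so treat it as a starting point.

```python
import warnings
from pathlib import Path
from typing import Optional

# (limit file, usage file) pairs: cgroup v2 first, then cgroup v1.
# The v1 usage path is an assumption, see the note above.
CGROUP_FILES = (
    (Path("/sys/fs/cgroup/memory.max"),
     Path("/sys/fs/cgroup/memory.current")),
    (Path("/sys/fs/cgroup/memory/memory.limit_in_bytes"),
     Path("/sys/fs/cgroup/memory/memory.usage_in_bytes")),
)


def parse_cgroup_value(text: str) -> Optional[int]:
    """Parse a cgroup memory value; 'max' (no limit) or garbage gives None."""
    text = text.strip()
    if text == "max":
        return None
    try:
        return int(text)
    except ValueError:
        return None


def free_memory_from(limit_path: Path, usage_path: Path) -> Optional[int]:
    """Return limit minus usage in bytes, or None if either is unavailable."""
    try:
        limit = parse_cgroup_value(limit_path.read_text())
        usage = parse_cgroup_value(usage_path.read_text())
    except OSError:
        return None
    if limit is None or usage is None:
        return None
    return max(limit - usage, 0)


def container_free_memory() -> Optional[int]:
    """Try each known cgroup layout and return the first usable result."""
    for limit_path, usage_path in CGROUP_FILES:
        free = free_memory_from(limit_path, usage_path)
        if free is not None:
            return free
    return None  # no cgroup limit found; caller should fall back


def warn_if_low(required_bytes: int, free_bytes: Optional[int]) -> bool:
    """Warn (and return True) when the request exceeds the free memory."""
    if free_bytes is not None and required_bytes > free_bytes:
        warnings.warn(
            f"Buffer needs {required_bytes} bytes but only {free_bytes} "
            "bytes are free in this container."
        )
        return True
    return False
```

A caller could run `warn_if_low(required, container_free_memory())` before allocating the buffer. When no limit is set, cgroup v2 reports `max`, which this sketch treats as "unknown" and returns `None`; cgroup v1 reports a very large number instead, so the result there will simply be very large.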
> Do you have a proposed fix?

Maybe I'll ask one of my experienced co-workers to improve `ReplayBuffer` and make a PR.