PaddleCloud
PaddleCloud copied to clipboard
Can not stop docker container and too much `kworker` process on host
The problem describetion:
- GPU can not be released normally, The Docker Continaer which used GPU can not be stop.
- The command
nvidia-smiwill be hanged. - Too high for the system load(3k+).
- Too much
kworkerprocess(3k+).
Environment
- OS: CentOS 7.2
- Kernel: 4.4.79-1.el7.elrepo.x86_64
- GPU: P40, Driver: 375.26
- Docker: 1.12
- How to use GPU in Docker: mount nvidia libraries and nvidia device in container.
Thought kubernetes will use --device argument to attach GPU device to the container, and this issue is very possible that has something to do with docker.