mihirkapadiap
mihirkapadiap
Hi Kevin, Getting socket timeout (it worked fine till 168 notebooks or so thereafter it started throwing error. Also it restarted Jupyter Enterprise Gateway. I start EG as: docker stack...
Hi Kevin, Thanks for such quick response! You are amazing! I checked /var/log/messages and I see some timeout happening! I will recreate scenario and try out docker service ls and...
@kevin-bates Ran again for 200 kernels! I did see some error in /var/log/messages again. However I could do docker service ls! I did it may be couple minutes after error...
@kevin-bates How do I run notebook 6. Just download notebook version 6 and run! Even with version 6 it will still use EG to route request and do management (like...
Another issue I see is when starting EG in Swarm it needs to start on manager node (sometimes when I do docker ps if it is running on non leader...
This is my understanding at high level! 1. On Notebook server user requests a new Python on Docker kernel. 2. Notebook server sends request for new kernel (http) 3. JEG...
@kevin-bates Hi, Tried with bigger swarm cluster. I get Too many open files error. I changed ulimit -n from 1024 to 4096 on all hosts but still get same error...
@kevin-bates Any idea.... Tried creating 25 notebook server each opening 10 kernels. I see lots of kernels stuck or throwing kernel error. In notebook server logs I see timeouts. I...