Docker-CODE
Docker restarts infinitely: "Failed to fork child process"
Reproducible: Always
Steps to reproduce:
docker pull collabora/code
docker run -l debug -t -d -p My.IP.Add.ress:9980:9980 -e "domain=www\\.my\\-domain\\.tld\|my\-domain\\.tld" --restart always --cap-add MKNOD collabora/code
Expected result: Docker should start up fine
Actual result: Docker will restart over and over (filling up HDD space)
Logs:
docker -l debug logs -f DOCKERID
Generating RSA private key, 2048 bit long modulus
.......................................+++
......................................................................................................................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.0 - b2e736a3
wsd-00024-00024 06:54:08.272296 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2405
Failed to fork child processes.
wsd-00024-00024 06:54:08.274339 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2551
Failed to fork child processes.
wsd-00024-00024 06:54:08.278844 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:445
wsd-00024-00024 06:54:08.279161 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:445
-00024 06:54:08.283357 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:445
-00024 06:54:08.283444 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:445
-00024 06:54:08.283526 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:445
-00024 06:54:08.283566 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:445
-00024 06:54:08.283630 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:445
-00024 06:54:08.283705 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:445
-00024 06:54:08.284083 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:445
-00024 06:54:08.284149 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:445
frk-00026-00026 06:54:08.289767 [ forkit ] FTL Pipe closed.| common/IoUtil.cpp:309
System information
uname -a
4.4.0-78-generic #99-Ubuntu SMP Thu Apr 27 15:29:09 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
free -m
total used free shared buff/cache available
Mem: 3951 289 924 46 2737 3299
Swap: 1023 0 1023
This is more or less a cross-post from https://help.nextcloud.com/t/restarting-docker-results-in-failed-to-fork-child-processes/12209
After updating to the latest Docker image, it is working again for me. If anybody did something on purpose to fix this: thanks :-)
I'm sorry to post on a closed issue, but this problem still seems to be present, or it has returned since the OP commented that an update fixed his situation. I am using a Docker image pulled today. The system is Arch Linux.
#uname -r
4.9.30-1-lts
#pacman -Si docker
Repository : community
Name : docker
Version : 1:17.05.0-1
[…]
Build Date : Fri 05 May 2017 10:42:16 PM CEST
Validated By : MD5 Sum SHA-256 Sum Signature
Installed Size : 88.17 MiB
Packager : Sé
[
{
"Id": "38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a",
"Created": "2017-06-04T07:52:49.45273884Z",
"Path": "/bin/sh",
"Args": [
"-c",
"bash start-libreoffice.sh"
],
"State": {
"Status": "exited",
"Running": false,
"Paused": false,
"Restarting": false,
"OOMKilled": false,
"Dead": false,
"Pid": 0,
"ExitCode": 137,
"Error": "",
"StartedAt": "2017-06-04T07:57:17.788074925Z",
"FinishedAt": "2017-06-04T12:03:22.069898922Z"
},
"Image": "sha256:d3757c5469234839e2d58dfaee49cc8466d61f87074961391986e861c6dbf14f",
"ResolvConfPath": "/var/lib/docker/containers/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a/resolv.conf",
"HostnamePath": "/var/lib/docker/containers/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a/hostname",
"HostsPath": "/var/lib/docker/containers/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a/hosts",
"LogPath": "/var/lib/docker/containers/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a-json.log",
"Name": "/wizardly_borg",
"RestartCount": 5,
"Driver": "overlay2",
"MountLabel": "",
"ProcessLabel": "",
"AppArmorProfile": "",
"ExecIDs": null,
"HostConfig": {
"Binds": null,
"ContainerIDFile": "",
"LogConfig": {
"Type": "json-file",
"Config": {}
},
"NetworkMode": "default",
"PortBindings": {
"9980/tcp": [
{
"HostIp": "127.0.0.1",
"HostPort": "9980"
}
]
},
"RestartPolicy": {
"Name": "always",
"MaximumRetryCount": 0
},
"AutoRemove": false,
"VolumeDriver": "",
"VolumesFrom": null,
"CapAdd": [
"MKNOD"
],
"CapDrop": null,
"Dns": [],
"DnsOptions": [],
"DnsSearch": [],
"ExtraHosts": null,
"GroupAdd": null,
"IpcMode": "",
"Cgroup": "",
"Links": null,
"OomScoreAdj": 0,
"PidMode": "",
"Privileged": false,
"PublishAllPorts": false,
"ReadonlyRootfs": false,
"SecurityOpt": null,
"UTSMode": "",
"UsernsMode": "",
"ShmSize": 67108864,
"Runtime": "runc",
"ConsoleSize": [
0,
0
],
"Isolation": "",
"CpuShares": 0,
"Memory": 0,
"NanoCpus": 0,
"CgroupParent": "",
"BlkioWeight": 0,
"BlkioWeightDevice": null,
"BlkioDeviceReadBps": null,
"BlkioDeviceWriteBps": null,
"BlkioDeviceReadIOps": null,
"BlkioDeviceWriteIOps": null,
"CpuPeriod": 0,
"CpuQuota": 0,
"CpuRealtimePeriod": 0,
"CpuRealtimeRuntime": 0,
"CpusetCpus": "",
"CpusetMems": "",
"Devices": [],
"DeviceCgroupRules": null,
"DiskQuota": 0,
"KernelMemory": 0,
"MemoryReservation": 0,
"MemorySwap": 0,
"MemorySwappiness": -1,
"OomKillDisable": false,
"PidsLimit": 0,
"Ulimits": null,
"CpuCount": 0,
"CpuPercent": 0,
"IOMaximumIOps": 0,
"IOMaximumBandwidth": 0
},
"GraphDriver": {
"Data": {
"LowerDir": "/var/lib/docker/overlay2/a6e3472d865a223330d01c5b078369f206439bea8c8f99e1dc600ca483ffaee0-init/diff:/var/lib/docker/overlay2/b14335218e66a1ea8940ef671bec2f7c6f7030a9f184765af6d9f60ba6c13e94/diff:/var/lib/docker/overlay2/ad5642adbe8566f711fcec95ae4cab5e62ce34cc1df63d108940985415cd149e/diff:/var/lib/docker/overlay2/a30640e1e0bc41324e49d69e0181245bdf94f15dea4520841cddf3fba9c0ccb6/diff:/var/lib/docker/overlay2/24f88ff1146028958a4b0c9467b02afaf052d239616e45a8e808f840e4781113/diff:/var/lib/docker/overlay2/411c16332e95176fb553363ff8248db4c8e5a3464f757ba63726b7f9f92db4e9/diff:/var/lib/docker/overlay2/685233aa866ef5afa2823e5c3d4512df097d59dd9d0bde8253f441735e37535a/diff:/var/lib/docker/overlay2/752fb968056ab7eca54814d6f09a0fe7a44fd37e7be8774cd42fca79733d3b4d/diff:/var/lib/docker/overlay2/d886180b45b8e08fe76f25db152f30e5f2a9af75aaae5f33634881ade9af904a/diff",
"MergedDir": "/var/lib/docker/overlay2/a6e3472d865a223330d01c5b078369f206439bea8c8f99e1dc600ca483ffaee0/merged",
"UpperDir": "/var/lib/docker/overlay2/a6e3472d865a223330d01c5b078369f206439bea8c8f99e1dc600ca483ffaee0/diff",
"WorkDir": "/var/lib/docker/overlay2/a6e3472d865a223330d01c5b078369f206439bea8c8f99e1dc600ca483ffaee0/work"
},
"Name": "overlay2"
},
"Mounts": [],
"Config": {
"Hostname": "38ea21ae35b6",
"Domainname": "",
"User": "",
"AttachStdin": false,
"AttachStdout": false,
"AttachStderr": false,
"ExposedPorts": {
"9980/tcp": {}
},
"Tty": true,
"OpenStdin": false,
"StdinOnce": false,
"Env": [
"domain=subdomain\\.domain\\.org",
"PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
"LC_CTYPE=en_US.UTF-8"
],
"Cmd": [
"/bin/sh",
"-c",
"bash start-libreoffice.sh"
],
"ArgsEscaped": true,
"Image": "collabora/code",
"Volumes": null,
"WorkingDir": "",
"Entrypoint": null,
"OnBuild": null,
"Labels": {}
},
"NetworkSettings": {
"Bridge": "",
"SandboxID": "ffeaab1f03863e71fcb167480dd9c100cb698557378f991972915b0a3fef004d",
"HairpinMode": false,
"LinkLocalIPv6Address": "",
"LinkLocalIPv6PrefixLen": 0,
"Ports": {},
"SandboxKey": "/var/run/docker/netns/ffeaab1f0386",
"SecondaryIPAddresses": null,
"SecondaryIPv6Addresses": null,
"EndpointID": "",
"Gateway": "",
"GlobalIPv6Address": "",
"GlobalIPv6PrefixLen": 0,
"IPAddress": "",
"IPPrefixLen": 0,
"IPv6Gateway": "",
"MacAddress": "",
"Networks": {
"bridge": {
"IPAMConfig": null,
"Links": null,
"Aliases": null,
"NetworkID": "e55ad2c38cb147a8209e284ef5dcb5078a23641207cd25f490900f3890fbf328",
"EndpointID": "",
"Gateway": "",
"IPAddress": "",
"IPPrefixLen": 0,
"IPv6Gateway": "",
"GlobalIPv6Address": "",
"GlobalIPv6PrefixLen": 0,
"MacAddress": ""
}
}
}
}
]
#docker -l debug logs CONTAINER_ID
Generating RSA private key, 2048 bit long modulus
..................+++
.+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
....................................................+++
.........................................................................................................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
wsd-00025-00025 07:53:43.257922 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
Failed to fork child processes.
wsd-00025-00025 07:53:43.258517 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2590
Failed to fork child processes.
wsd-00025-00025 07:53:43.258980 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
wsd-00025-00025 07:53:43.259072 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260100 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260159 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260207 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260258 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260303 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260348 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260663 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260718 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
frk-00027-00027 07:53:43.263370 [ forkit ] FTL Pipe closed.| common/IoUtil.cpp:309
Generating RSA private key, 2048 bit long modulus
...................................................+++
.........................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
.............+++
........+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
wsd-00025-00025 07:54:35.663806 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
Failed to fork child processes.
wsd-00025-00025 07:54:35.664365 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2590
Failed to fork child processes.
wsd-00025-00025 07:54:35.664815 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
wsd-00025-00025 07:54:35.664913 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.665923 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.665981 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666022 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666069 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666113 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666156 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666432 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666495 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
frk-00027-00027 07:54:35.669392 [ forkit ] FTL Pipe closed.| common/IoUtil.cpp:309
Generating RSA private key, 2048 bit long modulus
..............................................................+++
..........................................................................................................................................................................................................................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
.........................................+++
......................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
wsd-00025-00025 07:55:29.282207 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
Failed to fork child processes.
wsd-00025-00025 07:55:29.282810 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2590
Failed to fork child processes.
wsd-00025-00025 07:55:29.283304 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
wsd-00025-00025 07:55:29.283410 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284533 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284593 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284685 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284762 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284833 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284911 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.285297 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.285381 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
frk-00027-00027 07:55:29.287970 [ forkit ] FTL Pipe closed.| common/IoUtil.cpp:309
Generating RSA private key, 2048 bit long modulus
.......................+++
.........................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
..............................................................................+++
..................................................................................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
wsd-00024-00024 07:56:22.727144 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
Failed to fork child processes.
wsd-00024-00024 07:56:22.727668 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2590
Failed to fork child processes.
wsd-00024-00024 07:56:22.728187 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
wsd-00024-00024 07:56:22.728290 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729318 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729380 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729465 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729539 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729609 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729676 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.730004 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.730085 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
frk-00026-00026 07:56:22.732483 [ forkit ] FTL Pipe closed.| common/IoUtil.cpp:309
Generating RSA private key, 2048 bit long modulus
.............................................................................+++
.............................................................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
............+++
..+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
wsd-00025-00025 07:57:15.609266 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
Failed to fork child processes.
wsd-00025-00025 07:57:15.609812 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2590
Failed to fork child processes.
wsd-00025-00025 07:57:15.610269 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
wsd-00025-00025 07:57:15.610380 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611411 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611471 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611518 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611567 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611613 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611657 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611993 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.612049 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
frk-00027-00027 07:57:15.614443 [ forkit ] FTL Pipe closed.| common/IoUtil.cpp:309
Generating RSA private key, 2048 bit long modulus
...............+++
......+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
.......................................................................+++
....+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
office version details: { "ProductName": "Collabora Office", "ProductVersion": "5.3", "ProductExtension": ".10.13", "BuildId": "e639c2ad9c05a793b16db13bd879342ed75dcf95" }
wsd-00025-00026 07:57:46.504191 [ prisoner_poll ] WRN ForKit not responsive for 28299 ms forking 1 children. Resetting.| wsd/LOOLWSD.cpp:338
I'm available to provide any extra info or perform tests if needed. Thanks for any insight. Regards, Mark.
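Looking at the timestamps in the log above, the failed attempts are spaced quite regularly. The sketch below takes the first FTL line of each attempt (copied from the log, with the terminal colour codes removed) and prints the gap between consecutive attempts. The regular expression is an assumption about the log layout seen here, not an official loolwsd format.

```python
import re
from datetime import datetime

# First FTL line of each failed attempt, copied from the log above
# (terminal colour codes removed).
LOG = """\
wsd-00025-00025 07:53:43.257922 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
wsd-00025-00025 07:54:35.663806 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
wsd-00025-00025 07:55:29.282207 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
wsd-00024-00024 07:56:22.727144 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
wsd-00025-00025 07:57:15.609266 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
"""

# Assumed layout: "<HH:MM:SS.micros> [ loolwsd ] FTL ..."
STAMP = re.compile(r"(\d{2}:\d{2}:\d{2}\.\d{6}) \[ loolwsd \] FTL")

# Parse the time of day of every matching line (all entries are on the same day).
times = [datetime.strptime(m.group(1), "%H:%M:%S.%f") for m in STAMP.finditer(LOG)]

# Seconds between each attempt and the next one.
gaps = [(later - earlier).total_seconds() for earlier, later in zip(times, times[1:])]

for gap in gaps:
    print(f"{gap:.1f} s between attempts")
```

The gaps all come out between 52 and 54 seconds, which hints at a fixed time limit plus restart overhead rather than a random crash, although the log alone cannot confirm that.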
This still appears broken; the Docker image just keeps restarting for me too.
same here
I'm reopening this, as it still seems to affect many people. In fact, I've not dared to restart my own Docker container: it is currently running, and there are so many reports of it not working for others that I don't want to take any risks.
You might try to restart it over and over again and see if it is some kind of timing / race-condition where it might work 1 out of 20 times or something...
Again, I've not changed any of my setup, and it suddenly worked after trying countless times...
Status : Waiting: CrashLoopBackOff Restarts : 210
:D it's not 1/20 :D 0/210 for me
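For a sense of scale: if each start really had an independent 1-in-20 chance of succeeding (an assumption taken from the guess above, not a measured rate), 210 failures in a row would be extremely unlikely. A quick check:

```python
# Assumed per-attempt success rate, taken from the "1 out of 20" guess above.
p_success = 1 / 20
restarts = 210  # restart count reported for the CrashLoopBackOff pod

# Probability that every one of the independent attempts fails.
p_all_fail = (1 - p_success) ** restarts
print(f"P(all {restarts} attempts fail) = {p_all_fail:.1e}")
```

That comes out at roughly 2 in 100,000, so on this system the failure looks deterministic rather than like a rare race.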
[
{
"Id": "ea1e806cd2cea55598aa7eed2a6efc3a196e17b941c42c362dd2a17e1db48e76",
"Created": "2017-07-10T17:35:03.395726169Z",
"Path": "/bin/sh",
"Args": [
"-c",
"bash start-libreoffice.sh"
],
"State": {
"Status": "exited",
"Running": false,
"Paused": false,
"Restarting": false,
"OOMKilled": false,
"Dead": false,
"Pid": 0,
"ExitCode": 70,
"Error": "",
"StartedAt": "2017-07-10T17:35:03.576286932Z",
"FinishedAt": "2017-07-10T17:35:54.150846205Z"
},
"Image": "sha256:8bc1d306ff41c5a358942308649f3eb76f2a9248d294793e08c1f532d4a04978",
"ResolvConfPath": "/var/lib/docker/containers/305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260/resolv.conf",
"HostnamePath": "/var/lib/docker/containers/305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260/hostname",
"HostsPath": "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/etc-hosts",
"LogPath": "/var/lib/docker/containers/ea1e806cd2cea55598aa7eed2a6efc3a196e17b941c42c362dd2a17e1db48e76/ea1e806cd2cea55598aa7eed2a6efc3a196e17b941c42c362dd2a17e1db48e76-json.log",
"Name": "/k8s_collabora-code_sad-kitten-collabora-code-4174266487-ljrcx_default_0d4f0519-64ea-11e7-ab20-7054d216f5a7_211",
"RestartCount": 0,
"Driver": "overlay",
"MountLabel": "",
"ProcessLabel": "",
"AppArmorProfile": "",
"ExecIDs": null,
"HostConfig": {
"Binds": [
"/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/volumes/kubernetes.io~secret/default-token-n3djs:/var/run/secrets/kubernetes.io/serviceaccount:ro",
"/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/etc-hosts:/etc/hosts",
"/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/containers/collabora-code/e1a7542c:/dev/termination-log"
],
"ContainerIDFile": "",
"LogConfig": {
"Type": "json-file",
"Config": null
},
"NetworkMode": "container:305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260",
"PortBindings": null,
"RestartPolicy": {
"Name": "",
"MaximumRetryCount": 0
},
"AutoRemove": false,
"VolumeDriver": "",
"VolumesFrom": null,
"CapAdd": [
"MKNOD"
],
"CapDrop": null,
"Dns": null,
"DnsOptions": null,
"DnsSearch": null,
"ExtraHosts": null,
"GroupAdd": null,
"IpcMode": "container:305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260",
"Cgroup": "",
"Links": null,
"OomScoreAdj": 992,
"PidMode": "",
"Privileged": false,
"PublishAllPorts": false,
"ReadonlyRootfs": false,
"SecurityOpt": [
"seccomp=unconfined"
],
"StorageOpt": null,
"UTSMode": "",
"UsernsMode": "",
"ShmSize": 67108864,
"ConsoleSize": [
0,
0
],
"Isolation": "",
"CpuShares": 102,
"Memory": 1602224128,
"CgroupParent": "/kubepods/burstable/pod0d4f0519-64ea-11e7-ab20-7054d216f5a7",
"BlkioWeight": 0,
"BlkioWeightDevice": null,
"BlkioDeviceReadBps": null,
"BlkioDeviceWriteBps": null,
"BlkioDeviceReadIOps": null,
"BlkioDeviceWriteIOps": null,
"CpuPeriod": 100000,
"CpuQuota": 100000,
"CpusetCpus": "",
"CpusetMems": "",
"Devices": [],
"DiskQuota": 0,
"KernelMemory": 0,
"MemoryReservation": 0,
"MemorySwap": -1,
"MemorySwappiness": -1,
"OomKillDisable": false,
"PidsLimit": 0,
"Ulimits": null,
"CpuCount": 0,
"CpuPercent": 0,
"BlkioIOps": 0,
"BlkioBps": 0,
"SandboxSize": 0
},
"GraphDriver": {
"Name": "overlay",
"Data": {
"LowerDir": "/var/lib/docker/overlay/3d6352145379228a0df0a5d17d6b13bae0637c567d1886122a4a8f538e9f2f16/root",
"MergedDir": "/var/lib/docker/overlay/b2bf37103bb655f679122755650e0e889df049995cd472047ccb2c03a5cc8675/merged",
"UpperDir": "/var/lib/docker/overlay/b2bf37103bb655f679122755650e0e889df049995cd472047ccb2c03a5cc8675/upper",
"WorkDir": "/var/lib/docker/overlay/b2bf37103bb655f679122755650e0e889df049995cd472047ccb2c03a5cc8675/work"
}
},
"Mounts": [
{
"Source": "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/volumes/kubernetes.io~secret/default-token-n3djs",
"Destination": "/var/run/secrets/kubernetes.io/serviceaccount",
"Mode": "ro",
"RW": false,
"Propagation": "rprivate"
},
{
"Source": "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/etc-hosts",
"Destination": "/etc/hosts",
"Mode": "",
"RW": true,
"Propagation": "rprivate"
},
{
"Source": "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/containers/collabora-code/e1a7542c",
"Destination": "/dev/termination-log",
"Mode": "",
"RW": true,
"Propagation": "rprivate"
}
],
"Config": {
"Hostname": "sad-kitten-collabora-code-4174266487-ljrcx",
"Domainname": "",
"User": "0",
"AttachStdin": false,
"AttachStdout": false,
"AttachStderr": false,
"Tty": false,
"OpenStdin": false,
"StdinOnce": false,
"Env": [
"domain=office.*****",
"NFS_PROVISIONER_PORT_20048_TCP_PROTO=tcp",
"NFS_PROVISIONER_PORT_111_TCP=tcp://10.101.151.252:111",
"KUBERNETES_PORT_443_TCP_PORT=443",
"KUBERNETES_PORT_443_TCP_ADDR=10.96.0.1",
"NFS_PROVISIONER_SERVICE_PORT_RPCBIND=111",
"NFS_PROVISIONER_SERVICE_PORT_RPCBIND_UDP=111",
"NFS_PROVISIONER_PORT_2049_TCP_PORT=2049",
"KUBERNETES_PORT=tcp://10.96.0.1:443",
"JUICED_WOLVERINE_POSTGRESQL_SERVICE_PORT=5432",
"NFS_PROVISIONER_SERVICE_PORT_MOUNTD=20048",
"JUICED_WOLVERINE_POSTGRESQL_PORT_5432_TCP_PROTO=tcp",
"SAD_KITTEN_COLLABORA_CODE_PORT_9980_TCP_PROTO=tcp",
"SAD_KITTEN_COLLABORA_CODE_PORT=tcp://10.103.0.162:9980",
"NFS_PROVISIONER_SERVICE_PORT=2049",
"KUBERNETES_SERVICE_PORT_HTTPS=443",
"NFS_PROVISIONER_PORT_111_TCP_PROTO=tcp",
"NFS_PROVISIONER_PORT_111_UDP=udp://10.101.151.252:111",
"NFS_PROVISIONER_PORT_20048_TCP_PORT=20048",
"NFS_PROVISIONER_PORT_111_UDP_ADDR=10.101.151.252",
"JUICED_WOLVERINE_POSTGRESQL_PORT=tcp://10.107.46.110:5432",
"SAD_KITTEN_COLLABORA_CODE_PORT_9980_TCP_ADDR=10.103.0.162",
"NFS_PROVISIONER_PORT_2049_TCP=tcp://10.101.151.252:2049",
"NFS_PROVISIONER_PORT_20048_TCP_ADDR=10.101.151.252",
"NFS_PROVISIONER_PORT_111_TCP_PORT=111",
"NFS_PROVISIONER_PORT_111_TCP_ADDR=10.101.151.252",
"KUBERNETES_SERVICE_HOST=10.96.0.1",
"JUICED_WOLVERINE_POSTGRESQL_PORT_5432_TCP_PORT=5432",
"KUBERNETES_SERVICE_PORT=443",
"NFS_PROVISIONER_SERVICE_PORT_NFS=2049",
"MANAGEABLE_SHRIMP_PYDIO_PORT_80_TCP_PORT=80",
"SAD_KITTEN_COLLABORA_CODE_SERVICE_HOST=10.103.0.162",
"NFS_PROVISIONER_PORT=tcp://10.101.151.252:2049",
"NFS_PROVISIONER_PORT_20048_TCP=tcp://10.101.151.252:20048",
"MANAGEABLE_SHRIMP_PYDIO_SERVICE_PORT_HTTPS=80",
"MANAGEABLE_SHRIMP_PYDIO_PORT=tcp://10.96.157.238:80",
"JUICED_WOLVERINE_POSTGRESQL_SERVICE_HOST=10.107.46.110",
"NFS_PROVISIONER_PORT_111_UDP_PROTO=udp",
"KUBERNETES_PORT_443_TCP_PROTO=tcp",
"MANAGEABLE_SHRIMP_PYDIO_PORT_80_TCP_PROTO=tcp",
"JUICED_WOLVERINE_POSTGRESQL_PORT_5432_TCP=tcp://10.107.46.110:5432",
"JUICED_WOLVERINE_POSTGRESQL_PORT_5432_TCP_ADDR=10.107.46.110",
"SAD_KITTEN_COLLABORA_CODE_SERVICE_PORT_COLLABORA=9980",
"SAD_KITTEN_COLLABORA_CODE_PORT_9980_TCP_PORT=9980",
"KUBERNETES_PORT_443_TCP=tcp://10.96.0.1:443",
"JUICED_WOLVERINE_POSTGRESQL_SERVICE_PORT_POSTGRESQL=5432",
"SAD_KITTEN_COLLABORA_CODE_PORT_9980_TCP=tcp://10.103.0.162:9980",
"NFS_PROVISIONER_PORT_111_UDP_PORT=111",
"MANAGEABLE_SHRIMP_PYDIO_SERVICE_PORT=80",
"MANAGEABLE_SHRIMP_PYDIO_PORT_80_TCP=tcp://10.96.157.238:80",
"NFS_PROVISIONER_SERVICE_HOST=10.101.151.252",
"NFS_PROVISIONER_PORT_2049_TCP_PROTO=tcp",
"NFS_PROVISIONER_PORT_2049_TCP_ADDR=10.101.151.252",
"MANAGEABLE_SHRIMP_PYDIO_SERVICE_HOST=10.96.157.238",
"SAD_KITTEN_COLLABORA_CODE_SERVICE_PORT=9980",
"MANAGEABLE_SHRIMP_PYDIO_PORT_80_TCP_ADDR=10.96.157.238",
"PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
"LC_CTYPE=en_US.UTF-8"
],
"Cmd": [
"/bin/sh",
"-c",
"bash start-libreoffice.sh"
],
"Image": "sha256:8bc1d306ff41c5a358942308649f3eb76f2a9248d294793e08c1f532d4a04978",
"Volumes": null,
"WorkingDir": "",
"Entrypoint": null,
"OnBuild": null,
"Labels": {
"annotation.io.kubernetes.container.hash": "f031f7bb",
"annotation.io.kubernetes.container.ports": "[{\"containerPort\":9980,\"protocol\":\"TCP\"}]",
"annotation.io.kubernetes.container.restartCount": "211",
"annotation.io.kubernetes.container.terminationMessagePath": "/dev/termination-log",
"annotation.io.kubernetes.container.terminationMessagePolicy": "File",
"annotation.io.kubernetes.pod.terminationGracePeriod": "30",
"io.kubernetes.container.logpath": "/var/log/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/collabora-code_211.log",
"io.kubernetes.container.name": "collabora-code",
"io.kubernetes.docker.type": "container",
"io.kubernetes.pod.name": "sad-kitten-collabora-code-4174266487-ljrcx",
"io.kubernetes.pod.namespace": "default",
"io.kubernetes.pod.uid": "0d4f0519-64ea-11e7-ab20-7054d216f5a7",
"io.kubernetes.sandbox.id": "305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260"
}
},
"NetworkSettings": {
"Bridge": "",
"SandboxID": "",
"HairpinMode": false,
"LinkLocalIPv6Address": "",
"LinkLocalIPv6PrefixLen": 0,
"Ports": null,
"SandboxKey": "",
"SecondaryIPAddresses": null,
"SecondaryIPv6Addresses": null,
"EndpointID": "",
"Gateway": "",
"GlobalIPv6Address": "",
"GlobalIPv6PrefixLen": 0,
"IPAddress": "",
"IPPrefixLen": 0,
"IPv6Gateway": "",
"MacAddress": "",
"Networks": null
}
}
]
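A side note on the two `docker inspect` dumps in this thread: they report different `ExitCode` values, 137 in the first and 70 in the second. By the usual convention, a code above 128 means the process was terminated by signal number `code - 128`, while a lower code is a status the process chose itself. The helper below is only an illustrative sketch (`describe_exit_code` is a made-up name, not part of Docker or loolwsd), and the signal names it prints are those of the machine it runs on.

```python
import signal

def describe_exit_code(code: int) -> str:
    """Classify a container exit code using the common 128+N signal convention."""
    if code == 0:
        return "clean exit"
    if code > 128:
        number = code - 128
        try:
            # Look up the local name of the signal, e.g. 9 -> SIGKILL on Linux.
            name = signal.Signals(number).name
        except ValueError:
            name = f"signal {number}"
        return f"terminated by {name}"
    return f"exited on its own with status {code}"

for code in (137, 70):
    print(code, "->", describe_exit_code(code))
```

Read this way, exit status 70 suggests that the process gave up by itself rather than being killed from outside. That is an interpretation, not something the dump states.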
There's a 50 s time limit for the preinit stage during startup; on certain systems the process might fail to complete within that period.
"StartedAt": "2017-07-10T17:35:03.576286932Z",
"FinishedAt": "2017-07-10T17:35:54.150846205Z"
Yup, it can be that :)
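The two timestamps quoted above can be subtracted to check how close the run time is to 50 s. Docker prints nanoseconds, which Python's `strptime` does not parse, so this sketch truncates to microseconds first (`parse_docker_time` is just a helper name chosen here):

```python
from datetime import datetime, timezone

def parse_docker_time(stamp: str) -> datetime:
    """Parse a Docker UTC timestamp, truncating nanoseconds to microseconds."""
    base, _, fraction = stamp.rstrip("Z").partition(".")
    micros = (fraction + "000000")[:6]  # pad or cut to exactly 6 digits
    parsed = datetime.strptime(f"{base}.{micros}", "%Y-%m-%dT%H:%M:%S.%f")
    return parsed.replace(tzinfo=timezone.utc)

# Values copied from the "State" section of the inspect output.
started = parse_docker_time("2017-07-10T17:35:03.576286932Z")
finished = parse_docker_time("2017-07-10T17:35:54.150846205Z")

elapsed = (finished - started).total_seconds()
print(f"container ran for {elapsed:.2f} s")
```

This gives about 50.57 s between start and exit, which matches the 50 s limit closely.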
Where's that timeout defined? Do we have any chance to change that to something a little higher?
From the look of my log messages, the timeout is visible, but doesn't appear to be the cause of the issue. Allow me to elaborate:
#### FIRST ATTEMPT STARTS HERE ####
## START RSA KEY SECTION ##
Generating RSA private key, 2048 bit long modulus
..................+++
.+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
....................................................+++
.........................................................................................................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
## END RSA KEYS SECTION ##
loolforkit version details: 2.1.1 - 52f0568e
wsd-00025-00025 07:53:43.257922 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
Failed to fork child processes.
wsd-00025-00025 07:53:43.258517 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2590
Failed to fork child processes.
wsd-00025-00025 07:53:43.258980 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
wsd-00025-00025 07:53:43.259072 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260100 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260159 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260207 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260258 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260303 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260348 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260663 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260718 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
frk-00027-00027 07:53:43.263370 [ forkit ] FTL Pipe closed.| common/IoUtil.cpp:309
#### FIRST ATTEMPT ENDS HERE ####
## START RSA KEY SECTION ##
Generating RSA private key, 2048 bit long modulus
...................................................+++
.........................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
.............+++
........+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
### END RSA KEYS SECTION ###
loolforkit version details: 2.1.1 - 52f0568e
wsd-00025-00025 07:54:35.663806 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2443
Failed to fork child processes.
The log shows one full cycle of the init phase and part of a second.
It starts by generating the RSA private key, which succeeds, then it attempts to call loolforkit, which fails with "Failed to fork child processes."
Looking at the timestamps:
[RSA section]
07:53:43.257922 -> first fork attempt
07:53:43.263370 -> pipe closed (gives up trying)
--- Loop ---
[RSA section]
07:54:35.663806 -> second fork attempt
...etc.
There is a 52 second interval between the fork attempts, but each attempt fails far faster than that. The RSA key generation seems to account for most of the roughly 2 seconds beyond the 50 second timeout.
What I think is happening is that it forks, fails fast (43.263370-43.257922=0.005448s), and then waits out most of the 50 second timeout before restarting.
So the 50 second limit is not what is causing the error.
Something in loolforkit is.
But that's just my analysis of the log and I don't know the inner working enough to comment further.
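The timestamp arithmetic above can be reproduced with a short script. This is only a sketch: the three timestamps are copied from the log excerpt, and the helper name `seconds_between` is made up for illustration.

```python
from datetime import datetime

# Time-of-day format used in the loolwsd log lines (HH:MM:SS.microseconds).
FMT = "%H:%M:%S.%f"


def seconds_between(start: str, end: str) -> float:
    """Return the elapsed seconds between two same-day log timestamps."""
    delta = datetime.strptime(end, FMT) - datetime.strptime(start, FMT)
    return delta.total_seconds()


# Timestamps taken from the log excerpt above.
first_fork = "07:53:43.257922"   # first "Failed to fork child processes"
pipe_closed = "07:53:43.263370"  # forkit reports "Pipe closed"
second_fork = "07:54:35.663806"  # next "Failed to fork child processes"

fail_time = seconds_between(first_fork, pipe_closed)
cycle_time = seconds_between(first_fork, second_fork)

print(f"fork attempt -> pipe closed:     {fail_time:.6f} s")
print(f"first attempt -> second attempt: {cycle_time:.6f} s")
```

The first figure is a few milliseconds and the second is a little over 52 seconds, which fits the reading that the failure itself is fast and the rest of the cycle is waiting and key generation.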
Yes, looking at my own logs, I'd agree with you that the 50s timeout sits between the two attempts and doesn't seem to be the cause of the issue, but rather a symptom. So it's still unclear why forking the child processes fails so often (or even always for some users).
I got the same error. The solution is not to use /etc/default/docker to configure devicemapper; use systemd for this instead.
The solution mentioned in this post -> https://help.nextcloud.com/t/collabora-docker-capabilities-problem/4774 did solve my problem.
I have the same issue and am using overlay2 on Arch Linux. Has anyone solved this without switching to devicemapper?
I have 17 running services on my server. Is there any way to keep aufs rather than switching to devicemapper?
The timeout was increased in version 2.1.5, give it a try to see if that helps.
I have just tested with version 2.1.5, and the problem is still not resolved. It just takes longer to crash, and the docker image is not functional because the web app does not respond while it waits:
collabora_1 | 2017-11-19T21:20:16.228234714Z Generating RSA private key, 2048 bit long modulus
collabora_1 | 2017-11-19T21:20:16.232544498Z ...+++
collabora_1 | 2017-11-19T21:20:16.491489887Z ....................................................................................................................................................................................................................................+++
collabora_1 | 2017-11-19T21:20:16.491884384Z e is 65537 (0x10001)
collabora_1 | 2017-11-19T21:20:16.498557068Z Generating RSA private key, 2048 bit long modulus
collabora_1 | 2017-11-19T21:20:16.511605168Z ...........+++
collabora_1 | 2017-11-19T21:20:16.517997544Z .....+++
collabora_1 | 2017-11-19T21:20:16.518292207Z e is 65537 (0x10001)
collabora_1 | 2017-11-19T21:20:16.523152527Z Signature ok
collabora_1 | 2017-11-19T21:20:16.523161166Z subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
collabora_1 | 2017-11-19T21:20:16.523339588Z Getting CA Private Key
collabora_1 | 2017-11-19T21:20:16.547659073Z loolforkit version details: 2.1.5 - c9a8aed
collabora_1 | 2017-11-19T21:20:16.547718300Z frk-00029-00029 21:20:16.547480 [ forkit ] ERR Ignored setting RLIMIT_FSIZE to 18446744073709551615 bytes. (errno: Success)| common/Seccomp.cpp:284
collabora_1 | 2017-11-19T21:20:16.547730763Z frk-00029-00029 21:20:16.547492 [ forkit ] ERR Ignored setting RLIMIT_NOFILE to 18446744073709551615 files. (errno: Success)| common/Seccomp.cpp:306
collabora_1 | 2017-11-19T21:20:16.547734680Z frk-00029-00029 21:20:16.547510 [ forkit ] FTL Capability cap_sys_chroot is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1 | 2017-11-19T21:20:16.547738084Z frk-00029-00029 21:20:16.547515 [ forkit ] FTL Capability cap_mknod is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1 | 2017-11-19T21:20:16.547741316Z frk-00029-00029 21:20:16.547529 [ forkit ] FTL Capability cap_fowner is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1 | 2017-11-19T21:20:16.547744653Z FATAL: Capabilities are not set for the loolforkit program.
collabora_1 | 2017-11-19T21:20:16.547747632Z If you are on SLES11, please set 'file_caps=1' as kernel boot option.
collabora_1 | 2017-11-19T21:20:16.547750884Z
collabora_1 | 2017-11-19T21:24:26.543022381Z wsd-00027-00027 21:24:26.542853 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2623
collabora_1 | 2017-11-19T21:24:26.543045976Z FATAL: Failed to fork child processes.
collabora_1 | 2017-11-19T21:24:26.543058925Z wsd-00027-00027 21:24:26.542987 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2770
collabora_1 | 2017-11-19T21:24:26.543061270Z Failed to fork child processes.
collabora_1 | 2017-11-19T21:24:26.543202917Z wsd-00027-00027 21:24:26.543101 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543207344Z wsd-00027-00027 21:24:26.543123 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543408391Z <shutdown>-00027 21:24:26.543306 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543412585Z <shutdown>-00027 21:24:26.543312 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543422773Z <shutdown>-00027 21:24:26.543325 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543424946Z <shutdown>-00027 21:24:26.543329 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543426867Z <shutdown>-00027 21:24:26.543332 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543428816Z <shutdown>-00027 21:24:26.543335 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543583216Z <shutdown>-00027 21:24:26.543478 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1 | 2017-11-19T21:24:26.543587013Z <shutdown>-00027 21:24:26.543490 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
Still happening for me as well.
Hi, I see the issue on all my Debian setups with Docker, with versions from 2.1.2 to 3.0.0.0. I reproduced it inside a fresh Debian 9 VirtualBox machine, on 2 Debian 9 machines, and with a Debian package installation. It used to work for me until an update in November.
For Docker I tried the overlay2 and overlay storage drivers. So far the solution seems to be to switch the storage driver to devicemapper (I have not tried it).
The fact that I see the issue with the Debian package makes me think the error does not come from Docker but from something else, like a system library or a kernel issue.
Can someone else confirm that both the package install and Docker fail on the same machine?
The problem is in the CollaboraOnline code.
https://github.com/CollaboraOnline/Docker-CODE/issues/10#issuecomment-345551351: this seems to be a different issue, the required capabilities are missing.
https://github.com/CollaboraOnline/Docker-CODE/issues/10#issuecomment-343110034, https://github.com/CollaboraOnline/Docker-CODE/issues/10#issuecomment-355846689: I've had no issues with aufs and overlay2 storage drivers on Ubuntu 17.04 / 17.10.
There can be different causes, and in other cases the Collabora Online log in the container should give further errors, e.g. this and this are both unrelated issues, but both lead to "FTL Failed to fork child processes".
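To tell these cases apart in a pasted log, a small scan for the two message types can help. This is a rough sketch, not a Collabora tool: the function name `classify` and the result keys are invented here, and the regular expressions only match the exact message wording seen in the logs in this thread.

```python
import re

# Patterns based on the message wording quoted in this thread.
CAP_RE = re.compile(r"Capability (cap_\w+) is not set for the loolforkit program")
FORK_RE = re.compile(r"Failed to fork child processes")


def classify(log_text: str) -> dict:
    """Report whether forking failed and which capabilities were reported missing."""
    return {
        "fork_failed": bool(FORK_RE.search(log_text)),
        "missing_capabilities": sorted(set(CAP_RE.findall(log_text))),
    }


# Sample lines copied from the 2.1.5 log earlier in the thread.
sample = """\
frk-00029-00029 21:20:16.547510 [ forkit ] FTL Capability cap_sys_chroot is not set for the loolforkit program.| kit/ForKit.cpp:168
frk-00029-00029 21:20:16.547515 [ forkit ] FTL Capability cap_mknod is not set for the loolforkit program.| kit/ForKit.cpp:168
frk-00029-00029 21:20:16.547529 [ forkit ] FTL Capability cap_fowner is not set for the loolforkit program.| kit/ForKit.cpp:168
wsd-00027-00027 21:24:26.542853 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2623
"""

result = classify(sample)
print(result)
```

A log with a fork failure but an empty `missing_capabilities` list would point away from the capabilities problem and toward one of the other causes.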
First off, I would like to say I am having the same or at least very similar issues with the Docker container restarting constantly. I am running it on DigitalOcean on Ubuntu 16.04, and it is the only installed program beyond the base system and Docker requirements.
Second, issues like this are the precise reason that open source projects either don't get wide adoption or outright fail! This issue has been open for almost a year and is still not fixed! I'd be more than happy to pay for Collabora CODE IF they could get it working and stable, but I really don't see that happening.
IDK, maybe I'm just old and angry, but when you are shipping a product you don't just update it without testing and push the update out; and after people have reported major or critical issues, you roll it back to the previous version rather than leaving it on the broken one. Collabora really needs to get their S together!
Okay; a probable fix. Can somebody try passing this environment variable (SLEEPFORDEBUGGER=0) and see if it works?
You can pass it like -e "SLEEPFORDEBUGGER=0" to the docker run command, or by some other means if you are using docker-compose, etc.
If it doesn't fix the issue, trace level logs (by changing the entry from warning to trace in /etc/loolwsd/loolwsd.xml; also consider changing the flush attribute in
With this variable, the container has stopped restarting continuously, but it gets stuck during the initialisation process:
collabora_1 | Generating RSA private key, 2048 bit long modulus
collabora_1 | .......+++
collabora_1 | ...................................................................................+++
collabora_1 | e is 65537 (0x10001)
collabora_1 | Generating RSA private key, 2048 bit long modulus
collabora_1 | ........................................+++
collabora_1 | ...............................................................................................................+++
collabora_1 | e is 65537 (0x10001)
collabora_1 | Signature ok
collabora_1 | subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
collabora_1 | Getting CA Private Key
collabora_1 | frk-00029-00029 21:27:42.871039 [ forkit ] FTL Capability cap_sys_chroot is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1 | frk-00029-00029 21:27:42.871047 [ forkit ] FTL Capability cap_mknod is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1 | frk-00029-00029 21:27:42.871051 [ forkit ] FTL Capability cap_fowner is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1 | FATAL: Capabilities are not set for the loolforkit program.
collabora_1 | If you are on SLES11, please set 'file_caps=1' as kernel boot option.
collabora_1 |
collabora_1 | frk-00029-00029 21:27:42.871057 [ forkit ] WRN Waking up dead poll thread [delay_poll], started: false, finished: false| net/Socket.hpp:507
collabora_1 | frk-00029-00029 21:27:42.871074 [ forkit ] WRN Waking up dead poll thread [delay_poll], started: false, finished: false| net/Socket.hpp:507
collabora_1 | loolforkit version details: 3.0.0 - 81c8935
Of course, the server is not started and nothing answers the HTTPS requests.
Here is a log in "trace" mode: https://framabin.org/?4fe8437720b162a9#O/H659YpD9U/BAMgKRxQSXHN6DIVl2Wfpth8wlOHVaU=
I don't know why capabilities are not set on the loolforkit binary in your docker container. As mentioned in the error message, it may depend on your kernel boot options, etc.
And you didn't set the flush to true in logging, I think. Did you? Anyways, trace level logs are irrelevant here since it's clear what the problem is -- you need to figure out why capabilities are not set in the first place.
Hi,
Still no success after setting flush to true and using -SLEEPFORDEBUGGER=0 in docker-compose.
My kernel capabilities are okay.
collabora | frk-00028-00028 14:10:16.490045 [ forkit ] INF Have capability cap_sys_chroot| kit/ForKit.cpp:180
collabora | frk-00028-00028 14:10:16.490066 [ forkit ] INF Have capability cap_mknod| kit/ForKit.cpp:180
collabora | frk-00028-00028 14:10:16.490145 [ forkit ] INF Have capability cap_fowner| kit/ForKit.cpp:180
I can't see a clue in the trace logs.
I can confirm the flush logging was set to true.
I am able to run the docker image on a CentOS 7.4 host, but I cannot launch it on a Debian 9.3 host. The docker command lines are identical:
docker run -t -d -p 127.0.0.1:9980:9980 -e 'domain=nextcloud\\.example\\.com' --restart always --cap-add MKNOD collabora/code
I have used docker inspect to look for differences, and the only obvious one is that CentOS is using overlay2 while Debian is using aufs.
The kernels used and docker configurations are the default ones for each distribution. I do not understand why the capabilities are not set when run on a Debian host.
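Comparing two docker inspect outputs by eye is error-prone, so a key-by-key diff may help. The sketch below is only an illustration: `flatten` and `diff_inspect` are hypothetical helper names, and the two sample dictionaries are cut-down stand-ins that reflect the overlay2/aufs difference described above, not real captured output.

```python
def flatten(obj, prefix=""):
    """Flatten nested dicts/lists into {"A.B[0].C": value} pairs."""
    items = {}
    if isinstance(obj, dict) and obj:
        for key, value in obj.items():
            items.update(flatten(value, f"{prefix}.{key}" if prefix else key))
    elif isinstance(obj, list) and obj:
        for index, value in enumerate(obj):
            items.update(flatten(value, f"{prefix}[{index}]"))
    else:
        # Scalars and empty containers are recorded as leaves.
        items[prefix] = obj
    return items


def diff_inspect(a, b):
    """Return {path: (value_in_a, value_in_b)} for every path that differs."""
    flat_a, flat_b = flatten(a), flatten(b)
    return {
        path: (flat_a.get(path), flat_b.get(path))
        for path in sorted(set(flat_a) | set(flat_b))
        if flat_a.get(path) != flat_b.get(path)
    }


# Illustrative, cut-down inspect data for the two hosts (assumed values).
centos = {"GraphDriver": {"Name": "overlay2"}, "HostConfig": {"CapAdd": ["MKNOD"]}}
debian = {"GraphDriver": {"Name": "aufs"}, "HostConfig": {"CapAdd": ["MKNOD"]}}

differences = diff_inspect(centos, debian)
for path, (left, right) in differences.items():
    print(f"{path}: {left!r} != {right!r}")
```

With real data, each side would be loaded from a saved `docker inspect` dump using `json.load(f)[0]`, since the command prints a JSON array with one object per container. Fields such as IDs and timestamps will always differ and can be ignored.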
I'm using Debian 9.3 with kernel 4.12 and overlay2, and it does not work even with capabilities enabled. I installed the kernel manually a while ago: Linux 4.12.0-041200-generic #201707022031.
@jribal can you paste your trace logs as well?