Docker-CODE icon indicating copy to clipboard operation
Docker-CODE copied to clipboard

Docker restarts infinitely: "Failed to fork child process"

Open ripper17 opened this issue 7 years ago • 50 comments

Reproducible: Always

Steps to reproduce:

docker pull collabora/code
docker run -l debug -t -d -p My.IP.Add.ress:9980:9980 -e "domain=www\\.my\\-domain\\.tld\|my\-domain\\.tld" --restart always --cap-add MKNOD collabora/code

Expected result: Docker should start up fine

Actual result: Docker will restart over and over (filling up HDD space)

Logs: docker -l debug logs -f DOCKERID

Generating RSA private key, 2048 bit long modulus .......................................+++ ......................................................................................................................+++ e is 65537 (0x10001) Signature ok subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost Getting CA Private Key loolforkit version details: 2.1.0 - b2e736a3 wsd-00024-00024 06:54:08.272296 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2405 Failed to fork child processes. wsd-00024-00024 06:54:08.274339 [ loolwsd ] FTL Failed to fork child processes.| wsd/LOOLWSD.cpp:2551 Failed to fork child processes. wsd-00024-00024 06:54:08.278844 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:445 wsd-00024-00024 06:54:08.279161 [ loolwsd ] WRN Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:445 -00024 06:54:08.283357 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:445 -00024 06:54:08.283444 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:445 -00024 06:54:08.283526 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:445 -00024 06:54:08.283566 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:445 -00024 06:54:08.283630 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:445 -00024 06:54:08.283705 [ loolwsd ] WRN Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:445 -00024 06:54:08.284083 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:445 -00024 06:54:08.284149 [ loolwsd ] WRN Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:445 frk-00026-00026 06:54:08.289767 [ forkit ] FTL Pipe closed.| common/IoUtil.cpp:309

System information uname -a

4.4.0-78-generic #99-Ubuntu SMP Thu Apr 27 15:29:09 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

free -m

           total        used        free      shared  buff/cache   available

Mem: 3951 289 924 46 2737 3299 Swap: 1023 0 1023

This is more or less a cross-post from https://help.nextcloud.com/t/restarting-docker-results-in-failed-to-fork-child-processes/12209

ripper17 avatar May 23 '17 10:05 ripper17

After updating to latest docker image it is working again for me - if anybody has done something on purpose to fix this: Thanks :-)

ripper17 avatar Jun 01 '17 07:06 ripper17

I'm sorry to post on an closed issue, but this issue seems to still be here or to have returned since OP commented that an update fixed his situation. I am using a docker image pulled today. System is Arch Linux.

#uname -r
4.9.30-1-lts
#pacman -Si docker
Repository      : community
Name            : docker
Version         : 1:17.05.0-1
[…]
Build Date      : Fri 05 May 2017 10:42:16 PM CEST
Validated By    : MD5 Sum  SHA-256 Sum  Signature
Installed Size  : 88.17 MiB
Packager        : Sé
[
    {
        "Id": "38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a",
        "Created": "2017-06-04T07:52:49.45273884Z",
        "Path": "/bin/sh",
        "Args": [
            "-c",
            "bash start-libreoffice.sh"
        ],
        "State": {
            "Status": "exited",
            "Running": false,
            "Paused": false,
            "Restarting": false,
            "OOMKilled": false,
            "Dead": false,
            "Pid": 0,
            "ExitCode": 137,
            "Error": "",
            "StartedAt": "2017-06-04T07:57:17.788074925Z",
            "FinishedAt": "2017-06-04T12:03:22.069898922Z"
        },
        "Image": "sha256:d3757c5469234839e2d58dfaee49cc8466d61f87074961391986e861c6dbf14f",
        "ResolvConfPath": "/var/lib/docker/containers/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a/resolv.conf",
        "HostnamePath": "/var/lib/docker/containers/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a/hostname",
        "HostsPath": "/var/lib/docker/containers/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a/hosts",
        "LogPath": "/var/lib/docker/containers/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a/38ea21ae35b6214d779a1b23208d8a75bc435ff8315dbb57e20d02498bc0333a-json.log",
        "Name": "/wizardly_borg",
        "RestartCount": 5,
        "Driver": "overlay2",
        "MountLabel": "",
        "ProcessLabel": "",
        "AppArmorProfile": "",
        "ExecIDs": null,
        "HostConfig": {
            "Binds": null,
            "ContainerIDFile": "",
            "LogConfig": {
                "Type": "json-file",
                "Config": {}
            },
            "NetworkMode": "default",
            "PortBindings": {
                "9980/tcp": [
                    {
                        "HostIp": "127.0.0.1",
                        "HostPort": "9980"
                    }
                ]
            },
            "RestartPolicy": {
                "Name": "always",
                "MaximumRetryCount": 0
            },
            "AutoRemove": false,
            "VolumeDriver": "",
            "VolumesFrom": null,
            "CapAdd": [
                "MKNOD"
            ],
            "CapDrop": null,
            "Dns": [],
            "DnsOptions": [],
            "DnsSearch": [],
            "ExtraHosts": null,
            "GroupAdd": null,
            "IpcMode": "",
            "Cgroup": "",
            "Links": null,
            "OomScoreAdj": 0,
            "PidMode": "",
            "Privileged": false,
            "PublishAllPorts": false,
            "ReadonlyRootfs": false,
            "SecurityOpt": null,
            "UTSMode": "",
            "UsernsMode": "",
            "ShmSize": 67108864,
            "Runtime": "runc",
            "ConsoleSize": [
                0,
                0
            ],
            "Isolation": "",
            "CpuShares": 0,
            "Memory": 0,
            "NanoCpus": 0,
            "CgroupParent": "",
            "BlkioWeight": 0,
            "BlkioWeightDevice": null,
            "BlkioDeviceReadBps": null,
            "BlkioDeviceWriteBps": null,
            "BlkioDeviceReadIOps": null,
            "BlkioDeviceWriteIOps": null,
            "CpuPeriod": 0,
            "CpuQuota": 0,
            "CpuRealtimePeriod": 0,
            "CpuRealtimeRuntime": 0,
            "CpusetCpus": "",
            "CpusetMems": "",
            "Devices": [],
            "DeviceCgroupRules": null,
            "DiskQuota": 0,
            "KernelMemory": 0,
            "MemoryReservation": 0,
            "MemorySwap": 0,
            "MemorySwappiness": -1,
            "OomKillDisable": false,
            "PidsLimit": 0,
            "Ulimits": null,
            "CpuCount": 0,
            "CpuPercent": 0,
            "IOMaximumIOps": 0,
            "IOMaximumBandwidth": 0
        },
        "GraphDriver": {
            "Data": {
                "LowerDir": "/var/lib/docker/overlay2/a6e3472d865a223330d01c5b078369f206439bea8c8f99e1dc600ca483ffaee0-init/diff:/var/lib/docker/overlay2/b14335218e66a1ea8940ef671bec2f7c6f7030a9f184765af6d9f60ba6c13e94/diff:/var/lib/docker/overlay2/ad5642adbe8566f711fcec95ae4cab5e62ce34cc1df63d108940985415cd149e/diff:/var/lib/docker/overlay2/a30640e1e0bc41324e49d69e0181245bdf94f15dea4520841cddf3fba9c0ccb6/diff:/var/lib/docker/overlay2/24f88ff1146028958a4b0c9467b02afaf052d239616e45a8e808f840e4781113/diff:/var/lib/docker/overlay2/411c16332e95176fb553363ff8248db4c8e5a3464f757ba63726b7f9f92db4e9/diff:/var/lib/docker/overlay2/685233aa866ef5afa2823e5c3d4512df097d59dd9d0bde8253f441735e37535a/diff:/var/lib/docker/overlay2/752fb968056ab7eca54814d6f09a0fe7a44fd37e7be8774cd42fca79733d3b4d/diff:/var/lib/docker/overlay2/d886180b45b8e08fe76f25db152f30e5f2a9af75aaae5f33634881ade9af904a/diff",
                "MergedDir": "/var/lib/docker/overlay2/a6e3472d865a223330d01c5b078369f206439bea8c8f99e1dc600ca483ffaee0/merged",
                "UpperDir": "/var/lib/docker/overlay2/a6e3472d865a223330d01c5b078369f206439bea8c8f99e1dc600ca483ffaee0/diff",
                "WorkDir": "/var/lib/docker/overlay2/a6e3472d865a223330d01c5b078369f206439bea8c8f99e1dc600ca483ffaee0/work"
            },
            "Name": "overlay2"
        },
        "Mounts": [],
        "Config": {
            "Hostname": "38ea21ae35b6",
            "Domainname": "",
            "User": "",
            "AttachStdin": false,
            "AttachStdout": false,
            "AttachStderr": false,
            "ExposedPorts": {
                "9980/tcp": {}
            },
            "Tty": true,
            "OpenStdin": false,
            "StdinOnce": false,
            "Env": [
                "domain=subdomain\\.domain\\.org",
                "PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
                "LC_CTYPE=en_US.UTF-8"
            ],
            "Cmd": [
                "/bin/sh",
                "-c",
                "bash start-libreoffice.sh"
            ],
            "ArgsEscaped": true,
            "Image": "collabora/code",
            "Volumes": null,
            "WorkingDir": "",
            "Entrypoint": null,
            "OnBuild": null,
            "Labels": {}
        },
        "NetworkSettings": {
            "Bridge": "",
            "SandboxID": "ffeaab1f03863e71fcb167480dd9c100cb698557378f991972915b0a3fef004d",
            "HairpinMode": false,
            "LinkLocalIPv6Address": "",
            "LinkLocalIPv6PrefixLen": 0,
            "Ports": {},
            "SandboxKey": "/var/run/docker/netns/ffeaab1f0386",
            "SecondaryIPAddresses": null,
            "SecondaryIPv6Addresses": null,
            "EndpointID": "",
            "Gateway": "",
            "GlobalIPv6Address": "",
            "GlobalIPv6PrefixLen": 0,
            "IPAddress": "",
            "IPPrefixLen": 0,
            "IPv6Gateway": "",
            "MacAddress": "",
            "Networks": {
                "bridge": {
                    "IPAMConfig": null,
                    "Links": null,
                    "Aliases": null,
                    "NetworkID": "e55ad2c38cb147a8209e284ef5dcb5078a23641207cd25f490900f3890fbf328",
                    "EndpointID": "",
                    "Gateway": "",
                    "IPAddress": "",
                    "IPPrefixLen": 0,
                    "IPv6Gateway": "",
                    "GlobalIPv6Address": "",
                    "GlobalIPv6PrefixLen": 0,
                    "MacAddress": ""
                }
            }
        }
    }
]
#docker -l debug logs CONTAINER_ID
Generating RSA private key, 2048 bit long modulus
..................+++
.+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
....................................................+++
.........................................................................................................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
[1m[31mwsd-00025-00025 07:53:43.257922 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2443[0m
Failed to fork child processes.
[1m[31mwsd-00025-00025 07:53:43.258517 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2590[0m
Failed to fork child processes.
[1m[33mwsd-00025-00025 07:53:43.258980 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
[1m[33mwsd-00025-00025 07:53:43.259072 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
<shutdown>-00025 07:53:43.260100 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260159 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260207 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260258 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260303 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260348 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260663 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260718 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
[1m[31mfrk-00027-00027 07:53:43.263370 [ forkit ] FTL  Pipe closed.| common/IoUtil.cpp:309[0m
Generating RSA private key, 2048 bit long modulus
...................................................+++
.........................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
.............+++
........+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
[1m[31mwsd-00025-00025 07:54:35.663806 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2443[0m
Failed to fork child processes.
[1m[31mwsd-00025-00025 07:54:35.664365 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2590[0m
Failed to fork child processes.
[1m[33mwsd-00025-00025 07:54:35.664815 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
[1m[33mwsd-00025-00025 07:54:35.664913 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
<shutdown>-00025 07:54:35.665923 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.665981 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666022 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666069 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666113 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666156 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666432 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:54:35.666495 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
[1m[31mfrk-00027-00027 07:54:35.669392 [ forkit ] FTL  Pipe closed.| common/IoUtil.cpp:309[0m
Generating RSA private key, 2048 bit long modulus
..............................................................+++
..........................................................................................................................................................................................................................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
.........................................+++
......................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
[1m[31mwsd-00025-00025 07:55:29.282207 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2443[0m
Failed to fork child processes.
[1m[31mwsd-00025-00025 07:55:29.282810 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2590[0m
Failed to fork child processes.
[1m[33mwsd-00025-00025 07:55:29.283304 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
[1m[33mwsd-00025-00025 07:55:29.283410 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
<shutdown>-00025 07:55:29.284533 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284593 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284685 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284762 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284833 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.284911 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.285297 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:55:29.285381 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
[1m[31mfrk-00027-00027 07:55:29.287970 [ forkit ] FTL  Pipe closed.| common/IoUtil.cpp:309[0m
Generating RSA private key, 2048 bit long modulus
.......................+++
.........................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
..............................................................................+++
..................................................................................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
[1m[31mwsd-00024-00024 07:56:22.727144 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2443[0m
Failed to fork child processes.
[1m[31mwsd-00024-00024 07:56:22.727668 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2590[0m
Failed to fork child processes.
[1m[33mwsd-00024-00024 07:56:22.728187 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
[1m[33mwsd-00024-00024 07:56:22.728290 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
<shutdown>-00024 07:56:22.729318 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729380 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729465 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729539 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729609 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.729676 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.730004 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00024 07:56:22.730085 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
[1m[31mfrk-00026-00026 07:56:22.732483 [ forkit ] FTL  Pipe closed.| common/IoUtil.cpp:309[0m
Generating RSA private key, 2048 bit long modulus
.............................................................................+++
.............................................................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
............+++
..+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
[1m[31mwsd-00025-00025 07:57:15.609266 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2443[0m
Failed to fork child processes.
[1m[31mwsd-00025-00025 07:57:15.609812 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2590[0m
Failed to fork child processes.
[1m[33mwsd-00025-00025 07:57:15.610269 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
[1m[33mwsd-00025-00025 07:57:15.610380 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507[0m
<shutdown>-00025 07:57:15.611411 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611471 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611518 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611567 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611613 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611657 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.611993 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:57:15.612049 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
[1m[31mfrk-00027-00027 07:57:15.614443 [ forkit ] FTL  Pipe closed.| common/IoUtil.cpp:309[0m
Generating RSA private key, 2048 bit long modulus
...............+++
......+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
.......................................................................+++
....+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
loolforkit version details: 2.1.1 - 52f0568e
office version details: { "ProductName": "Collabora Office", "ProductVersion": "5.3", "ProductExtension": ".10.13", "BuildId": "e639c2ad9c05a793b16db13bd879342ed75dcf95" }
[1m[33mwsd-00025-00026 07:57:46.504191 [ prisoner_poll ] WRN  ForKit not responsive for 28299 ms forking 1 children. Resetting.| wsd/LOOLWSD.cpp:338[0m

I'm available to provide any extra info or perform test if needed. Thanks for any insight. Regards, Mark.

Marcool04 avatar Jun 04 '17 12:06 Marcool04

this still appears broken, docker image just keeps restarting for me too.

zeigerpuppy avatar Jul 07 '17 14:07 zeigerpuppy

same here

ekozan avatar Jul 09 '17 20:07 ekozan

I'm reopening this, as this still seems to affect many people. In fact, I've not dared to restart my own docker since it is currently running and there's so many reports of it not working for others that I don't want to take any risks.

You might try to restart it over and over again and see if it is some kind of timing / race-condition where it might work 1 out of 20 times or something...

Again, I've not change any of my setup and it suddenly worked after trying countless times...

ripper17 avatar Jul 10 '17 06:07 ripper17

Status : Waiting: CrashLoopBackOff Restarts : 210

:D it's not 1/20 :D 0/210 for me

[
    {
        "Id": "ea1e806cd2cea55598aa7eed2a6efc3a196e17b941c42c362dd2a17e1db48e76",
        "Created": "2017-07-10T17:35:03.395726169Z",
        "Path": "/bin/sh",
        "Args": [
            "-c",
            "bash start-libreoffice.sh"
        ],
        "State": {
            "Status": "exited",
            "Running": false,
            "Paused": false,
            "Restarting": false,
            "OOMKilled": false,
            "Dead": false,
            "Pid": 0,
            "ExitCode": 70,
            "Error": "",
            "StartedAt": "2017-07-10T17:35:03.576286932Z",
            "FinishedAt": "2017-07-10T17:35:54.150846205Z"
        },
        "Image": "sha256:8bc1d306ff41c5a358942308649f3eb76f2a9248d294793e08c1f532d4a04978",
        "ResolvConfPath": "/var/lib/docker/containers/305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260/resolv.conf",
        "HostnamePath": "/var/lib/docker/containers/305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260/hostname",
        "HostsPath": "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/etc-hosts",
        "LogPath": "/var/lib/docker/containers/ea1e806cd2cea55598aa7eed2a6efc3a196e17b941c42c362dd2a17e1db48e76/ea1e806cd2cea55598aa7eed2a6efc3a196e17b941c42c362dd2a17e1db48e76-json.log",
        "Name": "/k8s_collabora-code_sad-kitten-collabora-code-4174266487-ljrcx_default_0d4f0519-64ea-11e7-ab20-7054d216f5a7_211",
        "RestartCount": 0,
        "Driver": "overlay",
        "MountLabel": "",
        "ProcessLabel": "",
        "AppArmorProfile": "",
        "ExecIDs": null,
        "HostConfig": {
            "Binds": [
                "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/volumes/kubernetes.io~secret/default-token-n3djs:/var/run/secrets/kubernetes.io/serviceaccount:ro",
                "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/etc-hosts:/etc/hosts",
                "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/containers/collabora-code/e1a7542c:/dev/termination-log"
            ],
            "ContainerIDFile": "",
            "LogConfig": {
                "Type": "json-file",
                "Config": null
            },
            "NetworkMode": "container:305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260",
            "PortBindings": null,
            "RestartPolicy": {
                "Name": "",
                "MaximumRetryCount": 0
            },
            "AutoRemove": false,
            "VolumeDriver": "",
            "VolumesFrom": null,
            "CapAdd": [
                "MKNOD"
            ],
            "CapDrop": null,
            "Dns": null,
            "DnsOptions": null,
            "DnsSearch": null,
            "ExtraHosts": null,
            "GroupAdd": null,
            "IpcMode": "container:305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260",
            "Cgroup": "",
            "Links": null,
            "OomScoreAdj": 992,
            "PidMode": "",
            "Privileged": false,
            "PublishAllPorts": false,
            "ReadonlyRootfs": false,
            "SecurityOpt": [
                "seccomp=unconfined"
            ],
            "StorageOpt": null,
            "UTSMode": "",
            "UsernsMode": "",
            "ShmSize": 67108864,
            "ConsoleSize": [
                0,
                0
            ],
            "Isolation": "",
            "CpuShares": 102,
            "Memory": 1602224128,
            "CgroupParent": "/kubepods/burstable/pod0d4f0519-64ea-11e7-ab20-7054d216f5a7",
            "BlkioWeight": 0,
            "BlkioWeightDevice": null,
            "BlkioDeviceReadBps": null,
            "BlkioDeviceWriteBps": null,
            "BlkioDeviceReadIOps": null,
            "BlkioDeviceWriteIOps": null,
            "CpuPeriod": 100000,
            "CpuQuota": 100000,
            "CpusetCpus": "",
            "CpusetMems": "",
            "Devices": [],
            "DiskQuota": 0,
            "KernelMemory": 0,
            "MemoryReservation": 0,
            "MemorySwap": -1,
            "MemorySwappiness": -1,
            "OomKillDisable": false,
            "PidsLimit": 0,
            "Ulimits": null,
            "CpuCount": 0,
            "CpuPercent": 0,
            "BlkioIOps": 0,
            "BlkioBps": 0,
            "SandboxSize": 0
        },
        "GraphDriver": {
            "Name": "overlay",
            "Data": {
                "LowerDir": "/var/lib/docker/overlay/3d6352145379228a0df0a5d17d6b13bae0637c567d1886122a4a8f538e9f2f16/root",
                "MergedDir": "/var/lib/docker/overlay/b2bf37103bb655f679122755650e0e889df049995cd472047ccb2c03a5cc8675/merged",
                "UpperDir": "/var/lib/docker/overlay/b2bf37103bb655f679122755650e0e889df049995cd472047ccb2c03a5cc8675/upper",
                "WorkDir": "/var/lib/docker/overlay/b2bf37103bb655f679122755650e0e889df049995cd472047ccb2c03a5cc8675/work"
            }
        },
        "Mounts": [
            {
                "Source": "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/volumes/kubernetes.io~secret/default-token-n3djs",
                "Destination": "/var/run/secrets/kubernetes.io/serviceaccount",
                "Mode": "ro",
                "RW": false,
                "Propagation": "rprivate"
            },
            {
                "Source": "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/etc-hosts",
                "Destination": "/etc/hosts",
                "Mode": "",
                "RW": true,
                "Propagation": "rprivate"
            },
            {
                "Source": "/var/lib/kubelet/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/containers/collabora-code/e1a7542c",
                "Destination": "/dev/termination-log",
                "Mode": "",
                "RW": true,
                "Propagation": "rprivate"
            }
        ],
        "Config": {
            "Hostname": "sad-kitten-collabora-code-4174266487-ljrcx",
            "Domainname": "",
            "User": "0",
            "AttachStdin": false,
            "AttachStdout": false,
            "AttachStderr": false,
            "Tty": false,
            "OpenStdin": false,
            "StdinOnce": false,
            "Env": [
                "domain=office.*****",
                "NFS_PROVISIONER_PORT_20048_TCP_PROTO=tcp",
                "NFS_PROVISIONER_PORT_111_TCP=tcp://10.101.151.252:111",
                "KUBERNETES_PORT_443_TCP_PORT=443",
                "KUBERNETES_PORT_443_TCP_ADDR=10.96.0.1",
                "NFS_PROVISIONER_SERVICE_PORT_RPCBIND=111",
                "NFS_PROVISIONER_SERVICE_PORT_RPCBIND_UDP=111",
                "NFS_PROVISIONER_PORT_2049_TCP_PORT=2049",
                "KUBERNETES_PORT=tcp://10.96.0.1:443",
                "JUICED_WOLVERINE_POSTGRESQL_SERVICE_PORT=5432",
                "NFS_PROVISIONER_SERVICE_PORT_MOUNTD=20048",
                "JUICED_WOLVERINE_POSTGRESQL_PORT_5432_TCP_PROTO=tcp",
                "SAD_KITTEN_COLLABORA_CODE_PORT_9980_TCP_PROTO=tcp",
                "SAD_KITTEN_COLLABORA_CODE_PORT=tcp://10.103.0.162:9980",
                "NFS_PROVISIONER_SERVICE_PORT=2049",
                "KUBERNETES_SERVICE_PORT_HTTPS=443",
                "NFS_PROVISIONER_PORT_111_TCP_PROTO=tcp",
                "NFS_PROVISIONER_PORT_111_UDP=udp://10.101.151.252:111",
                "NFS_PROVISIONER_PORT_20048_TCP_PORT=20048",
                "NFS_PROVISIONER_PORT_111_UDP_ADDR=10.101.151.252",
                "JUICED_WOLVERINE_POSTGRESQL_PORT=tcp://10.107.46.110:5432",
                "SAD_KITTEN_COLLABORA_CODE_PORT_9980_TCP_ADDR=10.103.0.162",
                "NFS_PROVISIONER_PORT_2049_TCP=tcp://10.101.151.252:2049",
                "NFS_PROVISIONER_PORT_20048_TCP_ADDR=10.101.151.252",
                "NFS_PROVISIONER_PORT_111_TCP_PORT=111",
                "NFS_PROVISIONER_PORT_111_TCP_ADDR=10.101.151.252",
                "KUBERNETES_SERVICE_HOST=10.96.0.1",
                "JUICED_WOLVERINE_POSTGRESQL_PORT_5432_TCP_PORT=5432",
                "KUBERNETES_SERVICE_PORT=443",
                "NFS_PROVISIONER_SERVICE_PORT_NFS=2049",
                "MANAGEABLE_SHRIMP_PYDIO_PORT_80_TCP_PORT=80",
                "SAD_KITTEN_COLLABORA_CODE_SERVICE_HOST=10.103.0.162",
                "NFS_PROVISIONER_PORT=tcp://10.101.151.252:2049",
                "NFS_PROVISIONER_PORT_20048_TCP=tcp://10.101.151.252:20048",
                "MANAGEABLE_SHRIMP_PYDIO_SERVICE_PORT_HTTPS=80",
                "MANAGEABLE_SHRIMP_PYDIO_PORT=tcp://10.96.157.238:80",
                "JUICED_WOLVERINE_POSTGRESQL_SERVICE_HOST=10.107.46.110",
                "NFS_PROVISIONER_PORT_111_UDP_PROTO=udp",
                "KUBERNETES_PORT_443_TCP_PROTO=tcp",
                "MANAGEABLE_SHRIMP_PYDIO_PORT_80_TCP_PROTO=tcp",
                "JUICED_WOLVERINE_POSTGRESQL_PORT_5432_TCP=tcp://10.107.46.110:5432",
                "JUICED_WOLVERINE_POSTGRESQL_PORT_5432_TCP_ADDR=10.107.46.110",
                "SAD_KITTEN_COLLABORA_CODE_SERVICE_PORT_COLLABORA=9980",
                "SAD_KITTEN_COLLABORA_CODE_PORT_9980_TCP_PORT=9980",
                "KUBERNETES_PORT_443_TCP=tcp://10.96.0.1:443",
                "JUICED_WOLVERINE_POSTGRESQL_SERVICE_PORT_POSTGRESQL=5432",
                "SAD_KITTEN_COLLABORA_CODE_PORT_9980_TCP=tcp://10.103.0.162:9980",
                "NFS_PROVISIONER_PORT_111_UDP_PORT=111",
                "MANAGEABLE_SHRIMP_PYDIO_SERVICE_PORT=80",
                "MANAGEABLE_SHRIMP_PYDIO_PORT_80_TCP=tcp://10.96.157.238:80",
                "NFS_PROVISIONER_SERVICE_HOST=10.101.151.252",
                "NFS_PROVISIONER_PORT_2049_TCP_PROTO=tcp",
                "NFS_PROVISIONER_PORT_2049_TCP_ADDR=10.101.151.252",
                "MANAGEABLE_SHRIMP_PYDIO_SERVICE_HOST=10.96.157.238",
                "SAD_KITTEN_COLLABORA_CODE_SERVICE_PORT=9980",
                "MANAGEABLE_SHRIMP_PYDIO_PORT_80_TCP_ADDR=10.96.157.238",
                "PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin",
                "LC_CTYPE=en_US.UTF-8"
            ],
            "Cmd": [
                "/bin/sh",
                "-c",
                "bash start-libreoffice.sh"
            ],
            "Image": "sha256:8bc1d306ff41c5a358942308649f3eb76f2a9248d294793e08c1f532d4a04978",
            "Volumes": null,
            "WorkingDir": "",
            "Entrypoint": null,
            "OnBuild": null,
            "Labels": {
                "annotation.io.kubernetes.container.hash": "f031f7bb",
                "annotation.io.kubernetes.container.ports": "[{\"containerPort\":9980,\"protocol\":\"TCP\"}]",
                "annotation.io.kubernetes.container.restartCount": "211",
                "annotation.io.kubernetes.container.terminationMessagePath": "/dev/termination-log",
                "annotation.io.kubernetes.container.terminationMessagePolicy": "File",
                "annotation.io.kubernetes.pod.terminationGracePeriod": "30",
                "io.kubernetes.container.logpath": "/var/log/pods/0d4f0519-64ea-11e7-ab20-7054d216f5a7/collabora-code_211.log",
                "io.kubernetes.container.name": "collabora-code",
                "io.kubernetes.docker.type": "container",
                "io.kubernetes.pod.name": "sad-kitten-collabora-code-4174266487-ljrcx",
                "io.kubernetes.pod.namespace": "default",
                "io.kubernetes.pod.uid": "0d4f0519-64ea-11e7-ab20-7054d216f5a7",
                "io.kubernetes.sandbox.id": "305b10b43f7b46406edc2a2631600e39aa8b747124d61c6f60f709992a05a260"
            }
        },
        "NetworkSettings": {
            "Bridge": "",
            "SandboxID": "",
            "HairpinMode": false,
            "LinkLocalIPv6Address": "",
            "LinkLocalIPv6PrefixLen": 0,
            "Ports": null,
            "SandboxKey": "",
            "SecondaryIPAddresses": null,
            "SecondaryIPv6Addresses": null,
            "EndpointID": "",
            "Gateway": "",
            "GlobalIPv6Address": "",
            "GlobalIPv6PrefixLen": 0,
            "IPAddress": "",
            "IPPrefixLen": 0,
            "IPv6Gateway": "",
            "MacAddress": "",
            "Networks": null
        }
    }
]

ekozan avatar Jul 10 '17 17:07 ekozan

There's a 50 s time limit for the preinit stage during startup, it might be it fails to complete the process during that period on certain systems.

thebearon avatar Jul 11 '17 14:07 thebearon

        "StartedAt": "2017-07-10T17:35:03.576286932Z",
        "FinishedAt": "2017-07-10T17:35:54.150846205Z"

Yup it's can be that :)

ekozan avatar Jul 11 '17 16:07 ekozan

Where's that timeout defined? Do we have any chance to change that to something a little higher?

ripper17 avatar Jul 13 '17 10:07 ripper17

From the look of my log messages, the timeout is visible, but doesn't appear to be the cause of the issue. Allow me to elaborate:

#### FIRST ATTEMPT STARTS HERE ####
## START RSA KEY SECTION ##
Generating RSA private key, 2048 bit long modulus
..................+++
.+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
....................................................+++
.........................................................................................................+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
## END RSA KEYS SECTION ##
loolforkit version details: 2.1.1 - 52f0568e
�[1m�[31mwsd-00025-00025 07:53:43.257922 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2443�[0m
Failed to fork child processes.
�[1m�[31mwsd-00025-00025 07:53:43.258517 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2590�[0m
Failed to fork child processes.
�[1m�[33mwsd-00025-00025 07:53:43.258980 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507�[0m
�[1m�[33mwsd-00025-00025 07:53:43.259072 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507�[0m
<shutdown>-00025 07:53:43.260100 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260159 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260207 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260258 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260303 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260348 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260663 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
<shutdown>-00025 07:53:43.260718 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
�[1m�[31mfrk-00027-00027 07:53:43.263370 [ forkit ] FTL  Pipe closed.| common/IoUtil.cpp:309�[0m
#### FIRST ATTEMPT ENDS HERE ####

## START RSA KEY SECTION ##
Generating RSA private key, 2048 bit long modulus
...................................................+++
.........................................+++
e is 65537 (0x10001)
Generating RSA private key, 2048 bit long modulus
.............+++
........+++
e is 65537 (0x10001)
Signature ok
subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
Getting CA Private Key
### END RSA KEYS SECTION ###
loolforkit version details: 2.1.1 - 52f0568e
�[1m�[31mwsd-00025-00025 07:54:35.663806 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2443�[0m
Failed to fork child processes.

This is one full cycle and part of a second cycle of the init phase in the log. It starts by generating the RSA private key, which succeeds, then it attempts to call loolforkit which fails with Failed to fork child process. Looking at the timestamps: [RSA section] 07:53:43.257922 -> first fork attempt 07:53:43.263370 -> pipe closed (gives up trying) --- Loop --- [RSA section] 07:54:35.663806 -> second fork attempt ...etc. There is a 52 second interval between the fork attempts. But they are failing way faster than that. And the RSA key generation seems to be taking up most of the 2 second difference. What I think is happening is that it is forking, failing fast (43.263370-43.257922=0.005448s) and then waiting most of the 50 second timeout to restart. So the 50 second limit is not what is causing the error. Something in loolforkit is. But that's just my analysis of the log and I don't know the inner working enough to comment further.

Marcool04 avatar Jul 13 '17 10:07 Marcool04

Yes, looking at my own logs, I'd agree with you that the 50s timeout is between the two attempts and don't seem to be the cause of the issue, but rather one symptom. So it's still unclear why the forking of the child processes fails so often (or even always for some users).

ripper17 avatar Jul 13 '17 10:07 ripper17

I got the same error, the solution is, to use not the /etc/default/docker for configuration the devicemapper - use systemd for this.

The solution mentioned in this post -> https://help.nextcloud.com/t/collabora-docker-capabilities-problem/4774 did solve my problem.

rebruCH avatar Jul 20 '17 09:07 rebruCH

I have the same issue and using overlay2 on Arch Linux. Anyone solved this without switching to devicemapper?

returntrip avatar Oct 19 '17 08:10 returntrip

I have 17 running services on my server, is there any method to keep aufs method and not devicemapper?

ariselseng avatar Nov 09 '17 10:11 ariselseng

The timeout was increased in version 2.1.5, give it a try to see if that helps.

thebearon avatar Nov 16 '17 18:11 thebearon

I have just tested with version 2.1.5, the problem is still not resolved. It just takes longer time to crash but the docker image is not functional as the web app is not responding while waiting:

collabora_1         | 2017-11-19T21:20:16.228234714Z Generating RSA private key, 2048 bit long modulus
collabora_1         | 2017-11-19T21:20:16.232544498Z ...+++
collabora_1         | 2017-11-19T21:20:16.491489887Z ....................................................................................................................................................................................................................................+++
collabora_1         | 2017-11-19T21:20:16.491884384Z e is 65537 (0x10001)
collabora_1         | 2017-11-19T21:20:16.498557068Z Generating RSA private key, 2048 bit long modulus
collabora_1         | 2017-11-19T21:20:16.511605168Z ...........+++
collabora_1         | 2017-11-19T21:20:16.517997544Z .....+++
collabora_1         | 2017-11-19T21:20:16.518292207Z e is 65537 (0x10001)
collabora_1         | 2017-11-19T21:20:16.523152527Z Signature ok
collabora_1         | 2017-11-19T21:20:16.523161166Z subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
collabora_1         | 2017-11-19T21:20:16.523339588Z Getting CA Private Key
collabora_1         | 2017-11-19T21:20:16.547659073Z loolforkit version details: 2.1.5 - c9a8aed
collabora_1         | 2017-11-19T21:20:16.547718300Z frk-00029-00029 21:20:16.547480 [ forkit ] ERR  Ignored setting RLIMIT_FSIZE to 18446744073709551615 bytes. (errno: Success)| common/Seccomp.cpp:284
collabora_1         | 2017-11-19T21:20:16.547730763Z frk-00029-00029 21:20:16.547492 [ forkit ] ERR  Ignored setting RLIMIT_NOFILE to 18446744073709551615 files. (errno: Success)| common/Seccomp.cpp:306
collabora_1         | 2017-11-19T21:20:16.547734680Z frk-00029-00029 21:20:16.547510 [ forkit ] FTL  Capability cap_sys_chroot is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1         | 2017-11-19T21:20:16.547738084Z frk-00029-00029 21:20:16.547515 [ forkit ] FTL  Capability cap_mknod is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1         | 2017-11-19T21:20:16.547741316Z frk-00029-00029 21:20:16.547529 [ forkit ] FTL  Capability cap_fowner is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1         | 2017-11-19T21:20:16.547744653Z FATAL: Capabilities are not set for the loolforkit program.
collabora_1         | 2017-11-19T21:20:16.547747632Z If you are on SLES11, please set 'file_caps=1' as kernel boot option.
collabora_1         | 2017-11-19T21:20:16.547750884Z 
collabora_1         | 2017-11-19T21:24:26.543022381Z wsd-00027-00027 21:24:26.542853 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2623
collabora_1         | 2017-11-19T21:24:26.543045976Z FATAL: Failed to fork child processes.
collabora_1         | 2017-11-19T21:24:26.543058925Z wsd-00027-00027 21:24:26.542987 [ loolwsd ] FTL  Failed to fork child processes.| wsd/LOOLWSD.cpp:2770
collabora_1         | 2017-11-19T21:24:26.543061270Z Failed to fork child processes.
collabora_1         | 2017-11-19T21:24:26.543202917Z wsd-00027-00027 21:24:26.543101 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543207344Z wsd-00027-00027 21:24:26.543123 [ loolwsd ] WRN  Waking up dead poll thread [admin], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543408391Z <shutdown>-00027 21:24:26.543306 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543412585Z <shutdown>-00027 21:24:26.543312 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543422773Z <shutdown>-00027 21:24:26.543325 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543424946Z <shutdown>-00027 21:24:26.543329 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543426867Z <shutdown>-00027 21:24:26.543332 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543428816Z <shutdown>-00027 21:24:26.543335 [ loolwsd ] WRN  Waking up dead poll thread [accept_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543583216Z <shutdown>-00027 21:24:26.543478 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507
collabora_1         | 2017-11-19T21:24:26.543587013Z <shutdown>-00027 21:24:26.543490 [ loolwsd ] WRN  Waking up dead poll thread [websrv_poll], started: false, finished: false| ./net/Socket.hpp:507

asirinelli avatar Nov 19 '17 21:11 asirinelli

Still happening for me as well.

Keridos avatar Dec 21 '17 01:12 Keridos

Hi, I found the issue on all my debian setups with docker with versions from 2.1.2 to 3.0.0.0. I found the issue inside a fresh debian 9 virtualbox machine, 2 debian 9 machines and a debian package installation. It used to work until an update in november for me.

For docker I tried overlay2 and overlay storage driver. So far the solution seems to switch the storage driver to devicemapper (I have not tried).

The fact that I find the issue with debian package makes me think the error does not comes from docker but something higher like a system library or a kernel issue.

Can someone else confirms package install and docker fails on the same machine ?

jribal avatar Jan 07 '18 19:01 jribal

the probleme is on CollaboraOnline code

ekozan avatar Jan 07 '18 20:01 ekozan

https://github.com/CollaboraOnline/Docker-CODE/issues/10#issuecomment-345551351: this seems to be a different issue, the required capabilities are missing.

https://github.com/CollaboraOnline/Docker-CODE/issues/10#issuecomment-343110034, https://github.com/CollaboraOnline/Docker-CODE/issues/10#issuecomment-355846689: I've had no issues with aufs and overlay2 storage drivers on Ubuntu 17.04 / 17.10.

There can be different cases, and in others the Collabora Online log in the container should give further errors, eg. this and this are both unrelated issues, but lead to "FTL Failed to fork child process".

thebearon avatar Jan 11 '18 05:01 thebearon

First off I would like to say I am having the same or at least very similar issues with the Docker Container and it restarting constantly. I am running it on DigitalOcean on ubuntu 16.04 and it is the only installed program beyond the base system and docker requirements.

Second, issues like this is the precise reason that open source projects either don't get wide adoption or out right fail! This issue has been open for almost a year and is still not fixed! I'd be more than happy to pay for Collabora Code IF they could get it working and stable but I really don't see that happening.

IDK maybe I'm just old and angry but when you are shipping a product you don't just update it without testing and then push this update out and after people have stated there are major or critical issues you roll it back to the previous version not just leave it on the broken one. Collabora really needs to get there S together!

RCcola1987 avatar Jan 14 '18 17:01 RCcola1987

Okay; a probable fix. Can somebody try passing this environment (SLEEPFORDEBUGGER=0) and see if it works.

You can pass it like -e "SLEEPFORDEBUGGER=0" to the docker run command or via some other medium if you are using docker-compose, etc.

If it doesn't fix the issue, trace level logs (by changing the entry from warning to trace in /etc/loolwsd/loolwsd.xml; also consider changing the flush attribute in to true) might help.

pranavk avatar Jan 19 '18 16:01 pranavk

With this variable, the container has stopped restarting continuously but it get stuck during the initialisation process:

collabora_1              | Generating RSA private key, 2048 bit long modulus
collabora_1              | .......+++
collabora_1              | ...................................................................................+++
collabora_1              | e is 65537 (0x10001)
collabora_1              | Generating RSA private key, 2048 bit long modulus
collabora_1              | ........................................+++
collabora_1              | ...............................................................................................................+++
collabora_1              | e is 65537 (0x10001)
collabora_1              | Signature ok
collabora_1              | subject=/C=DE/ST=BW/L=Stuttgart/O=Dummy Authority/CN=localhost
collabora_1              | Getting CA Private Key
collabora_1              | frk-00029-00029 21:27:42.871039 [ forkit ] FTL  Capability cap_sys_chroot is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1              | frk-00029-00029 21:27:42.871047 [ forkit ] FTL  Capability cap_mknod is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1              | frk-00029-00029 21:27:42.871051 [ forkit ] FTL  Capability cap_fowner is not set for the loolforkit program.| kit/ForKit.cpp:168
collabora_1              | FATAL: Capabilities are not set for the loolforkit program.
collabora_1              | If you are on SLES11, please set 'file_caps=1' as kernel boot option.
collabora_1              | 
collabora_1              | frk-00029-00029 21:27:42.871057 [ forkit ] WRN  Waking up dead poll thread [delay_poll], started: false, finished: false| net/Socket.hpp:507
collabora_1              | frk-00029-00029 21:27:42.871074 [ forkit ] WRN  Waking up dead poll thread [delay_poll], started: false, finished: false| net/Socket.hpp:507
collabora_1              | loolforkit version details: 3.0.0 - 81c8935

Of course, the server is not started and nothing is answering to the https requests.

asirinelli avatar Jan 19 '18 21:01 asirinelli

Here is a log in "trace" mode: https://framabin.org/?4fe8437720b162a9#O/H659YpD9U/BAMgKRxQSXHN6DIVl2Wfpth8wlOHVaU=

asirinelli avatar Jan 19 '18 23:01 asirinelli

I don't know why capabilities are not set on loolforkit binary in your docker container. As mentioned in the error message, it may depend on your kernel boot options, etc.

pranavk avatar Jan 20 '18 04:01 pranavk

And you didn't set the flush to true in logging, I think. Did you? Anyways, trace level logs are irrelevant here since it's clear what the problem is -- you need to figure out why capabilities are not set in the first place.

pranavk avatar Jan 20 '18 04:01 pranavk

Hi, Still no success after setting flush to true, and using -SLEEPFORDEBUGGER=0 in docker-compose. My kernel capabilities are okay. collabora | frk-00028-00028 14:10:16.490045 [ forkit ] INF Have capability cap_sys_chroot| kit/ForKit.cpp:180 collabora | frk-00028-00028 14:10:16.490066 [ forkit ] INF Have capability cap_mknod| kit/ForKit.cpp:180 collabora | frk-00028-00028 14:10:16.490145 [ forkit ] INF Have capability cap_fowner| kit/ForKit.cpp:180 I can't see a clue in the trace logs.

jribal avatar Jan 20 '18 14:01 jribal

I can confirm the flush logging was set to true.

I am able to run the docker image in a CentOS 7.4 host but I cannot launch it in a Debian 9.3. The docker command lines are identical: docker run -t -d -p 127.0.0.1:9980:9980 -e 'domain=nextcloud\\.example\\.com' --restart always --cap-add MKNOD collabora/code

I have used docker inspect to see if I can see a difference and the only obvious difference is centOS is using overlay2 while Debian is using aufs.

The kernels used and docker configurations are the default ones for each distribution. I do not understand why the capabilities are not set when run on a Debian host.

asirinelli avatar Jan 20 '18 14:01 asirinelli

I'm using debian 9.3 kernel 4.12, overlay2 and it does not work even with capabilities enabled. I installed manually the kernel a while ago Linux 4.12.0-041200-generic #201707022031.

jribal avatar Jan 21 '18 20:01 jribal

@jribal can you paste your trace logs as well?

pranavk avatar Jan 22 '18 15:01 pranavk