podman exec into a "-it" container: container create failed (no logs from conmon): EOF

Open edsantiago opened this issue 4 years ago • 75 comments

Common thread seems to be:

Running: podman [options] run -dti --name test1 quay.io/libpod/fedora-minimal:latest sleep +Inf
time="2021-06-16T19:33:53-05:00" level=warning msg="The input device is not a TTY. The --tty and --interactive flags might not work properly"
99e3b419a97aa408a4d0d3072bbd00579d5edd7c97790aa06d61f233cfdc1b4c
Running: podman [options] exec -ti test1 true
Running: podman [options] exec -ti test1 true       ! sometimes it fails on the first, sometimes on the third
Error: container create failed (no logs from conmon): EOF

Podman exec [It] podman exec terminal doesn't hang

And also just now in a still-live PR (my flake-xref does not handle live PRs): int podman ubuntu-2104 root host

Note: the March and April logs above have been garbage-collected, so I can't confirm that the error is the same one. I'm leaving them in the report deliberately, in case it helps to have a timestamp for the start of this flake (i.e., it might not be new in June).

Edit: this is podman, not podman-remote, so it's unlikely to be the same as #7360

edsantiago avatar Jul 13 '21 20:07 edsantiago

Podman exec [It] podman exec terminal doesn't hang

  • fedora-34 : int podman fedora-34 root host
  • ubuntu-2104 : int podman ubuntu-2104 root host

edsantiago avatar Aug 03 '21 17:08 edsantiago

Podman exec [It] podman exec terminal doesn't hang

edsantiago avatar Aug 17 '21 16:08 edsantiago

Hmmm, I wonder if this is the same problem, in a different test? Looks suspiciously close.

  podman network connect
Running: podman [options] exec -it test ip addr show eth1
Error: container create failed (no logs from conmon): EOF

Podman network connect and disconnect [It] podman network connect

  • fedora-34 : int podman fedora-34 root host

edsantiago avatar Aug 17 '21 16:08 edsantiago

Another one, in yet another test. Looks like this is happening more often than I thought, because it happens in multiple tests:

Podman exec [It] podman exec --detach

  • fedora-34 : int podman fedora-34 root host

edsantiago avatar Aug 17 '21 16:08 edsantiago

A friendly reminder that this issue had no activity for 30 days.

github-actions[bot] avatar Sep 17 '21 00:09 github-actions[bot]

Podman exec [It] podman exec terminal doesn't hang

Podman network connect and disconnect [It] podman network connect when not running

edsantiago avatar Sep 23 '21 20:09 edsantiago

Podman network connect and disconnect [It] podman network disconnect and run with network ID

  • fedora-34 : int podman fedora-34 root host

edsantiago avatar Oct 04 '21 19:10 edsantiago

Podman exec [It] podman exec terminal doesn't hang

  • fedora-34 : int podman fedora-34 root host
  • fedora-34 : int remote fedora-34 root host [remote]
  • ubuntu-2104 : int podman ubuntu-2104 root host

edsantiago avatar Oct 19 '21 17:10 edsantiago

Still seeing this. int remote fedora-35 root

edsantiago avatar Nov 04 '21 15:11 edsantiago

I'll take a stab at it. Thanks for assembling the data, @edsantiago!

vrothberg avatar Nov 10 '21 12:11 vrothberg

while true; do
        ./bin/podman run --name=test --replace -dti quay.io/libpod/fedora-minimal:latest sleep +Inf
        ./bin/podman exec test true
        ./bin/podman rm -f -t0 test
done

Ran for over 30 minutes with no failure. I'll have a look at the code; maybe I can come up with a theory, but a reproducer would be great.

vrothberg avatar Nov 10 '21 12:11 vrothberg

I can't reproduce on my laptop either, but on a 1minutetip f34 VM it fails instantly, on the very first try:

# podman run -dti --name=test quay.io/libpod/fedora-minimal:latest sleep 20;podman exec -it test true
8ed6f60c9a8e38d2081ece7a5471cc1a931f402170a9b0ff8f149bffb434994b
Error: container create failed (no logs from conmon): EOF

After that first time it still fails, but only once in 4-5 times. Note that it fails even without < /dev/null on either podman command.

podman-3.4.1-1.fc34.x86_64 conmon-2.0.30-2.fc34.x86_64

edsantiago avatar Nov 10 '21 19:11 edsantiago

One more note: I think the -it is needed on exec. Without it, I can't reproduce the failure.
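
A minimal sketch of a reproducer loop that combines the observations so far: the create/exec/remove cycle tried earlier, but with -it on the exec. The names PODMAN, IMAGE, and repro_loop are illustrative assumptions, not part of podman, and there is no guarantee this triggers the flake on any given machine.

```shell
#!/bin/sh
# Sketch only: PODMAN, IMAGE, and repro_loop are hypothetical names for illustration.
PODMAN=${PODMAN:-podman}
IMAGE=${IMAGE:-quay.io/libpod/fedora-minimal:latest}

# repro_loop N: run N create/exec/remove cycles and print how many execs failed.
repro_loop() {
    iterations=$1
    failures=0
    i=1
    while [ "$i" -le "$iterations" ]; do
        "$PODMAN" run --name=test --replace -dti "$IMAGE" sleep +Inf >/dev/null
        # -it on exec appears to be needed to hit the failure.
        if ! "$PODMAN" exec -it test true; then
            failures=$((failures + 1))
            echo "iteration $i: exec failed" >&2
        fi
        "$PODMAN" rm -f -t0 test >/dev/null
        i=$((i + 1))
    done
    # Only the failure count goes to stdout; per-iteration notes go to stderr.
    echo "$failures"
}
```

Calling `repro_loop 50` would run fifty cycles and print the number of failed execs, which makes it easier to compare failure rates between a laptop and a CI-like VM.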

edsantiago avatar Nov 10 '21 19:11 edsantiago

@mheon PTAL

rhatdan avatar Nov 10 '21 20:11 rhatdan

One would think this is a race between podman run creating the container and launching conmon, and podman exec talking to conmon before it knows there is a container, causing some issues.

rhatdan avatar Nov 10 '21 20:11 rhatdan

Well, except that it's not always the first exec. This log shows the first three execs working, then it fails on the fourth.

edsantiago avatar Nov 10 '21 21:11 edsantiago

Very difficult to track this down without a repro - we need to know what's going on with Conmon such that it's blowing up (personally I think Conmon is probably either segfaulting or just printing the error to the journal and exiting without reporting the real error to Podman). Might be logs in the journal that will help us?
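
For anyone who can reproduce this, a rough sketch of how the journal and core dumps could be checked right after a failed exec. It assumes a systemd host, that conmon logs to the journal under the identifier "conmon", and that systemd-coredump is installed; collect_conmon_evidence is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch only: collect_conmon_evidence is a hypothetical helper, not a podman tool.
# Usage: collect_conmon_evidence ["10 minutes ago"]
collect_conmon_evidence() {
    since=${1:-"10 minutes ago"}
    # Recent journal messages tagged "conmon" (assumes conmon logs under that identifier).
    journalctl --no-pager --since "$since" -t conmon
    # Recorded core dumps for conmon, in case it segfaulted (needs systemd-coredump).
    coredumpctl --no-pager list conmon || echo "no conmon core dumps recorded"
}
```

Running the failing command again with `podman --log-level=debug exec -it test true` may also show more of what podman sees before conmon exits.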

mheon avatar Nov 10 '21 21:11 mheon

@rhatdan It's not actually container create that's failing, that's a bad error message. We're trying to make a Conmon for the exec session but Conmon is failing with no logs as to why.

mheon avatar Nov 10 '21 21:11 mheon

@mheon see my 1minutetip f34 VM comment above. It reproduces reliably.

edsantiago avatar Nov 10 '21 22:11 edsantiago

Here's one in the brand-new ubuntu-2110

edsantiago avatar Nov 11 '21 17:11 edsantiago

Podman network connect and disconnect [It] podman network disconnect when not running

Podman network connect and disconnect [It] podman network disconnect

  • fedora-34 : int remote fedora-34 root host [remote]
  • ubuntu-2110 : int podman ubuntu-2110 root host
  • ubuntu-2110 : int podman ubuntu-2110 rootless host

edsantiago avatar Nov 22 '21 13:11 edsantiago

Podman exec [It] podman exec terminal doesn't hang

Podman network connect and disconnect [It] podman network disconnect

  • fedora-34 : int remote fedora-34 root host [remote]
  • ubuntu-2110 : int podman ubuntu-2110 root host

edsantiago avatar Nov 30 '21 19:11 edsantiago

Fresh one in ubuntu 2110 root. Curious thing: once it happens one time, it seems to happen on a bunch more tests afterward.

edsantiago avatar Nov 30 '21 22:11 edsantiago

Here's one where it fails with a bad exit code, but the conmon error isn't present:

# podman [options] run -dti --name test1 registry.fedoraproject.org/fedora-minimal:34 sleep +Inf
time="2021-12-08T15:27:00Z" level=warning msg="The input device is not a TTY. The --tty and --interactive flags might not work properly"
ce72bce58b4ef3d0215bc5d805594b94f8ae18e1eee558471358f6a682846df3
# podman [options] exec -ti test1 true
# podman [options] exec -ti test1 true       <--- this is the one that seems to fail
...
         • Failure [4.220 seconds]
         Podman exec
         /var/tmp/go/src/github.com/containers/podman/test/e2e/exec_test.go:16
           podman exec terminal doesn't hang [It]
           /var/tmp/go/src/github.com/containers/podman/test/e2e/exec_test.go:334
         
           Expected
               <int>: 129
           to match exit code:
               <int>: 0
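
A side note on reading that 129: by shell convention, an exit status above 128 usually means 128 plus a signal number, so 129 would correspond to signal 1 (SIGHUP). Whether podman's 129 here really comes from a hangup is an assumption, not something the log confirms. A small sketch for decoding such statuses, with decode_exit as a hypothetical helper name:

```shell
#!/bin/sh
# Sketch only: decode_exit is a hypothetical helper for illustration.
# decode_exit STATUS: print the signal implied by an exit status above 128.
decode_exit() {
    status=$1
    if [ "$status" -gt 128 ]; then
        # kill -l maps an exit status back to the signal name (without the SIG prefix).
        echo "exit $status = 128 + $((status - 128)) (SIG$(kill -l "$status"))"
    else
        echo "exit $status: ordinary exit code"
    fi
}
```

With this, `decode_exit 129` reports SIGHUP, which would at least be consistent with a terminal going away underneath the exec session.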

Podman exec [It] podman exec terminal doesn't hang

  • fedora-34 : int podman fedora-34 root container

edsantiago avatar Dec 09 '21 13:12 edsantiago

A friendly reminder that this issue had no activity for 30 days.

github-actions[bot] avatar Jan 09 '22 00:01 github-actions[bot]

@edsantiago is this still an issue?

rhatdan avatar Jan 10 '22 14:01 rhatdan

Last seen 12-21:

Podman init containers [It] podman ensure always init containers always run

Podman network connect and disconnect [It] podman network connect and run with network ID

  • fedora-35 : int podman fedora-35 root host
  • ubuntu-2104 : int remote ubuntu-2104 root host [remote]

Maybe Santa's elves fixed it over break. Or maybe our CI use has been low due to so many of us on PTO. (Since you removed the stale-issue tag, I'm pretty sure your guess is the same as mine).

edsantiago avatar Jan 10 '22 15:01 edsantiago

A friendly reminder that this issue had no activity for 30 days.

github-actions[bot] avatar Feb 10 '22 00:02 github-actions[bot]

Podman exec [It] podman exec terminal doesn't hang

  • ubuntu-2110 : int podman ubuntu-2110 root host

edsantiago avatar Feb 10 '22 14:02 edsantiago

A friendly reminder that this issue had no activity for 30 days.

github-actions[bot] avatar Mar 13 '22 00:03 github-actions[bot]