postgres icon indicating copy to clipboard operation
postgres copied to clipboard

Add a HEALTHCHECK to the images

Open kodedylf opened this issue 1 year ago • 7 comments

When helping people on StackOverflow, I often see people having problems with Docker Compose and applications that try to connect to the database immediately on startup. When using a simple docker compose 'depends_on' directive, compose only waits for the database to be started before starting the application. The application then fails because the database isn't ready to accept connections yet.

The solution is to add a healthcheck to the database container in docker compose and have the application wait for the database to enter a healthy state.

It would be nice if the healthcheck was included in the Postgres image by default. A simple HEALTHCHECK ["pg_isready"] should be enough.

kodedylf avatar May 16 '24 08:05 kodedylf

Some of this is discussed in #920

LaurentGoderre avatar May 16 '24 13:05 LaurentGoderre

Indeed, see https://github.com/docker-library/faq#healthcheck and the resulting non-triviality of https://github.com/docker-library/healthcheck/blob/master/postgres/docker-healthcheck

tianon avatar May 16 '24 16:05 tianon

Why not include healthcheck script but not enable it by default to make things a bit easier?

mwisnicki avatar Aug 03 '24 02:08 mwisnicki

IMHO included healthcheck should have these features:

  • ensures all init scripts are done
  • ensures db can be queried (SELECT 1)
  • allows user to provide custom query via env variable

mwisnicki avatar Aug 03 '24 03:08 mwisnicki

I was also struggling with a health check until I discovered that pg_ready was not what I was looking for. Here I posted my solution on stackoverflow and explain why this is the case.

My healthcheck:

test: ["CMD-SHELL", "psql -U ${DB_USER} -d ${DB_MAIN} -c 'SELECT 1' || exit 1"]

hibuna avatar Aug 10 '24 21:08 hibuna

See https://github.com/docker-library/postgres/pull/920#issuecomment-1005306817, especially:

Using --host 127.0.0.1 with pg_isready or psql for checking the status of the container from within it is going to be the safed/best supported option (see https://github.com/docker-library/healthcheck/blob/master/postgres/docker-healthcheck for an example that does exactly that).

(Your example will report "healthy" before the container is actually ready for external connections because it will happily connect to the unix socket of the initialization instance.)

tianon avatar Aug 12 '24 16:08 tianon

pg_isready --host=127.0.0.1 || exit 1 seems like a sufficiently good default, no? For more specific use cases, users of the image can override the HEALTHCHECK instruction to change the command, the options, or to disable it altogether.


Rebuttal of the FAQ below…

  • many users will have their own idea of what "healthy" means and credentials change over time making generic health checks hard to define

As mentioned above, users can customize the instruction as they wish. Checking whether the server is ready to accept a connection seems like a reasonable default.

after upgrading their images, current users will have extra unexpected load on their systems for healthchecks they don't necessarily need/want and may be unaware of

Yes, this is a small breaking change. But we should also be open to breaking changes; otherwise, we will be stuck with less-than-ideal solutions. Besides, in what use cases would it make sense not to add the HEALTHCHECK instruction?

Kubernetes does not use Docker's heath checks (opting instead for separate liveness and readiness probes)

Kubernetes disables Docker’s HEALTHCHECK instruction (like anyone else can!), so adding the instruction to the image does not impact Kubernetes.

sometimes things like databases will take too long to initialize, and a defined health check will often cause the orchestration system to prematurely kill the container (https://github.com/docker-library/mysql/issues/439)

The root cause here is that the configured timeout is too small. The solution is to increase the timeout, not to disable the HEALTHCHECK instruction.

fumoboy007 avatar Aug 26 '24 22:08 fumoboy007