caddy icon indicating copy to clipboard operation
caddy copied to clipboard

Exposing TLS certificate metrics

Open miekg opened this issue 7 years ago • 9 comments

This is a question/feature request.

For caddy-prometheus (github.com/miekg/caddy-prometheus) I would like to expose metrics from the certs caddy has configured. Like time left per cert, failures to update, etc. etc.

Is there an API for this, should caddy expose some of these bits or should I hack in into ACME (? believe that is used for handling LE comms)?

miekg avatar May 22 '17 10:05 miekg

Hello @miekg,

this sounds like a great idea!

I believe the best approach would be to create an API in github.com/xenolf/lego/acme -- It might be useful for other projects using lego/acme

Basically you should "hack in into ACME" 😄

__

On the other side, I believe it would be easier to create those metics directly in Caddy and expose them to caddy-prometheus

elcore avatar May 23 '17 19:05 elcore

[ Quoting [email protected] in "Re: [mholt/caddy] Exposing TLS cert..." ]

Hello @miekg,

this sounds like a great idea!

I believe the best approach would be to expose those metrics in github.com/xenolf/lego/acme -- It might be useful for other projects using lego/acme

Basically you should "hack in into ACME" 😄

Interesting thought. I'll look into that, there could either be an API in ACME, or directly expose prometheus metrics. I'll open an issue there.

On the other side, I believe it would be easier to create those metics directly in Caddy and expose them to caddy-prometheus

But how do I get access to the information that those metrics will expose?

/Miek

-- Miek Gieben

miekg avatar May 23 '17 19:05 miekg

But how do I get access to the information that those metrics will expose?

You could import caddy/caddytls in caddy-prometheus and use those metrics -- I have not really thought deeply about this ..........

I just posted some ideas I had :smile:, my raw thoughts.

elcore avatar May 23 '17 19:05 elcore

Interesting thought. I'll look into that, there could either be an API in ACME, or directly expose prometheus metrics. I'll open an issue there.

Awesome :smile:

elcore avatar May 24 '17 08:05 elcore

I think we can do something like this; Caddy can emit events but right now it only emits one (startup) because we're adding events based on need. I don't have a page on the wiki yet explaining how to hook into events (I'll do that soon) but it's very easy; you can look at the caddy-service plugin and see how it does it.

Although I think we'll need to change the signature of event hooks to pass in some information. Shouldn't be too hard since only that plugin uses event hooks; I'll talk to the authors about getting it updated.

I think this is better done in Caddy than lego.

mholt avatar May 26 '17 15:05 mholt

Interesting. What new events would we need? As I alluded to above, qps to let's encrypt, possible errors returned from the let's encrypt, TLS cert properties (don't know what is interesting there).

miekg avatar May 26 '17 15:05 miekg

We could add a lot of events around TLS management. Caddy scans loaded certs every 12 hours for renewal, or every hour for OCSP stapling, those are events (although maybe uninteresting, and not needed to emit). When Caddy starts renewing a certificate, or finishes successfully or has an error, those are events. When it updates OCSP staples, that's an event. When it obtains a new certificate with on-demand TLS, that's an event. When it loads a certificate from disk, that's an event. By emitting these with some information attached (currently infeasible but we can change that) you should be able to get what you'd need, I think?

QPS to Let's Encrypt specifically doesn't make much sense since requests to LE are few and far between.

Edit: If you want really fine-grained information, such as every low-level network interaction that lego makes in order to validate a challenge, that'll have to be built into lego, of course, and I dunno how that would go. But for everything at or above "Obtain a certificate" or "renew a certificate", Caddy can emit those.

mholt avatar May 26 '17 15:05 mholt

@miekg If this is still something you're interested in, I think you should join our discussion in the #telemetry channel in Slack. I think that will be the best fit for this feature, since we expect telemetry to be exportable in various formats or directly to external monitoring services.

Either that or we overhaul logging to be able to tee to different outputs.

mholt avatar Mar 27 '18 01:03 mholt

/cc @hairyhenderson

mholt avatar Sep 17 '20 18:09 mholt