Promsafe: Strongly-typed safe labels
Promsafe
Introducing promsafe lib (optional helper lib, similar to promauto) that allows to use type-safe labels.
Motivation
This PR only covers Counter functionality as an example. If idea is fine for community, I'll push further commits expanding logic to Gauge, Histogram, etc
For detailed motivation see my comment below
Fixes #1599
Why?
Currently having unsafe labels lead to several problems: either err-handling nightmare, either panicing (in case if you use "promauto")
Having unsafe labels can lead to following issues:
- Misspelling
- Mistyping
- Misremembering
- Too many labels
- Too few labels
As of state of art of modern Go version, we can use Go Generics to solve these issues.
Examples of how to use it
1. Multi-label mode (safe structs)
type MyCounterLabels struct {
promsafe.StructLabelProvider
EventType string
Success bool
Position uint8 // yes, it's a number, but be careful with high-cardinality labels
ShouldNotBeUsed string `promsafe:"-"`
}
// Note on performance:
// By default if no custom logic provider the MyCounterLabels will use reflect to extract label names and values
// But if performance matters you can define your own exporting methods:
// Optional! (if not specified, it will fallback to reflect)
func (f MyCounterLabels) ToPrometheusLabels() prometheus.Labels {
return prometheus.Labels{"event_type": f.EventType, "success": fmt.Sprintf("%v", f.Success), "position": fmt.Sprintf("%d", f.Position)}
}
// Optional! (if not specified it will fallback to reflect)
func (f MyCounterLabels) ToLabelNames() []string { return []string{"event_type", "success", "position"} }
// Creating a typed counter providing specific labels type
c := promsafe.NewCounterVec[MyCounterLabels](prometheus.CounterOpts{
Name: "items_counted",
})
// Manually register the counter
if err := prometheus.Register(c.Unsafe()); err != nil {
log.Fatal("could not register: ", err.Error())
}
// it ONLY allows you to fill the MyCounterLabels here
c.With(MyCounterLabels{
EventType: "request", Success: true, Position: 1,
}).Inc()
Compatibility with promauto
1. promauto.With call migration
var myReg = prometheus.NewRegistry()
counterOpts := prometheus.CounterOpts{
Name: "items_counted",
}
// Old unsafe code
// promauto.With(myReg).NewCounterVec(counterOpts, []string{"event_type", "source"})
// becomes:
type MyLabels struct {
promsafe.StructLabelProvider
EventType string
Source string
}
c := promauto_adapter.With(myReg).NewCounterVec[MyLabels](counterOpts)
c.With(MyLabels{
EventType: "reservation", Source: "source1",
}).Inc()
Note:
All non-string value types will be automatically converted to string. Here we can add a reasonable type-validation, so we can make it only to work with fields that are strings/bools/ints
Hi! Thanks for innovating here 💪🏽
I presume this is about using generics for label values type safety -- in the relation to defined label names.
Currently having unsafe labels lead to several problems: either err-handling nightmare, either panicing (in case if you use "promauto")
Can you share exactly the requirements behind promsafe. Perhaps it would allow us to make decision if such package is useful to enough to maintain in client_golang OR existing solutions are enough OR is there a way to extend existing packages with improvements for the same goals.
For example, how often you see those err handling nightmare and panics in practice? Can you share some experience/data?
Generally, what's recommended is hardcoding label values in WithLabelValues, which by testing given code-path you know immediately if it's panicking. If you use dynamic values (e.g. variable) as your values then it's generally prone to cardinality issues anyway, thus we experimented with constraint labels solution.
Thus, let's circle back to barebone requirements we want here 🤗 e.g. generally you should avoid using With. What are the cases we are solving here?
Additionally, performance is important for this increment flow, so it would be nice to check how this applies.
Hey. Thanks for a feedback. Let me share details on my motivation behind the provided promsafe package.
By err handling nightmare / panics I meant the following cases:
// Counter registration: we're fine with possible panic here :)
myCounter := promauto.NewCounterVec(prometheus.CounterOpts{
Name: "items_counted",
}, []string{"event_type", "success", "slot" /* 1/2/3/.../9 */})
// But using counter: where there motivation comes from:
// Using .GetMetricWith*() methods will error if labels are messed up
myCounterWithValues, err := myCounter.GetMetricWith(prometheus.Labels{
"event_type": "reservation",
"success": "true",
"slot": "1",
})
if err != nil {
// TODO: handle error
}
// Same error can happen if using *WithLabelValues version:
// myCounterWithValues, err := myCounter.GetMetricWithLabelValues("reservation", "true", "1")
// To avoid error-handling we can use .With/.WithLabelValues, but it will just panic for the same reasons:
myCounter.WithLabelValues("reservation", "true", "2").Inc()
💡 So here and further i call "panic" both panicing of .With* methods or error-handling in .GetMetricWith* methods
Why Panic? why it matters?
Here are several reasons:
- Misspelling. You can misspell label names. (Not relevant for WithLabelValues though)
- Misremembering. You can forget the name of the label. In case of using WithLabelValues you still need to remember the number of labels and their order (and what they mean)
- Missing labels. You can forget a label (both in map of With() or in slice of WithLabelValues())
- Extra labels. You can accidentally pass extra labels (both in map of With() or in slice of WithLabelValues())
- Manual string conversion can lead to failures as well. E.g. you must know to use fmt.Sprintf("%v", boolValue), and choosing wrong "%v" placeholder can ruin values.
All these reasons are possible ways to break code because of panicking in .With() or .WithLabelValues(). Let's not spend time and efforts on code-reviews to ensure that new usage of "counter inc" is not breaking everything.
Also, one more reason is not about failing but about consistency:
6. Type-safety allows you to be both less error-prone and more consistent.
E.g. you just pass bool values as label values, and you know it will always be "true"/"false" not "1","0","on","off","yes",...
same for numbers
How it's solved by promsafe?
// Promsafe example:
// Registering a metric with simply providing the type containing labels
type MyCounterLabels struct {
promsafe.StructLabelProvider
EventType string
Success bool
Slot uint8 // yes, it's a number, still should be careful with high-cardinality labels
}
myCounterSafe := promsafe.NewCounterVec[MyCounterLabels](prometheus.CounterOpts{
Name: "items_counted_detailed",
})
// Calling counter is simple: just provide the filled struct of the dedicated type.
//
// Neither of 5 reasons can panic here. You simply can't mess up the struct.
// With() accepts ONLY this type of struct, you can't send any other struct.
// You don't need to remember the fields and their order. IDE will show you them.
// You can't send more fields.
// You can send less fields (but it can easily fill up with default values, or other custom non-panicy logic).
// You can't mess up types.
// You're consistent with types.
myCounterSafe.With(MyCounterLabels{
EventType: "reservation", Success: true, Slot: 1,
}).Inc()
P.S. issue with inconsistency of promsafe-version of WithLabelValues() method
// One thing that I need to specify here is the inconsistency with promsafe-version of WithLabelValues()
// WithLabelValues() excepts ordered raw strings, that unfortunately breaks the "safety" concept.
// We can't control the order of given strings and even the length of it
// That's why in promsafe, .WithLavelValues() and .GetMetricWithLabelValues() are disabled:
// They are marked deprecated and panic (so they are strongly considered not to be used)
This API is really nice, would love to see this merged. I've already ran into a few of the failure modes @amberpixels mentioned in my first few weeks of using this library.
Small update.
I've pushed some improvements in API, so it's more consistent and stable. Also, I've update the PR description with cleaner and clearer examples, and added a note on performance issue.
Then there is efficiency aspect I would like to understand, given this is a hot path.
There are benchmarks in safe_test.go, and here are the results (on a MacBook Air M3):
BenchmarkCompareCreatingMetric/Prometheus_NewCounterVec-8 1252017 959.5 ns/op
BenchmarkCompareCreatingMetric/Promsafe_(reflect)_NewCounterVec-8 637221 1891 ns/op
BenchmarkCompareCreatingMetric/Promsafe_(fast)_NewCounterVec-8 1000000 1030 ns/op
Using automatic label extraction (via reflection) results in a 2x performance overhead compared to simple prometheus.NewCounterVec. However, the fast method (manual implementation of ToPrometheusLabels) is only about 10% slower.
Also before committing to any of this we have to ask ourselves what to recommend or deprecate in this place. We are getting to the place where there are many ways of doing the same thing, so would love to decide what to remove, if we think this is the way to go.
I completely agree with this.
To achieve and answer all of this, I wonder: A) How bad would it be to host
promsafein another repository for incubation period? B) Is there a room forprometheus/client_golang/expmodule which we could version v0.x and put other experimental stuff like Remote API?
I like the idea of a prometheus/client_golang/exp module (or /x/, as is commonly done with Go packages). This approach makes it clear that the contents are experimental while still keeping them close to the main client library.
I would like to explore slimmer "adapter" that just offers
With(labels T) Kmethod -- it would simplify the code to maintain and allow composability.
Understood. For a minimal implementation, I see the following setup:
1. A wrapper (custom type) for each metric type (e.g., CounterVec, GaugeVec, HistogramVec, etc.).
2. Only the With[T] method, with no need for GetMetricWith, CurryWith, MustCurryWith, and especially no GetMetricWithLabelValues or WithLabelValues.
So are you ok with such "slim" adapter? (it will be required to be implemented per metric type)
// NewCounterVec creates a new CounterVec with type-safe labels.
func NewCounterVec[T LabelsProviderMarker](opts prometheus.CounterOpts) *CounterVec[T] {
emptyLabels := NewEmptyLabels[T]()
inner := prometheus.NewCounterVec(opts, extractLabelNames(emptyLabels))
return &CounterVec[T]{inner: inner}
}
// CounterVec is a wrapper around prometheus.CounterVec that allows type-safe labels.
type CounterVec[T LabelsProviderMarker] struct {
inner *prometheus.CounterVec
}
// With behaves like prometheus.CounterVec.With but with type-safe labels.
func (c *CounterVec[T]) With(labels T) prometheus.Counter {
return c.inner.With(extractLabelsWithValues(labels))
}
@bwplotka I'd like to continue work on the branch (adding gauge, histogram, tests, etc) soon, so my main blocker is the question:
Should I wrap it as experimental functionality? If yes, then what name of the folder do you confirm? /x/ or /exp/?
Here is what I meant as an alternative for type safety (one does not exclude another one) https://github.com/bwplotka/metric-rename-demo/pull/1
Happy new year!
Hello from 2025 :) Thanks
... switch metric definitions to... yaml (: OpenTelemetry starts this new pattern with https://github.com/open-telemetry/weaver project ...
Thanks for noting this. I get the idea. That's an interesting and (imho, reasonable) experiment. I agree on your point that if this experiment succeeds - these safe typed metrics should be made and working exactly the same no matter if they were generated from yaml, or built dynamically in code.
Hey @bwplotka
Considering everything we’ve discussed, I’d like to summarize where we stand.
Code generation (e.g., YAML-based) is viable, by existing projects may choose to generate only some metrics or stick to a code-centric approach. That’s why I tend to find a flexible, combined approach (in aspects of type-safety)
I see ensuring a smooth migration from promauto to either promsafe or code-generated metrics as essential - to be seamless, not disruptive.
So, we have 2 Approaches (who can live separately or together)
Context:
Let’s say we work with a counter MyCounter that has three labels:
type MyLabels struct {
MyInt int
MyCustomConst string // "enum"-like string
MyFloat float64
}
Approach 1. Code-declared metrics (promsafe)
// With promauto, we typically declare a metric like this:
c := promauto.With(reg).NewCounterVec(opts)
//
// Using promsafe, this would be replaced with:
//
c := promsafe.With[MyLabels](reg).NewCounterVec()
// or with the default registry
c := promsafe.NewCounterVec[MyLabels]()
// or even this (to be consistent with your code-generated approach
c := promsafe.MustNewCounterVec[MyLabels](reg)
// which provides **type-safe methods**, allowing usage like:
c.With(MyLabels{myInt, myCustomConst, myFloat}).Inc()
I don’t think custom-tailored type-safe methods (e.g. c.WithMyLabels(myInt, myCustomConst, myFloat)) are feasible within promsafe without requiring too much boilerplate. However, the required setup for promsafe is minimal:
- Define a typed struct for labels (e.g.
MyLabels) - (Optional) Implement
.ToPrometheusLabels()for efficient conversion (avoiding reflection).- This ensures zero performance overhead compared to standard
promautocalls.
- This ensures zero performance overhead compared to standard
Approach 2. Generated Metrics (semconv-like)
// The original declaration:
c := promauto.With(reg).NewCounterVec(opts)
//
// Using generated metrics, this would be replaced with:
//
c := myGeneratedMetrics.MustNewMyCounterVec(reg) // where myGenMetrics is generated user package
// which provides a **fully type-safe interface,** where With* methods are generated:
c.WithLabelValues(myInt, myCustomConst, myFloat).Inc()
To ensure consistency with code-declared type-safe metrics, we could also support:
c.With(MyLabels{MyInt, MyCustomConst, MyFloat})
// We're ok as all this boilerplate code (sturcts, methods) are simply generated
Note: here MyLabels would be generated with the fast implementation (as it knows all the types).
Extra notes:
- For now, this will just work well with even
ConstraintLabels. But in future we can makeConstraintLabelsconcept be optimized to be part of generation metrics process (default values, min-max validation checks, etc) - Code gen approach can also even generate chainable
.WithMyInt(3)but I think this is making it too complicated.
Conclusion
Both promsafe and semconv / promgen offer type safety in different ways, and they can coexist smoothly.
Would love to hear your thoughts—let me know if you have any concerns or need help with implementations! 🚀
Is this still relevant? If so, what is blocking it? Is there anything you can do to help move it forward?
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.
@bwplotka do you have some updates on the development vector of type-safety feature in prometheus golang client? Will it be similar approach or will it be generation-based approach only?
Can i help via changes in this PR, or should we close it?
Meanwhile, @tombrk has made some experiments with code generation that is able to provide type-safe client_golang code based on Weaver and Semantic conventions.
Here is the code for generation: https://github.com/tombrk/promconv/blob/main/generate.go Type-safe instrumentation libraries: https://github.com/tombrk/promconv/tree/main/otel Example using one of those generated libraries: https://github.com/tombrk/promconv/blob/main/examples/http-server/main.go