OTel Memory Leak
Description
Memory Leak in otel library code.
Environment
- OS: Linux
- Architecture: x86
- Go Version: 1.23.2
- go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp v0.56.0
- go.opentelemetry.io/otel v1.31.0
- go.opentelemetry.io/otel/exporters/otlp/otlpmetric/otlpmetricgrpc v1.31.0
- go.opentelemetry.io/otel/exporters/otlp/otlptrace v1.31.0
- go.opentelemetry.io/otel/exporters/otlp/otlptrace/otlptracegrpc v1.31.0
- go.opentelemetry.io/otel/sdk v1.31.0
- go.opentelemetry.io/otel/trace v1.31.0
Steps To Reproduce
See Comment here: https://github.com/open-telemetry/opentelemetry-go-contrib/issues/5190#issuecomment-2463638063
I am pretty sure this is still an issue, either here or somewhere else in the Go OTel ecosystem. I will get pprof set up, possibly tomorrow, but here's some anecdotal evidence I have:
It's pretty easy to see from that graph when tracing was implemented. And yes, I have removed our tracing implementation and memory usage is back to normal.
Here is a rough draft of our setup. Please let me know if I am doing anything egregiously dumb, but for the most part it's all pretty standard stuff taken from various docs:
go.mod

go 1.23.2

require (
    go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp v0.56.0
    go.opentelemetry.io/otel v1.31.0
    go.opentelemetry.io/otel/exporters/otlp/otlpmetric/otlpmetricgrpc v1.31.0
    go.opentelemetry.io/otel/exporters/otlp/otlptrace v1.31.0
    go.opentelemetry.io/otel/exporters/otlp/otlptrace/otlptracegrpc v1.31.0
    go.opentelemetry.io/otel/sdk v1.31.0
    go.opentelemetry.io/otel/trace v1.31.0
)

We are wrapping otelhttp.NewHandler around the top-level muxer, so everything is traced. Yes, I know this is expensive, but it shouldn't leak memory. Eventually we will change this to include/exclude/drop routes (ping routes, health checks, etc.) so we aren't taking in so much volume, and do more aggressive down sampling.
func NewApi(c config.Config) *Api {
    return &Api{
        c:           &c,
        controllers: []Controller{newControllers(c)},
        server: &http.Server{
            ReadTimeout:  c.ReadTimeout,
            WriteTimeout: c.WriteTimeout,
            IdleTimeout:  c.IdleTimeout,
            Addr:         fmt.Sprintf(":%d", c.Port),
            Handler:      otelhttp.NewHandler(chi.NewMux(), "INGRESS", otelhttp.WithFilter(traceFilter)),
        },
        done:       make(chan bool),
        sigChannel: make(chan os.Signal, 1024),
    }
}

Here is how we are initializing our trace and metrics providers once on boot:
// TracerProvider creates an OTLP exporter and configures the corresponding trace provider.
func TracerProvider(ctx context.Context, res *resource.Resource) (func(context.Context) error, error) {
    // Set up a trace exporter
    traceExporter, err := otlptrace.New(ctx, otlptracegrpc.NewClient())
    if err != nil {
        return nil, errors.Wrap(err, "failed to create trace exporter")
    }
    // Register the trace exporter with a TracerProvider, using a batch
    // span processor to aggregate spans before export.
    tracerProvider := sdktrace.NewTracerProvider(
        sdktrace.WithSampler(sdktrace.AlwaysSample()),
        sdktrace.WithResource(res),
        sdktrace.WithBatcher(traceExporter),
    )
    otel.SetTracerProvider(tracerProvider)
    otel.SetTextMapPropagator(
        propagation.NewCompositeTextMapPropagator(
            propagation.TraceContext{},
            propagation.Baggage{},
        ))
    // Shutdown will flush any remaining spans and shut down the exporter.
    return tracerProvider.Shutdown, nil
}

// MeterProvider creates an OTLP exporter and configures the corresponding meter provider.
func MeterProvider(ctx context.Context, res *resource.Resource) (func(context.Context) error, error) {
    metricExporter, err := otlpmetricgrpc.New(ctx)
    if err != nil {
        return nil, errors.Wrap(err, "failed to create metric exporter")
    }
    meterProvider := sdkmetric.NewMeterProvider(
        sdkmetric.WithReader(sdkmetric.NewPeriodicReader(metricExporter)),
        sdkmetric.WithResource(res),
    )
    otel.SetMeterProvider(meterProvider)
    return meterProvider.Shutdown, nil
}

Then called and shut down in main:
shutDownTracer, err := traceinstrument.TracerProvider(ctx, traceRes)
if err != nil {
    log.Logger.Fatal("failed to create trace provider", zap.Error(err))
}
defer func(onShutdown func(ctx context.Context) error) {
    if errr := onShutdown(ctx); errr != nil {
        log.Logger.Error("error shutting down trace provider", zap.Error(errr))
    }
}(shutDownTracer)

shutdownTraceMetrics, err := traceinstrument.MeterProvider(ctx, traceRes)
if err != nil {
    log.Logger.Fatal("failed to create meter provider", zap.Error(err))
}
defer func(onShutdown func(ctx context.Context) error) {
    if errr := onShutdown(ctx); errr != nil {
        log.Logger.Error("error shutting down metrics provider", zap.Error(errr))
    }
}(shutdownTraceMetrics)

Note: we are also using otelhttp.NewTransport to wrap the default logging transport:
http.DefaultTransport = otelhttp.NewTransport(http.DefaultTransport)

If we remove the tracing setup, memory usage goes back to normal, so the leak is definitely in our tracing setup.
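Not part of the original report, but since more aggressive down sampling is mentioned above: a minimal sketch of what that could look like, reusing traceExporter and res from the snippet above and swapping the AlwaysSample sampler for a parent-based ratio sampler (the 10% ratio is an arbitrary example):

// Hypothetical sketch: sample ~10% of new root traces instead of all of them.
// ParentBased keeps the decision consistent with any incoming parent span.
tracerProvider := sdktrace.NewTracerProvider(
    sdktrace.WithSampler(sdktrace.ParentBased(sdktrace.TraceIDRatioBased(0.10))),
    sdktrace.WithResource(res),
    sdktrace.WithBatcher(traceExporter),
)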
Expected behavior
Memory does not continuously increase over time.
Can you update the image to be public, upload a new image, or maybe the raw profiling data?
Hmm, if you go directly to the linked comment, the image shows. Not sure what the difference is, but please disregard (:shrug:).
@tjsampson are you using cumulative temporality (the default IIRC for the metric gRPC exporter)?
Is the cardinality of your metrics unbounded?
What is the definition of traceFilter? Does it hold state?
Do you have any pprof data for the memory usage?
What is the definition of traceFilter? Does it hold state?
func traceFilter(req *http.Request) bool {
    skipTraceAgents := []string{"kube-probe", "ELB-HealthChecker"}
    ua := req.UserAgent()
    for _, skipAgent := range skipTraceAgents {
        if strings.Contains(ua, skipAgent) {
            return false
        }
    }
    return true
}
Possibly the issue?
@tjsampson are you using cumulative temporality (the default IIRC for the metric gRPC exporter)?
Yes. We are using the default.
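For reference, if cumulative temporality turns out to matter here, the OTLP gRPC metric exporter can be switched to delta temporality per instrument kind via a temporality selector. A minimal sketch, not taken from the reporter's setup (sdkmetric is go.opentelemetry.io/otel/sdk/metric, metricdata is go.opentelemetry.io/otel/sdk/metric/metricdata):

// Hypothetical sketch: export delta temporality for counters and histograms,
// cumulative for everything else.
metricExporter, err := otlpmetricgrpc.New(ctx,
    otlpmetricgrpc.WithTemporalitySelector(func(ik sdkmetric.InstrumentKind) metricdata.Temporality {
        switch ik {
        case sdkmetric.InstrumentKindCounter,
            sdkmetric.InstrumentKindHistogram,
            sdkmetric.InstrumentKindObservableCounter:
            return metricdata.DeltaTemporality
        default:
            return metricdata.CumulativeTemporality
        }
    }),
)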
Is the cardinality of your metrics unbounded?
No, they aren't unbounded. We aren't actually using the otel meter provider for any custom metrics (still using prometheus for that). We are using it for the default metrics so that we can link metrics to traces inside Grafana. So, unless the default metrics are unbounded, we should be safe (no custom metrics).
What is the definition of traceFilter? Does it hold state?
Possibly the issue?
I don't see a way for this to grow. Guessing this isn't the issue.
Are you able to collect pprof memory data? It is hard to say where the allocations are going at this point without it.
@MrAlias The leak is pretty slow. We just deployed the service a couple of days ago, as seen in the graph, but it's steadily climbing. I've been making tweaks/changes to that code. I will post what we are currently running, just for posterity.
main.go
var (
    ctx, cancel = context.WithCancel(context.Background())
    cfg         = config.Boot()
    err         error
)
defer cancel()

traceRes, err := traceinstrument.TraceResource(ctx)
if err != nil {
    log.Logger.Panic("failed to create trace resource", zap.Error(err))
}

shutDownTracer, err := traceinstrument.TracerProvider(ctx, traceRes)
if err != nil {
    log.Logger.Panic("failed to create trace provider", zap.Error(err))
}
defer func(onShutdown func(ctx context.Context) error) {
    if errr := onShutdown(ctx); errr != nil {
        log.Logger.Error("error shutting down trace provider", zap.Error(errr))
    }
}(shutDownTracer)

shutdownTraceMetrics, err := traceinstrument.MeterProvider(ctx, traceRes)
if err != nil {
    log.Logger.Panic("failed to create meter provider", zap.Error(err))
}
defer func(onShutdown func(ctx context.Context) error) {
    if errr := onShutdown(ctx); errr != nil {
        log.Logger.Error("error shutting down metrics provider", zap.Error(errr))
    }
}(shutdownTraceMetrics)
.... do other stuff.....
instrument.go
func TraceResource(ctx context.Context) (*resource.Resource, error) {
    var (
        ciEnv            = os.Getenv("CI_ENVIRONMENT")
        cloudEnvironment = os.Getenv("CLOUD_ENVIRONMENT")
        attribs          = []attribute.KeyValue{serviceName, serviceVersion}
    )
    if ciEnv != "" {
        attribs = append(attribs, attribute.String("environment.ci", ciEnv))
    }
    if cloudEnvironment != "" {
        attribs = append(attribs, attribute.String("environment.cloud", cloudEnvironment))
    }
    return resource.New(ctx, resource.WithAttributes(attribs...))
}
// TracerProvider creates an OTLP exporter and configures the corresponding trace provider.
func TracerProvider(ctx context.Context, res *resource.Resource) (func(context.Context) error, error) {
    // If not enabled, use a no-op tracer provider.
    if !tracingEnabled() {
        log.Logger.Warn("ENABLE_TRACING false, using noop tracer provider")
        tp := traceNoop.NewTracerProvider()
        otel.SetTracerProvider(tp)
        return func(ctx context.Context) error {
            return nil
        }, nil
    }
    // Set up a trace exporter
    traceExporter, err := otlptrace.New(ctx, otlptracegrpc.NewClient())
    if err != nil {
        return nil, errors.Wrap(err, "failed to create trace exporter")
    }
    // Register the trace exporter with a TracerProvider, using a batch
    // span processor to aggregate spans before export.
    tracerProvider := sdktrace.NewTracerProvider(
        sdktrace.WithSampler(sdktrace.AlwaysSample()),
        sdktrace.WithResource(res),
        sdktrace.WithBatcher(traceExporter),
    )
    otel.SetTracerProvider(tracerProvider)
    otel.SetTextMapPropagator(
        propagation.NewCompositeTextMapPropagator(
            propagation.TraceContext{},
            propagation.Baggage{},
        ))
    // Shutdown will flush any remaining spans and shut down the exporter.
    return tracerProvider.Shutdown, nil
}
// MeterProvider creates an OTLP exporter and configures the corresponding meter provider.
func MeterProvider(ctx context.Context, res *resource.Resource) (func(context.Context) error, error) {
    // If not enabled, use a no-op meter provider.
    if !tracingEnabled() {
        log.Logger.Warn("ENABLE_TRACING false, using noop meter provider")
        mp := metricNoop.NewMeterProvider()
        otel.SetMeterProvider(mp)
        return func(ctx context.Context) error {
            return nil
        }, nil
    }
    metricExporter, err := otlpmetricgrpc.New(ctx)
    if err != nil {
        return nil, errors.Wrap(err, "failed to create metric exporter")
    }
    meterProvider := sdkmetric.NewMeterProvider(
        sdkmetric.WithReader(sdkmetric.NewPeriodicReader(metricExporter)),
        sdkmetric.WithResource(res),
    )
    otel.SetMeterProvider(meterProvider)
    return meterProvider.Shutdown, nil
}
func tracingEnabled() bool {
    traceEnabled := os.Getenv("ENABLE_TRACING")
    enabled, err := strconv.ParseBool(traceEnabled)
    if err != nil {
        return false
    }
    return enabled
}
func CloudCustomerSpanEvent(ctx context.Context, evt string) {
    span := trace.SpanFromContext(ctx)
    bag := baggage.FromContext(ctx)
    tc := attribute.Key("customer")
    cust := bag.Member("customer")
    span.AddEvent(evt, trace.WithAttributes(tc.String(cust.Value())))
}
server.go
server := &http.Server{
    ReadHeaderTimeout: time.Second * 5,
    ReadTimeout:       c.ReadTimeout,
    WriteTimeout:      c.WriteTimeout,
    IdleTimeout:       c.IdleTimeout,
    Addr:              fmt.Sprintf(":%d", c.Port),
    Handler:           otelhttp.NewHandler(chi.NewMux(), "INGRESS"),
}
go.mod
go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp v0.58.0
go.opentelemetry.io/otel v1.32.0
go.opentelemetry.io/otel/trace v1.32.0
go.opentelemetry.io/contrib/propagators/b3 v1.32.0
go.opentelemetry.io/otel/exporters/otlp/otlpmetric/otlpmetricgrpc v1.32.0
go.opentelemetry.io/otel/exporters/otlp/otlptrace v1.32.0
go.opentelemetry.io/otel/exporters/otlp/otlptrace/otlptracegrpc v1.32.0
go.opentelemetry.io/otel/metric v1.32.0
go.opentelemetry.io/otel/sdk v1.32.0
go.opentelemetry.io/otel/sdk/metric v1.32.0
I've got this hooked up to Grafana Pyroscope and am doing some continuous profiling. It typically takes a few days for the leak to really show itself, just because of how slow it is. From what I can tell early on, it seems to be centered around these calls (see the sketch after this list):
- go.opentelemetry.io/otel/sdk/metric/exemplar.newStorage
- google.golang.org/grpc/mem.NewTieredBufferPool.newSizedBufferPool
- go.opentelemetry.io/otel/sdk/trace.NewBatchSpanProcessor
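Since sdktrace.NewBatchSpanProcessor shows up in that list, one thing worth ruling out is simply how much the span queue is allowed to buffer. A sketch of bounding it explicitly when registering the batcher (option names are from go.opentelemetry.io/otel/sdk/trace, time is the standard library package, the numbers are arbitrary examples, and the SDK defaults are a 2048-span queue exported in batches of up to 512):

// Hypothetical sketch: cap the batch span processor's buffering explicitly.
tracerProvider := sdktrace.NewTracerProvider(
    sdktrace.WithResource(res),
    sdktrace.WithBatcher(traceExporter,
        sdktrace.WithMaxQueueSize(1024),          // max spans waiting to be exported
        sdktrace.WithMaxExportBatchSize(256),     // max spans per export call
        sdktrace.WithBatchTimeout(5*time.Second), // flush interval
    ),
)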
I am going to try to get some heap dumps periodically over the next few days. However, given that it's the holidays, I am not sure if the lower levels of volume/traffic in these test/dev environments will produce the same effect, so this might have to wait until after the new year.
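For anyone wanting to capture the same heap dumps, a minimal standalone sketch of exposing the standard net/http/pprof endpoints on a localhost-only side port (the port is an arbitrary example); profiles can then be pulled with go tool pprof:

package main

import (
    "log"
    "net/http"
    _ "net/http/pprof" // registers /debug/pprof/* on http.DefaultServeMux
)

func main() {
    // Fetch a heap profile with:
    //   go tool pprof http://localhost:6060/debug/pprof/heap
    go func() {
        log.Println(http.ListenAndServe("localhost:6060", nil))
    }()
    select {} // stand-in for the real service; keeps the process alive
}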
Experiencing a similar issue. From analysing pprof heap files, it is possibly connected to metric.PeriodicReader and the compute aggregation in metric.pipeline.
Not sure if this is the same issue that @tjsampson is having, but the symptoms are similar (slow memory leak that can be observed over days).
@boscar Yeah, that sounds and looks similar to our issue. I need to turn tracing back on. Right now I have the no-op tracer and metrics provider set up, and the memory leak is gone. I am actually going to turn the trace provider on again and keep the metrics provider as the no-op, because I am pretty certain the issue is with the metrics provider, which is what your pprof is alluding to as well.
I am in the same situation, with memory allocation growing slowly over days until the pods are OOMKilled.
All of this is observed with trace, log and metric data being pushed over gRPC to otelcol:
init code sections
logExporter, err := otlploggrpc.New(ctx, otlploggrpc.WithGRPCConn(otlpConn))
if err != nil {
    return ErrWrap(err, "failed to initiate logger exporter")
}
processor := sdklog.NewBatchProcessor(logExporter, sdklog.WithExportInterval(c.BatchExportInterval))
loggerProvider = sdklog.NewLoggerProvider(sdklog.WithProcessor(processor), sdklog.WithResource(otlpResource))
global.SetLoggerProvider(loggerProvider)

metricExporter, err := otlpmetricgrpc.New(ctx, otlpmetricgrpc.WithGRPCConn(otlpConn))
if err != nil {
    return log.ErrWrap(err, "failed to initiate meter exporter")
}
meterProvider = sdkmetric.NewMeterProvider(
    sdkmetric.WithReader(sdkmetric.NewPeriodicReader(metricExporter, sdkmetric.WithInterval(c.BatchExportInterval))),
    sdkmetric.WithResource(otlpResource),
)
otel.SetMeterProvider(meterProvider)
meter = meterProvider.Meter(common.GetServiceName(otlpResource), otelmetric.WithInstrumentationAttributes(otlpResource.Attributes()...))

traceExporter, err := otlptracegrpc.New(ctx, otlptracegrpc.WithGRPCConn(otlpConn))
if err != nil {
    return log.ErrWrap(err, "failed to initiate trace exporter")
}
tracerProvider = sdktrace.NewTracerProvider(
    sdktrace.WithBatcher(traceExporter, sdktrace.WithBatchTimeout(c.BatchExportTimeout)),
    sdktrace.WithResource(otlpResource),
    sdktrace.WithSampler(sdktrace.AlwaysSample()),
    sdktrace.WithSpanProcessor(new(centiBaggagePropagator)), // this way, attributes are passed from the baggaged context
)
otel.SetTracerProvider(tracerProvider)
tracer = tracerProvider.Tracer(common.GetServiceName(otlpResource))
On our side, the problem seems related to metric exemplars.
pprof trace
Fetching profile over HTTP from http://localhost:8081/debug/pprof/heap
Saved profile in /Users/xxx/pprof/pprof.collect.alloc_objects.alloc_space.inuse_objects.inuse_space.089.pb.gz
File: collect
Type: inuse_space
Time: May 21, 2025 at 5:25pm (CEST)
Entering interactive mode (type "help" for commands, "o" for options)
(pprof) top
Showing nodes accounting for 127.10MB, 96.48% of 131.73MB total
Dropped 45 nodes (cum <= 0.66MB)
Showing top 10 nodes out of 58
flat flat% sum% cum cum%
89.69MB 68.09% 68.09% 89.69MB 68.09% go.opentelemetry.io/otel/sdk/metric/exemplar.newStorage (inline)
9.02MB 6.84% 74.93% 9.02MB 6.84% go.opentelemetry.io/otel/sdk/metric/internal/aggregate.reset[go.shape.struct { FilteredAttributes []go.opentelemetry.io/otel/attribute.KeyValue; Time time.Time; Value go.shape.int64; SpanID []uint8 "json:\",omitempty\""; TraceID []uint8 "json:\",omitempty\"" }]
7MB 5.31% 80.25% 7MB 5.31% go.opentelemetry.io/otel/sdk/metric/internal/aggregate.newBuckets[go.shape.int64] (inline)
6.48MB 4.92% 85.16% 6.48MB 4.92% go.opentelemetry.io/otel/sdk/metric/internal/aggregate.reset[go.shape.struct { Attributes go.opentelemetry.io/otel/attribute.Set; StartTime time.Time; Time time.Time; Count uint64; Bounds []float64; BucketCounts []uint64; Min go.opentelemetry.io/otel/sdk/metric/metricdata.Extrema[go.shape.int64]; Max go.opentelemetry.io/otel/sdk/metric/metricdata.Extrema[go.shape.int64]; Sum go.shape.int64; Exemplars []go.opentelemetry.io/otel/sdk/metric/metricdata.Exemplar[go.shape.int64] "json:\",omitempty\"" }]
4.63MB 3.51% 88.67% 4.63MB 3.51% google.golang.org/grpc/mem.NewTieredBufferPool.newSizedBufferPool.func1
4.50MB 3.42% 92.09% 4.50MB 3.42% go.opentelemetry.io/otel/attribute.computeDistinctFixed
1.78MB 1.35% 93.44% 100.98MB 76.65% go.opentelemetry.io/otel/sdk/metric/internal/aggregate.(*histValues[go.shape.int64]).measure
1.50MB 1.14% 94.58% 91.19MB 69.23% go.opentelemetry.io/otel/sdk/metric/exemplar.NewHistogramReservoir (inline)
1.50MB 1.14% 95.72% 1.50MB 1.14% strings.(*Builder).grow
1MB 0.76% 96.48% 1MB 0.76% slices.Clone[go.shape.[]uint64,go.shape.uint64]
The function go.opentelemetry.io/otel/sdk/metric/exemplar.newStorage is holding a lot of memory that is never released.
It appears this is connected to the use of attributes whose values vary too much:
metricLatency.Record(ctx, gap.Milliseconds(), otelmetric.WithAttributeSet(*tbx.GetAttrs()))
Now we are struggling to disable exemplars… The AlwaysOffFilter seems ineffective…
meterProvider = sdkmetric.NewMeterProvider(
    sdkmetric.WithReader(sdkmetric.NewPeriodicReader(metricExporter, sdkmetric.WithInterval(c.BatchExportInterval))),
    sdkmetric.WithResource(otlpResource),
    sdkmetric.WithExemplarFilter(exemplar.AlwaysOffFilter),
)
@fredczj any progress? We disabled exemplars, which did not remove the memory leak for us. Any luck on your end?
No. I wonder if we did it correctly, because sdkmetric.WithExemplarFilter(exemplar.AlwaysOffFilter) didn't change a thing. It looks like it doesn't work at the moment.
To decrease the memory leak, we have removed an attribute with a high cardinality from our metrics.
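Removing the attribute at the call sites works; an alternative sketch is to strip it at the SDK level with a view, so the instrumented code stays unchanged. The instrument name and attribute key below are made-up placeholders, not from this thread (sdkmetric is go.opentelemetry.io/otel/sdk/metric, attribute is go.opentelemetry.io/otel/attribute):

// Hypothetical sketch: drop a high-cardinality attribute ("request.id", a placeholder)
// from one instrument before aggregation.
meterProvider = sdkmetric.NewMeterProvider(
    sdkmetric.WithReader(sdkmetric.NewPeriodicReader(metricExporter)),
    sdkmetric.WithResource(otlpResource),
    sdkmetric.WithView(sdkmetric.NewView(
        sdkmetric.Instrument{Name: "request.latency"}, // placeholder instrument name
        sdkmetric.Stream{AttributeFilter: attribute.NewDenyKeysFilter("request.id")},
    )),
)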
I'm experiencing a similar situation with memory leaks after plugging in the Go OTel libraries.
What I noticed, especially with regards to HTTP instrumentation, is that Go uses strings.Cut in parseRequestLine.
These cut strings point directly into the HTTP buffer used to process each request. So, for as long as these strings stick around in memory (along with any of their copies), the HTTP buffers they reference cannot be garbage collected.
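If that is what is happening, the usual mitigation is to copy any substring that will outlive the request before retaining it. A small standalone illustration of the effect (not OTel code; it just shows how a retained substring pins its backing array and how strings.Clone detaches it):

package main

import (
    "fmt"
    "strings"
)

var retained []string

func handle(buf string) {
    method, _, _ := strings.Cut(buf, " ") // "GET" shares buf's backing array
    retained = append(retained, method)   // keeps the entire buf allocation reachable

    detached := strings.Clone(method)    // copies just the 3 bytes
    retained[len(retained)-1] = detached // now nothing pins buf
}

func main() {
    // Pretend buf is a large per-request read buffer.
    handle("GET /healthz HTTP/1.1" + strings.Repeat(" ", 1<<20))
    fmt.Println(retained)
}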

