OpenSearch icon indicating copy to clipboard operation
OpenSearch copied to clipboard

[Feature Request] Latency metrics in ClusterApplierService

Open gargharsh3134 opened this issue 1 year ago • 0 comments

Is your feature request related to a problem? Please describe

Given the introduction of Request Tracing Framework (RTF) using OpenTelemetry (OTel), metrics (histogram/counter) can now be published and used to track high latency operations. Committing any change in ClusterState involves running Appliers and Listeners, which are supposed to be very light weight operations. Tracking latency metrics for such operations will help in identifying potential bottlenecks which can slow down the ability of ClusterManager to process the pending tasks queue.

Describe the solution you'd like

OTel Histogram Metrics: Support for Histogram type metrics, which was added as part of #12062, can be utilised to publish the metrics from ClusterApplierService for each applier and listener.

Related component

Cluster Manager

Describe alternatives you've considered

No response

Additional context

No response

gargharsh3134 avatar Feb 15 '24 04:02 gargharsh3134