airflow-exporter

Service level metrics

Open elephantum opened this issue 7 years ago • 2 comments

Implement technical metrics for the web UI, scheduler, and executors, such as uptime, average time to reread the DagBag, etc.

elephantum avatar Aug 10 '18 09:08 elephantum

It would probably make sense to outline what the minimum of these metrics should be.

Also, what mechanisms are suggested for injecting an in-process HTTP interface into the scheduler and executor? Providing an HTTP component for metrics is probably in line with providing an HTTP interface for a /health endpoint (https://issues.apache.org/jira/browse/AIRFLOW-1084?jql=project%20%3D%20AIRFLOW%20AND%20text%20~%20health).
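One possible shape for such an in-process interface, sketched with only the Python standard library: a daemon thread serves /metrics and /health next to the host process. This is an illustration and not how Airflow or airflow-exporter does it; the metric name `airflow_process_uptime_seconds` and the helper names `render_metrics` and `start_metrics_server` are hypothetical.

```python
import threading
import time
from http.server import BaseHTTPRequestHandler, HTTPServer

# Captured at import time; stands in for the start of the scheduler/executor process.
START_TIME = time.time()


def render_metrics():
    """Render a minimal payload in the Prometheus text exposition format."""
    uptime = time.time() - START_TIME
    return (
        "# HELP airflow_process_uptime_seconds Seconds since this process started.\n"
        "# TYPE airflow_process_uptime_seconds gauge\n"
        f"airflow_process_uptime_seconds {uptime:.3f}\n"
    )


class MetricsHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path == "/metrics":
            status = 200
            content_type = "text/plain; version=0.0.4; charset=utf-8"
            body = render_metrics().encode("utf-8")
        elif self.path == "/health":
            status, content_type, body = 200, "text/plain; charset=utf-8", b"ok\n"
        else:
            status, content_type, body = 404, "text/plain; charset=utf-8", b"not found\n"
        self.send_response(status)
        self.send_header("Content-Type", content_type)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # do not write access logs into the host process's stderr


def start_metrics_server(host="127.0.0.1", port=0):
    """Serve metrics from a daemon thread; port=0 lets the OS pick a free port."""
    server = HTTPServer((host, port), MetricsHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server
```

The host process would call `start_metrics_server()` once at startup. Because the thread is a daemon, it does not keep the process alive on exit. Where and how to hook this into the scheduler or executor is left open here, as in the question above.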

Also worth noting: there is one other Prometheus plugin for Airflow (https://github.com/airflow-plugins/pandora-plugin/blob/master/blueprints/metrics_blueprint.py), but to my knowledge it also lacks this functionality.

smith-m avatar Aug 10 '18 15:08 smith-m

Some of the metrics mentioned above are available through Airflow's own statsd statistics. I'm using the statsd-prometheus exporter to monitor them; the downside is that the names don't follow Prometheus naming conventions (e.g. airflow_dag_<dag name>_duration).
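To illustrate the naming problem: in the flat form the DAG name is presumably baked into the metric name, whereas Prometheus convention would keep one metric name and carry the DAG as a label. A minimal sketch of that rewrite follows. The target name `airflow_dag_duration`, the label `dag_id`, and the helper `to_prometheus_style` are all hypothetical, and the only input shape assumed is the one example quoted above.

```python
import re

# Assumed flat shape, taken from the example above: airflow_dag_<dag name>_duration
FLAT_DAG_DURATION = re.compile(r"^airflow_dag_(?P<dag_id>.+)_duration$")


def to_prometheus_style(flat_name, value):
    """Rewrite a flat metric name into a 'name{label="..."} value' sample line.

    Names that do not match the assumed shape are passed through unchanged.
    """
    match = FLAT_DAG_DURATION.match(flat_name)
    if match is None:
        return f"{flat_name} {value}"
    # Escape backslashes and double quotes, as label values require.
    dag_id = match.group("dag_id").replace("\\", "\\\\").replace('"', '\\"')
    return f'airflow_dag_duration{{dag_id="{dag_id}"}} {value}'


print(to_prometheus_style("airflow_dag_my_etl_duration", 12.5))
# prints: airflow_dag_duration{dag_id="my_etl"} 12.5
```

In practice this kind of rewrite would more likely live in the exporter's mapping configuration than in custom code; the function only shows the intended before/after shape.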

f1yegor avatar Sep 17 '19 08:09 f1yegor