etl icon indicating copy to clipboard operation
etl copied to clipboard

GCE discovery and/or prometheus scraping not working for private network

Open gfr10598 opened this issue 5 years ago • 1 comments

The universal parser uses a private network, so it can make requests to gardener. This breaks the prometheus scraping. Probably interferes with GCE discover, and may also make the instances inaccessible.

We can get the metrics through the app-engine load balancer, but that is not instance specific, so likely not relevant.

gfr10598 avatar Dec 03 '19 20:12 gfr10598

Discussed in prometheus-sync -- the main problem here is that the default network where gcp-service-discovery runs today cannot reach the private vpc network. While there is currently a minimal prometheus config running in the per-datatype data-processing-cluster there is not yet a prometheus instance in the new data-processing cluster. The insight is that that instance should run the gcp-service-discovery daemon as well (possibly with a modification to gcp-sd to filter services) so that it can reach the AEFlex instances over the private network. This is "working with" the architecture rather than trying to get prometheus-federaiton cluster to connect to the private vpc network.

stephen-soltesz avatar Dec 04 '19 17:12 stephen-soltesz