solr-operator

Solr Prometheus exporter in CrashLoopBackOff

Open vipul-06 opened this issue 2 years ago • 3 comments

I have deployed the Solr Prometheus exporter for monitoring. It is running on GKE, but the exporter pod keeps going into CrashLoopBackOff. The logs of the exporter pod look like this:

INFO - 2022-11-10 05:45:56.943; org.apache.solr.common.cloud.ConnectionManager; Waiting for client to connect to ZooKeeper
INFO - 2022-11-10 05:45:56.991; org.apache.solr.common.cloud.ConnectionManager; zkClient has connected
INFO - 2022-11-10 05:45:56.991; org.apache.solr.common.cloud.ConnectionManager; Client is connected to ZooKeeper
INFO - 2022-11-10 05:45:57.003; org.apache.solr.common.cloud.ZkStateReader; Updated live nodes from ZooKeeper... (0) -> (5)

vipul-06 Nov 10 '22 06:11

Can you share the YAML for your SolrCloud and SolrPrometheusExporter resources? It's hard to debug without that.

HoustonPutman Nov 10 '22 18:11

Is that the same basic auth secret that the SolrCloud is set up to use? Have you changed the SolrCloud security after creating the cloud? The default roles provided by the Solr Operator should allow calling the ping handler.

Can you check your SolrCloud's security.json and see whether the user in that basic auth secret is allowed to use the ping handler?
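One way to test this from outside the exporter is to call a collection's ping handler yourself with the credentials from the secret. The following is only a minimal sketch: the base URL, collection name, and credentials are placeholders, and it assumes Solr is reachable from wherever you run it (for example through a port-forward). A 401 or 403 would suggest the user is not authenticated or not authorized for the ping handler.

```python
import base64
import urllib.error
import urllib.request


def build_ping_request(base_url: str, collection: str,
                       username: str, password: str) -> urllib.request.Request:
    """Build a GET request for a collection's ping handler using HTTP Basic auth."""
    # /solr/<collection>/admin/ping is the per-collection ping handler path.
    url = f"{base_url.rstrip('/')}/solr/{collection}/admin/ping?wt=json"
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    request = urllib.request.Request(url)
    request.add_header("Authorization", f"Basic {token}")
    return request


def ping_status(request: urllib.request.Request, timeout: float = 10.0) -> int:
    """Send the request and return the HTTP status code instead of raising on 4xx/5xx."""
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        return err.code


# Example with placeholder values (assumptions, not taken from this issue):
# req = build_ping_request("http://localhost:8983", "my-collection", "some-user", "some-password")
# print(ping_status(req))
```

The request is built separately from the call that sends it, so you can inspect the URL and header before anything goes over the network.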

HoustonPutman Nov 30 '22 19:11

I ran into a similar issue where the Prometheus exporter kept crashing. Changing the CPU limits on the Prometheus exporter did the trick for me. Otherwise it crashes on every collection round because it is being CPU-throttled.

fliphess May 09 '23 17:05