altinity-dashboard
Health Checks
Use the operator's credentials to connect to the ClickHouse instance and run health checks:
- Access point is available (use chi level service)
- Distributed query check:
  SELECT count() FROM cluster('all-sharded', cluster('all-sharded', system.one))
- Zookeeper check -- run only if zookeeper is a part of CHI spec.configuration:
  SELECT count() FROM system.zookeeper WHERE path = '/'
- No readonly replicas:
  SELECT max(value) FROM cluster('{cluster}', system.metrics) WHERE metric = 'ReadonlyReplica'
- No delayed inserts:
  SELECT value FROM system.metrics WHERE metric = 'DelayedInserts'
- Healthy schema: MaxPartCountForPartition: >150 yellow, >300 red:
  SELECT value FROM system.asynchronous_metrics WHERE metric = 'MaxPartCountForPartition'
Show a warning when any health check fails.
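As a rough illustration of how the metric values returned by the queries above might be turned into warnings, here is a minimal sketch. It does not connect to ClickHouse; it only classifies values that have already been fetched. The function names (`classify_part_count`, `evaluate`, `warnings`) and the status strings are hypothetical, and treating any nonzero readonly-replica or delayed-insert count as red is an assumption, since only the MaxPartCountForPartition thresholds are stated in the list.

```python
from typing import Dict, List

# Thresholds from the list above: >150 is yellow, >300 is red.
PART_COUNT_YELLOW = 150
PART_COUNT_RED = 300


def classify_part_count(value: float) -> str:
    """Map a MaxPartCountForPartition value to green, yellow, or red."""
    if value > PART_COUNT_RED:
        return "red"
    if value > PART_COUNT_YELLOW:
        return "yellow"
    return "green"


def evaluate(readonly_replicas: float, delayed_inserts: float,
             max_part_count: float) -> Dict[str, str]:
    """Return one status per check.

    Assumption: any nonzero readonly-replica or delayed-insert count is red.
    """
    return {
        "readonly_replicas": "green" if readonly_replicas == 0 else "red",
        "delayed_inserts": "green" if delayed_inserts == 0 else "red",
        "max_part_count": classify_part_count(max_part_count),
    }


def warnings(statuses: Dict[str, str]) -> List[str]:
    """Build one warning line for every check that is not green."""
    return [f"{name}: {status}"
            for name, status in statuses.items()
            if status != "green"]


# Example with made-up metric values.
statuses = evaluate(readonly_replicas=0, delayed_inserts=2, max_part_count=180)
print(warnings(statuses))
```

Keeping the classification separate from the query execution means the thresholds can be adjusted or tested without a running cluster.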