Rob Johnson issues

Results: 14 issues by Rob Johnson

turn metastatus into an api endpoint

separate `paasta_metastatus` into a lib and api, with the cli entrypoint just there to provide a nicer interface.

actively replace 'unknown' Marathon tasks

Marathon seems to lose the Healthcheck actor for a given task occasionally, leaving it in an 'unknown' state. Let's actively go after these in the bounce and replace them where...

have the autoscaler alert if it needs more capacity than the max count

If the autoscaler would have scaled up the instance count normally, but hasn't been able to because of the `max_instances` limit, it would be good to send a sensu event...
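A minimal sketch of the check this describes, assuming the autoscaler has already computed a desired instance count. The function name `clamp_instance_count` and its return shape are hypothetical, not part of paasta. It only reports whether the `max_instances` limit was hit; sending the sensu event would be up to the caller.

```python
def clamp_instance_count(desired, min_instances, max_instances):
    """Clamp a desired instance count to the configured limits.

    Returns (clamped_count, capped_by_max). capped_by_max is True when the
    autoscaler wanted more instances than max_instances allows, which is
    the condition the issue suggests alerting on.
    Hypothetical helper for illustration only.
    """
    capped_by_max = desired > max_instances
    clamped = max(min_instances, min(desired, max_instances))
    return clamped, capped_by_max
```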

Change the source of truth for required capacity in cluster autoscaler

@mattmb following up on our chat yesterday: - The cluster autoscaler currently looks at usage per slave - it doesn't make any decisions based on 'what' the tasks are. -...

Alert authors of downstream jobs when a scheduled job has failed

I'm proposing that when a Job fails (either through the run failing, or the scheduler not running it as expected), we should notify all the owners of downstream jobs, too....

Add a monitoring check to paasta metastatus for number of Marathon masters

we only check the number of mesos masters - we should add a check that asserts the number of Marathon/Chronos masters too

the hostname argument to 'paasta maintenance status' does nothing

```
robj@paasta56-r5-sfo2:~ % sudo paasta_maintenance status paasta56-r5-sfo2.prod.yelpcorp.com
10-64-133-242-uswest1bprod.prod.yelpcorp.com (10.64.133.242): Draining
paasta56-r5-sfo2.prod.yelpcorp.com (10.44.5.33): Draining
```

autoscaler should silence kazoo logs

they make the log super noisy - let's make kazoo quieter.
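One way this could be done, sketched with the standard `logging` module and assuming kazoo's loggers all live under the `kazoo` name (its modules log via `logging.getLogger(__name__)`). The helper name `quiet_kazoo` and the default `WARNING` threshold are assumptions, not what the autoscaler does today.

```python
import logging


def quiet_kazoo(level=logging.WARNING):
    """Raise the log threshold for the whole "kazoo" logger hierarchy.

    Child loggers such as "kazoo.client" have no level of their own by
    default, so they inherit the level set on the parent "kazoo" logger.
    """
    logging.getLogger("kazoo").setLevel(level)


# Call once during startup, before the ZooKeeper client is created.
quiet_kazoo()
```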

metastatus should page with 0 mesos slaves

metastatus should group multiple attributes together

we often need to answer 'how much capacity do we have in pool X, region Y'. That's kind of difficult right now (you can do ``paasta metastatus -g region pool``...
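A rough sketch of grouping by several attributes at once, keyed on the tuple of attribute values. The slave dict layout (`attributes` and `resources` keys) and the name `group_capacity` are assumptions made for illustration; they are not taken from the metastatus code.

```python
from collections import defaultdict


def group_capacity(slaves, attributes):
    """Sum cpus and mem per combination of the given attributes.

    slaves: iterable of dicts shaped like
        {"attributes": {...}, "resources": {"cpus": float, "mem": float}}
    (an assumed layout). attributes: attribute names to group by together,
    e.g. ["region", "pool"]. A slave missing an attribute is grouped
    under "unknown" for that position.
    """
    totals = defaultdict(lambda: {"cpus": 0.0, "mem": 0.0})
    for slave in slaves:
        key = tuple(slave["attributes"].get(name, "unknown") for name in attributes)
        totals[key]["cpus"] += slave["resources"]["cpus"]
        totals[key]["mem"] += slave["resources"]["mem"]
    return dict(totals)
```

Keying on a tuple means 'pool X, region Y' becomes a single lookup, e.g. `group_capacity(slaves, ["region", "pool"])[("Y", "X")]`.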