Deprecate munin

Open pnorman opened this issue 4 years ago • 10 comments

The OWG intends to deprecate our use of munin, being satisfied that prometheus offers all the functionality we need.

To do this we need to get 6 months to a year of data in prometheus so we always have enough historical data to predict future demand.

pnorman avatar Jan 15 '21 19:01 pnorman

I updated links on OSM Wiki ( https://wiki.openstreetmap.org/w/index.php?title=Automated_Edits_code_of_conduct&diff=prev&oldid=2096315 https://wiki.openstreetmap.org/w/index.php?title=Servers/Tile_CDN&diff=prev&oldid=2095843 etc ).

https://hardware.openstreetmap.org/ also needs to be updated (or is it also getting deprecated?)

matkoniecz avatar Jan 25 '21 09:01 matkoniecz

Maybe you could leave it to us to decide when the migration is sufficiently advanced that things need changing?

tomhughes avatar Jan 25 '21 09:01 tomhughes

Things on OSM Wiki typically are deeply outdated, but feel free to revert my changes on pages that are actually maintained by someone with more specific sysadmin knowledge.

matkoniecz avatar Jan 25 '21 09:01 matkoniecz

I don't really know why the automated edits page even has that - it's hard to believe anybody actually does that and it's not really reasonable to expect people to interpret the graphs to identify a quiet time. I suspect that language goes back years and somebody tried to be "helpful" by adding the link to munin...

tomhughes avatar Jan 25 '21 10:01 tomhughes

It also seemed suspicious to me, but I decided that I am not knowledgeable enough to meddle with that.

Now, given independent confirmation, I posted to [email protected] to get feedback from DWG (they are subscribed to this list, so it gives them a chance to protest if it still makes sense).

matkoniecz avatar Jan 25 '21 10:01 matkoniecz

> I posted to [email protected] to get feedback from DWG

I suspect that Tom's comment above describes exactly what happened.

SomeoneElseOSM avatar Jan 25 '21 10:01 SomeoneElseOSM

Removed in https://wiki.openstreetmap.org/w/index.php?title=Automated_Edits_code_of_conduct&diff=prev&oldid=2098675

matkoniecz avatar Jan 25 '21 11:01 matkoniecz

To repeat a question from #360: since Munin is being deprecated, will we have a replacement public dashboard? It was sometimes useful for monitoring load on tile servers and the API.

Zverik avatar Jan 25 '21 14:01 Zverik

It's been public for some time

tomhughes avatar Jan 25 '21 14:01 tomhughes

> will we have a replacement public dashboard?

@Zverik, use https://prometheus.openstreetmap.org/

iandees avatar Jan 25 '21 15:01 iandees

The main use for munin right now is getting historical data for capacity planning.

See also https://github.com/openstreetmap/operations/issues/484

pnorman avatar Nov 18 '22 22:11 pnorman

A test run with munin removed from chef reduced total kitchen test runtime from ~7 hours to ~6 hours. https://github.com/openstreetmap/chef/actions/runs/3623104415/usage

Firefishy avatar Dec 05 '22 19:12 Firefishy

We'll wait until we have at least one year of data in prometheus before retiring munin, so that we can still do some capacity planning without pulling hair. This should happen around September 2023.

grischard avatar Dec 15 '22 21:12 grischard

Data only goes back as far as March 2023, so the new one-year date is March 2024.

Firefishy avatar Oct 02 '23 01:10 Firefishy

Munin has now been removed. I will shortly add a basic munin → prometheus web redirect.

Firefishy avatar Mar 13 '24 12:03 Firefishy