For technical issues with Wikimedia's blog aggregation ("planet") installation. NOTE: Requests to add a blog should either be filed on Meta wiki or made directly by submitting a patch.
Details
Jul 26 2024
May 21 2024
May 12 2024
May 11 2024
Mar 25 2024
@Jelto After also adding a job to scrape the metrics I am now seeing _some_ data trickle in here:
Change #1014116 merged by Dzahn:
[operations/puppet@production] prometheus/ops: add config for scraping apache metrics on planet servers
Change #1014116 had a related patch set uploaded (by Dzahn; author: Dzahn):
[operations/puppet@production] prometheus/ops: add config for scraping apache metrics on planet servers
root@mw1413:/# curl -s 10.64.32.132:9117/metrics | tail -n 1
process_virtual_memory_max_bytes -1
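The curl output above is in the Prometheus text exposition format, where each sample is a metric name followed by a value. A minimal Python sketch of reading such output is shown here; the function name `parse_metrics` is hypothetical, and it assumes samples carry no trailing timestamp, which holds for the line quoted above.

```python
def parse_metrics(text):
    """Parse 'name value' sample lines from Prometheus text exposition output.

    Comment lines (# HELP, # TYPE) and blank lines are skipped. Any label
    set is kept as part of the metric name. Assumes no trailing timestamps.
    """
    samples = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        # Split on the last space: everything before it is the metric name
        # (including labels), everything after it is the sample value.
        name, _, value = line.rpartition(" ")
        samples[name] = float(value)
    return samples


# The sample line is the one returned by the exporter on port 9117 above.
output = "process_virtual_memory_max_bytes -1\n"
metrics = parse_metrics(output)
print(metrics["process_virtual_memory_max_bytes"])
```

A non-empty result from the exporter's /metrics endpoint is enough to confirm that the service is up and answering before Prometheus is pointed at it.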
Now that https://gerrit.wikimedia.org/r/1009775 has been deployed, dropping the argument parameter entirely, the package is installed again on planet* hosts and the service no longer fails to start.
Change #1009775 merged by Dzahn:
[operations/puppet@production] prometheus/apache_exporter: drop argument parameter
Mar 8 2024
@Jelto I confirmed the difference: on buster and bullseye the exporter's command-line arguments start with "-", but on bookworm they now start with "--".
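The prefix difference can be illustrated with a small Python sketch. This is not the patch that was merged (that change dropped the argument parameter altogether); the helper `exporter_flag`, the flag name `scrape_uri` and the URL are used purely as illustrative assumptions.

```python
# Releases on which, per the comment above, exporter flags use a single dash.
SINGLE_DASH_RELEASES = {"buster", "bullseye"}


def exporter_flag(name, value, codename):
    """Build one exporter command-line flag for the given Debian release.

    buster and bullseye get "-name=value"; any other release (for example
    bookworm) gets "--name=value".
    """
    prefix = "-" if codename in SINGLE_DASH_RELEASES else "--"
    return f"{prefix}{name}={value}"


# Hypothetical flag name and URL, for illustration only.
for release in ("buster", "bullseye", "bookworm"):
    print(release, exporter_flag("scrape_uri", "http://localhost/server-status?auto", release))
```

Passing a single-dash flag to the bookworm package, or a double-dash flag to the older one, is the kind of mismatch that makes the service fail at startup.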
Change 1009775 had a related patch set uploaded (by Dzahn; author: Dzahn):
[operations/puppet@production] apache_exporter: fix argument syntax in bookworm
Currently the exporter fails with
Mar 7 2024
20:14 < denisse> Hello mutante, if the metrics are already available via HTTP, the next step would be to configure Prometheus to scrape these metrics by adding a new job.
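As a rough sketch of what "adding a new job" means, the Python snippet below builds a Prometheus scrape job using the standard `job_name`, `static_configs` and `targets` keys and prints it as JSON. The job name and target host are hypothetical placeholders, only the port 9117 comes from this task, and the real job in operations/puppet may be defined quite differently.

```python
import json


def scrape_job(job_name, targets):
    """Return a Prometheus scrape job as a plain dict.

    Uses the standard scrape_config keys: job_name, static_configs, targets.
    """
    return {
        "job_name": job_name,
        "static_configs": [{"targets": list(targets)}],
    }


# Hypothetical job name and host; 9117 is the exporter port seen in this task.
job = scrape_job("apache_planet", ["planet-host.example:9117"])
print(json.dumps({"scrape_configs": [job]}, indent=2))
```

Each target is a host:port pair; Prometheus requests the /metrics path on it by default.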
At least the package is installed now on planet* servers.
Change 1009575 merged by Dzahn:
[operations/puppet@production] planet: add prometheus apache exporter to role
Change 1009575 had a related patch set uploaded (by Dzahn; author: Dzahn):
[operations/puppet@production] planet: add prometheus apache exporter to role
Dec 15 2023
Change 983207 merged by Dzahn:
[operations/puppet@production] planet: remove support for buster
Dec 14 2023
This is done! Planet is now hosted on new bookworm VMs and the buster VMs have been deleted.
Change 982157 merged by Dzahn:
[operations/puppet@production] site: remove buster VMs from planet regex
cookbooks.sre.hosts.decommission executed by dzahn@cumin2002 for hosts: planet2002.codfw.wmnet
- planet2002.codfw.wmnet (PASS)
- Downtimed host on Icinga/Alertmanager
- Found Ganeti VM
- VM shutdown
- Started forced sync of VMs in Ganeti cluster codfw to Netbox
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB
- VM removed
- Started forced sync of VMs in Ganeti cluster codfw to Netbox
Change 983207 had a related patch set uploaded (by Dzahn; author: Dzahn):
[operations/puppet@production] planet: remove support for buster
cookbooks.sre.hosts.decommission executed by dzahn@cumin2002 for hosts: planet1002.eqiad.wmnet
- planet1002.eqiad.wmnet (PASS)
- Downtimed host on Icinga/Alertmanager
- Found Ganeti VM
- VM shutdown
- Started forced sync of VMs in Ganeti cluster eqiad to Netbox
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB
- VM removed
- Started forced sync of VMs in Ganeti cluster eqiad to Netbox
Dec 13 2023
cookbooks.sre.hosts.decommission executed by dzahn@cumin1001 for hosts: planet1002.eqiad.wmnet
- planet1002.eqiad.wmnet (FAIL)
- Downtimed host on Icinga/Alertmanager
- Host steps raised exception: Error while performing request to RAPI
That's indeed a bit weird, but she also notes, in a banner-like message at the top, that these articles were published years ago.
I can't help but notice that en.planet.wikimedia.org looks a little broken right now (it has posts from Sumana from long ago). Is that possibly related to this change? (I suspect the timing is just a coincidence.)
Dec 12 2023
We are now serving all lang versions from a bookworm VM in eqiad.
Mentioned in SAL (#wikimedia-operations) [2023-12-12T22:57:09Z] <mutante> planet - switched to eqiad and bookworm backend (T348392 T345617) - https://meta.wikimedia.org/wiki/Planet_Wikimedia
Change 982489 merged by Dzahn:
[operations/dns@master] planet: switch to new eqiad backend
Change 982489 had a related patch set uploaded (by Dzahn; author: Dzahn):
[operations/dns@master] planet: switch to new eqiad backend
Mentioned in SAL (#wikimedia-operations) [2023-12-12T22:43:52Z] <mutante> planet2003 -manually upgrade rawdog package to 3.0.2 T348392
Change 982484 merged by Dzahn:
[operations/puppet@production] planet: enable feed updates on both new VMs
Change 982156 merged by Dzahn:
[operations/dns@master] Switch planet to bookworm VM backends
After some more debugging, fixing privileges on some state files, and removing a broken feed from the cs lang version, our httpbb tests now work on both old and new machines:
Change 982484 had a related patch set uploaded (by Dzahn; author: Dzahn):
[operations/puppet@production] planet: enable feed updates on both new VMs
Dec 11 2023
Change 982157 had a related patch set uploaded (by Dzahn; author: Dzahn):
[operations/puppet@production] site: remove buster VMs from planet regex