1-50 of 10000 results (21ms)
2026-02-25 ยง
10:57 <btullis@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1025.eqiad.wmnet with reason: host reimage [production]
10:55 <jelto@cumin1003> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: Upgrade GitLab replica [production]
10:53 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1027.eqiad.wmnet with OS bookworm [production]
10:46 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1026.eqiad.wmnet with OS bookworm [production]
10:46 <jelto@cumin1003> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Upgrade GitLab replica [production]
10:43 <btullis@cumin2002> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1025.eqiad.wmnet with OS bookworm [production]
10:41 <btullis@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1025.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
10:39 <btullis@cumin2002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1025.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
10:39 <btullis@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-worker1025.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
10:37 <daniel@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
10:36 <daniel@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
10:36 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-worker1024.eqiad.wmnet with OS bookworm [production]
10:36 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1003" [production]
10:36 <btullis@cumin2002> START - Cookbook sre.hosts.provision for host dse-k8s-worker1025.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
10:35 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1003" [production]
10:28 <fceratto@deploy2002> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
10:23 <fceratto@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dborch1003.eqiad.wmnet with OS trixie [production]
10:18 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker1024.eqiad.wmnet with reason: host reimage [production]
10:12 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1024.eqiad.wmnet with reason: host reimage [production]
10:09 <fceratto@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dborch1003.eqiad.wmnet with reason: host reimage [production]
10:02 <fceratto@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dborch1003.eqiad.wmnet with reason: host reimage [production]
09:57 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1024.eqiad.wmnet with OS bookworm [production]
09:54 <fceratto@cumin1003> START - Cookbook sre.hosts.reimage for host dborch1003.eqiad.wmnet with OS trixie [production]
09:54 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:54 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Updating records after renaming and moving vlan of some an-worker hosts - btullis@cumin1003" [production]
09:53 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Updating records after renaming and moving vlan of some an-worker hosts - btullis@cumin1003" [production]
09:52 <elukey> uploaded python3-wmflib_3.0.0 to apt.wikimedia.org bullseye-wikimedia,bookworm-wikimedia,trixie-wikimedia [production]
09:48 <btullis@cumin1003> START - Cookbook sre.dns.netbox [production]
09:22 <XioNoX> push pfw policies - T418305 [production]
08:46 <ammarpad@deploy2002> mwscript-k8s job started: extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=mediawikiwiki --logwiki=metawiki Egortropeano Fortuna1992 # T418331 [production]
08:45 <ammarpad@deploy2002> mwscript-k8s job started: extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=gawiki --logwiki=metawiki DroopyDoggy AlterDiegos # T418330 [production]
08:20 <slyngshede@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp2045.codfw.wmnet with reason: host reimage [production]
08:14 <slyngshede@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on cp2045.codfw.wmnet with reason: host reimage [production]
07:59 <slyngshede@cumin1003> START - Cookbook sre.hosts.reimage for host cp2045.codfw.wmnet with OS trixie [production]
06:16 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1023.eqiad.wmnet with OS trixie [production]
05:59 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1023.eqiad.wmnet with reason: host reimage [production]
05:54 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1023.eqiad.wmnet with reason: host reimage [production]
05:38 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host dbproxy1023.eqiad.wmnet with OS trixie [production]
03:37 <eileen> config revision changed from 390d6434 to a0228e6c turn off trustly audit [fundraising]
02:25 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1261 (T415786)', diff saved to https://phabricator.wikimedia.org/P89022 and previous config saved to /var/cache/conftool/dbconfig/20260225-022502-marostegui.json [production]
02:24 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1261.eqiad.wmnet with reason: Maintenance [production]
02:24 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1260 (T415786)', diff saved to https://phabricator.wikimedia.org/P89021 and previous config saved to /var/cache/conftool/dbconfig/20260225-022446-marostegui.json [production]
02:23 <ryankemper> [WDQS] Restart codfw wdqs-main [production]
02:13 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 12m 49s) [production]
02:09 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1260', diff saved to https://phabricator.wikimedia.org/P89020 and previous config saved to /var/cache/conftool/dbconfig/20260225-020938-marostegui.json [production]
02:00 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
01:54 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1260', diff saved to https://phabricator.wikimedia.org/P89019 and previous config saved to /var/cache/conftool/dbconfig/20260225-015430-marostegui.json [production]
01:39 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1260 (T415786)', diff saved to https://phabricator.wikimedia.org/P89018 and previous config saved to /var/cache/conftool/dbconfig/20260225-013921-marostegui.json [production]
00:25 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1243274|Start reading from new file tables on all small wikis (T416548)]] (duration: 06m 40s) [production]
00:22 <zabe@deploy2002> zabe: Continuing with sync [production]