2201-2250 of 10000 results (34ms)
2025-10-09 ยง
20:42 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dse-k8s-worker2003.codfw.wmnet with OS bookworm [production]
20:42 <reedy@deploy2002> reedy, sbassett: Backport for [[gerrit:1193928|Enable New UI and Multiple Module support for OATHAuth in Wikimedia production (T399644)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:37 <reedy@deploy2002> Started scap sync-world: Backport for [[gerrit:1193928|Enable New UI and Multiple Module support for OATHAuth in Wikimedia production (T399644)]] [production]
20:32 <mutante> logmsgbot do you still log - test log T284123 [production]
20:29 <mutante> re-enabled QoS on gerrit servers - with previously stable config - T406774 gerrit:1194811 [production]
20:28 <reedy@deploy2002> Finished scap sync-world: Backport for [[gerrit:1194962|OATHAuth Recovery Code code improvement (T406501)]] (duration: 10m 19s) [production]
20:25 <mutante> re-enabling QoS on gerrit servers - with previously stable config - T406774 [production]
20:24 <reedy@deploy2002> sbassett, reedy: Continuing with sync [production]
20:24 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dse-k8s-worker2003.codfw.wmnet with reason: host reimage [production]
20:22 <reedy@deploy2002> sbassett, reedy: Backport for [[gerrit:1194962|OATHAuth Recovery Code code improvement (T406501)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:19 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker2003.codfw.wmnet with reason: host reimage [production]
20:18 <reedy@deploy2002> Started scap sync-world: Backport for [[gerrit:1194962|OATHAuth Recovery Code code improvement (T406501)]] [production]
20:17 <reedy@deploy2002> Finished scap sync-world: Backport for [[gerrit:1194978|Update interwiki cache]], [[gerrit:1194981|Revert "Delete the event-organizer user group on medium and small wikis" (T401445)]], [[gerrit:1194986|Assign campaignevents-generate-invitation-lists right explicitly (T401445)]] (duration: 10m 46s) [production]
20:13 <reedy@deploy2002> daimona, reedy: Continuing with sync [production]
20:11 <reedy@deploy2002> daimona, reedy: Backport for [[gerrit:1194978|Update interwiki cache]], [[gerrit:1194981|Revert "Delete the event-organizer user group on medium and small wikis" (T401445)]], [[gerrit:1194986|Assign campaignevents-generate-invitation-lists right explicitly (T401445)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
20:06 <reedy@deploy2002> Started scap sync-world: Backport for [[gerrit:1194978|Update interwiki cache]], [[gerrit:1194981|Revert "Delete the event-organizer user group on medium and small wikis" (T401445)]], [[gerrit:1194986|Assign campaignevents-generate-invitation-lists right explicitly (T401445)]] [production]
20:04 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host dse-k8s-worker2003.codfw.wmnet with OS bookworm [production]
20:00 <bking@cumin2002> START - Cookbook sre.wdqs.categories-reload reloading categories to wdqs1020.eqiad.wmnet [production]
19:59 <bking@cumin2002> START - Cookbook sre.wdqs.categories-reload reloading categories to wdqs1019.eqiad.wmnet [production]
19:59 <bking@cumin2002> START - Cookbook sre.wdqs.categories-reload reloading categories to wdqs1018.eqiad.wmnet [production]
19:29 <eileen> civicrm upgraded from 14cc3125 to 748922f0 [production]
19:22 <ejegg> donorwiki upgraded from e8ef5539 to 73c34ea4 [production]
19:13 <ejegg> civicrm upgraded from 132211d5 to 14cc3125 [production]
19:04 <jforrester@deploy2002> Finished scap sync-world: Backport for [[gerrit:1195021|i18n: Pull forward wikimedia-boardelection2025-notification-body updates]] (duration: 11m 39s) [production]
18:59 <jforrester@deploy2002> jforrester: Continuing with sync [production]
18:58 <jforrester@deploy2002> jforrester: Backport for [[gerrit:1195021|i18n: Pull forward wikimedia-boardelection2025-notification-body updates]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
18:53 <jforrester@deploy2002> Started scap sync-world: Backport for [[gerrit:1195021|i18n: Pull forward wikimedia-boardelection2025-notification-body updates]] [production]
18:36 <cmooney@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs1020.eqiad.wmnet [production]
18:36 <cmooney@cumin1003> START - Cookbook sre.hosts.remove-downtime for lvs1020.eqiad.wmnet [production]
18:02 <rzl@deploy1003> helmfile [staging] DONE helmfile.d/services/apertium: apply [production]
18:02 <rzl@deploy1003> helmfile [staging] START helmfile.d/services/apertium: apply [production]
17:31 <topranks> begin work to move lvs1020 uplink cable from ssw1-f1-eqiad to ssw1-e1-eqiad [production]
17:30 <cmooney@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs1020.eqiad.wmnet with reason: downtime lvs1020 to supress alerts about enp94s0f0np0 going down and losing backend connectivity [production]
17:08 <bd808@deploy2002> helmfile [codfw] DONE helmfile.d/services/developer-portal: apply [production]
17:06 <bd808@deploy2002> helmfile [codfw] START helmfile.d/services/developer-portal: apply [production]
17:06 <bd808@deploy2002> helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply [production]
17:05 <bd808@deploy2002> helmfile [eqiad] START helmfile.d/services/developer-portal: apply [production]
17:04 <bd808@deploy2002> helmfile [staging] DONE helmfile.d/services/developer-portal: apply [production]
17:02 <bd808@deploy2002> helmfile [staging] START helmfile.d/services/developer-portal: apply [production]
16:57 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:57 <cmooney@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add dns entries for inter.link transit IPs in drmrs - cmooney@cumin1003" [production]
16:47 <cmooney@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add dns entries for inter.link transit IPs in drmrs - cmooney@cumin1003" [production]
16:38 <cmooney@cumin1003> START - Cookbook sre.dns.netbox [production]
16:33 <cwhite> upgrade grafana-loki on grafana hosts T406478 [production]
16:30 <tgr@deploy2002> Finished scap sync-world: Backport for [[gerrit:1194963|session: Improve logging for MultiBackendSessionStore (T402808 T405633 T405634)]], [[gerrit:1194964|session: Improve logging for MultiBackendSessionStore (T402808 T405633 T405634)]] (duration: 20m 07s) [production]
16:26 <tgr@deploy2002> tgr, d3r1ck01: Continuing with sync [production]
16:18 <elukey@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2078.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
16:18 <sukhe> sukhe@lvs2013:~$ sudo systemctl restart pybal.service [production]
16:14 <tgr@deploy2002> tgr, d3r1ck01: Backport for [[gerrit:1194963|session: Improve logging for MultiBackendSessionStore (T402808 T405633 T405634)]], [[gerrit:1194964|session: Improve logging for MultiBackendSessionStore (T402808 T405633 T405634)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
16:10 <tgr@deploy2002> Started scap sync-world: Backport for [[gerrit:1194963|session: Improve logging for MultiBackendSessionStore (T402808 T405633 T405634)]], [[gerrit:1194964|session: Improve logging for MultiBackendSessionStore (T402808 T405633 T405634)]] [production]