2025-03-11
08:32 <kartik@deploy2002> Finished scap sync-world: Backport for [[gerrit:1126220|EventLogging: Improve handling when suggestions are not present (T388467)]] (duration: 26m 56s) [production]
08:23 <kartik@deploy2002> abi, kartik: Continuing with sync [production]
08:12 <kartik@deploy2002> abi, kartik: Backport for [[gerrit:1126220|EventLogging: Improve handling when suggestions are not present (T388467)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
08:08 <moritzm> installing systemd bugfix updates from Bookworm point release [production]
08:05 <kartik@deploy2002> Started scap sync-world: Backport for [[gerrit:1126220|EventLogging: Improve handling when suggestions are not present (T388467)]] [production]
08:02 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 8 hosts with reason: Cloning [production]
08:00 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1217.eqiad.wmnet with reason: Maintenance [production]
07:59 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1228.eqiad.wmnet with reason: Maintenance [production]
07:23 <marostegui@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1228.eqiad.wmnet [production]
07:19 <marostegui@cumin1002> START - Cookbook sre.mysql.upgrade for db1228.eqiad.wmnet [production]
07:19 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1228.eqiad.wmnet with reason: Maintenance [production]
07:13 <marostegui> Failover m2 from db1228 to db1164 - T388396 [production]
07:00 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db[2160,2233].codfw.wmnet,db[1164,1217,1228].eqiad.wmnet with reason: Primary switchover m2 T388396 [production]
06:45 <marostegui> Drop rt database from m1 T388437 [production]
06:44 <marostegui> Remove rt grants from m1 T388437 [production]
04:03 <mwpresync@deploy2002> Pruned MediaWiki: 1.44.0-wmf.17 (duration: 03m 02s) [production]
03:54 <eileen> civicrm upgraded from f2222fcd to ec20a105 [production]
03:52 <mwpresync@deploy2002> Finished scap sync-world: testwikis to 1.44.0-wmf.20 refs T386215 (duration: 49m 13s) [production]
03:03 <mwpresync@deploy2002> Started scap sync-world: testwikis to 1.44.0-wmf.20 refs T386215 [production]
00:22 <aaron@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
00:21 <aaron@deploy2002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply [production]
00:18 <aaron@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
00:13 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2089.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
00:08 <pt1979@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host restbase1045.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
00:07 <pt1979@cumin1002> START - Cookbook sre.hosts.provision for host restbase1045.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
00:07 <aaron@deploy2002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
00:02 <pt1979@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host restbase1045.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
00:01 <pt1979@cumin1002> START - Cookbook sre.hosts.provision for host restbase1045.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
00:00 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ms-be2089.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
2025-03-10
23:47 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be2089.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
23:40 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host ms-be2089.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
23:38 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be2089 [production]
23:38 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ms-be2089 [production]
23:38 <jhancock@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
23:38 <jhancock@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ms-be2089 to codfw - jhancock@cumin2002" [production]
23:38 <jhancock@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: adding ms-be2089 to codfw - jhancock@cumin2002" [production]
23:34 <jhancock@cumin2002> START - Cookbook sre.dns.netbox [production]
23:31 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be2089 [production]
23:31 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ms-be2089 [production]
21:48 <tgr_> UTC late deploys done [production]
21:48 <tgr@deploy2002> Finished scap sync-world: Backport for [[gerrit:1126131|Enable SUL3 signup for all of group 1 and 1% of group 2 users (T384007 T384218)]] (duration: 15m 21s) [production]
21:42 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host restbase1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:41 <tgr@deploy2002> tgr: Continuing with sync [production]
21:41 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host restbase1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:35 <tgr@deploy2002> tgr: Backport for [[gerrit:1126131|Enable SUL3 signup for all of group 1 and 1% of group 2 users (T384007 T384218)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:32 <tgr@deploy2002> Started scap sync-world: Backport for [[gerrit:1126131|Enable SUL3 signup for all of group 1 and 1% of group 2 users (T384007 T384218)]] [production]
21:28 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host db1257.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:23 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host restbase1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:22 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host db1257.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
21:21 <jclark@cumin1002> START - Cookbook sre.hosts.provision for host restbase1043.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]