2025-11-06
19:37 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1251 (T407997)', diff saved to https://phabricator.wikimedia.org/P85046 and previous config saved to /var/cache/conftool/dbconfig/20251106-193705-marostegui.json [production]
19:36 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1251.eqiad.wmnet with reason: Maintenance [production]
19:34 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2093.codfw.wmnet with reason: host reimage [production]
19:34 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2092.codfw.wmnet with reason: host reimage [production]
19:33 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2091.codfw.wmnet with reason: host reimage [production]
19:31 <swfrench-wmf> rolling run-puppet-agent on A:cp hosts for haproxy config change [production]
19:29 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2091.codfw.wmnet with reason: host reimage [production]
19:27 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2090.codfw.wmnet with reason: host reimage [production]
19:21 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2090.codfw.wmnet with reason: host reimage [production]
19:21 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1240.eqiad.wmnet with reason: Maintenance [production]
19:19 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcontrol1008-dev.eqiad.wmnet with OS trixie [production]
19:18 <swfrench-wmf> disable-puppet on A:cp hosts for haproxy config change [production]
19:15 <jhuneidi@deploy2002> rebuilt and synchronized wikiversions files: group2 to 1.46.0-wmf.1 refs T408271 [production]
19:06 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wdqs1013.eqiad.wmnet with reason: C/D Migration [production]
19:05 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wcqs1003.eqiad.wmnet with reason: C/D Migration [production]
19:05 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1239.eqiad.wmnet with reason: Maintenance [production]
19:05 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1235 (T407997)', diff saved to https://phabricator.wikimedia.org/P85045 and previous config saved to /var/cache/conftool/dbconfig/20251106-190506-marostegui.json [production]
19:02 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on puppetserver1001.eqiad.wmnet with reason: C/D Migration [production]
18:57 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on an-test-worker1002.eqiad.wmnet with reason: C/D Migration [production]
18:55 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on sessionstore1005.eqiad.wmnet with reason: C/D Migration [production]
18:53 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on es1045.eqiad.wmnet with reason: C/D Migration [production]
18:52 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host ms-be2093.codfw.wmnet with OS bullseye [production]
18:52 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host ms-be2092.codfw.wmnet with OS bullseye [production]
18:52 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host ms-be2091.codfw.wmnet with OS bullseye [production]
18:51 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host ms-be2090.codfw.wmnet with OS bullseye [production]
18:51 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on db1262.eqiad.wmnet with reason: C/D Migration [production]
18:51 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-be2093'] [production]
18:51 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-be2092'] [production]
18:51 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-be2091'] [production]
18:50 <jhancock@cumin1003> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-be2093'] [production]
18:50 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-be2090'] [production]
18:50 <jhancock@cumin1003> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-be2092'] [production]
18:50 <jhancock@cumin1003> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-be2091'] [production]
18:50 <jhancock@cumin1003> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-be2090'] [production]
18:50 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on db1218.eqiad.wmnet with reason: C/D Migration [production]
18:49 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1235', diff saved to https://phabricator.wikimedia.org/P85044 and previous config saved to /var/cache/conftool/dbconfig/20251106-184958-marostegui.json [production]
18:49 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on db1217.eqiad.wmnet with reason: C/D Migration [production]
18:46 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on db1169.eqiad.wmnet with reason: C/D Migration [production]
18:44 <robh> C5 eqiad c/d server switch migrations in progress [production]
18:44 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on db1168.eqiad.wmnet with reason: C/D Migration [production]
18:43 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2093.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:43 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2090.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:43 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2092.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:42 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be2091.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:41 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on aqs1018.eqiad.wmnet with reason: C/D Migration [production]
18:38 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on krb1002.eqiad.wmnet with reason: C/D Migration [production]
18:34 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1235', diff saved to https://phabricator.wikimedia.org/P85043 and previous config saved to /var/cache/conftool/dbconfig/20251106-183452-marostegui.json [production]
18:34 <swfrench@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-wikifunctions: apply [production]
18:33 <swfrench@deploy2002> helmfile [codfw] START helmfile.d/services/mw-wikifunctions: apply [production]
18:28 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on mc1048.eqiad.wmnet with reason: C/D Migration [production]