2025-11-06
21:07 <dzahn@dns1004> END - running authdns-update [production]
21:04 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1202801|Revert "BacklinkCache: Switch order between pr_cascade and links queries"]], [[gerrit:1202800|Revert "RestrictionStore: Switch order between pr_cascade and links queries"]] [production]
21:03 <dzahn@dns1004> START - running authdns-update [production]
20:57 <eileen> civicrm upgraded from 75455a21 to 0f69c4eb [production]
20:41 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance [production]
20:41 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1251 (T407997)', diff saved to https://phabricator.wikimedia.org/P85050 and previous config saved to /var/cache/conftool/dbconfig/20251106-204120-marostegui.json [production]
20:26 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1251', diff saved to https://phabricator.wikimedia.org/P85049 and previous config saved to /var/cache/conftool/dbconfig/20251106-202612-marostegui.json [production]
20:11 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1251', diff saved to https://phabricator.wikimedia.org/P85048 and previous config saved to /var/cache/conftool/dbconfig/20251106-201105-marostegui.json [production]
19:55 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1251 (T407997)', diff saved to https://phabricator.wikimedia.org/P85047 and previous config saved to /var/cache/conftool/dbconfig/20251106-195557-marostegui.json [production]
19:55 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2092.codfw.wmnet with OS bullseye [production]
19:55 <jhancock@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin1003" [production]
19:53 <jhancock@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jhancock@cumin1003" [production]
19:44 <andrew@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cloudcontrol1008-dev.eqiad.wmnet'] [production]
19:43 <andrew@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cloudcontrol1008-dev.eqiad.wmnet with OS trixie [production]
19:39 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2093.codfw.wmnet with reason: host reimage [production]
19:39 <cmooney@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on lsw1-d6-eqiad,lsw1-d6-eqiad IPv6,lsw1-d6-eqiad.mgmt with reason: told switch to reboot and its stuck in UEFI shell [production]
19:37 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2092.codfw.wmnet with reason: host reimage [production]
19:37 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1251 (T407997)', diff saved to https://phabricator.wikimedia.org/P85046 and previous config saved to /var/cache/conftool/dbconfig/20251106-193705-marostegui.json [production]
19:36 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1251.eqiad.wmnet with reason: Maintenance [production]
19:34 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2093.codfw.wmnet with reason: host reimage [production]
19:34 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2092.codfw.wmnet with reason: host reimage [production]
19:33 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2091.codfw.wmnet with reason: host reimage [production]
19:31 <swfrench-wmf> rolling run-puppet-agent on A:cp hosts for haproxy config change [production]
19:29 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2091.codfw.wmnet with reason: host reimage [production]
19:27 <jhancock@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2090.codfw.wmnet with reason: host reimage [production]
19:21 <jhancock@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2090.codfw.wmnet with reason: host reimage [production]
19:21 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1240.eqiad.wmnet with reason: Maintenance [production]
19:19 <andrew@cumin2002> START - Cookbook sre.hosts.reimage for host cloudcontrol1008-dev.eqiad.wmnet with OS trixie [production]
19:18 <swfrench-wmf> disable-puppet on A:cp hosts for haproxy config change [production]
19:15 <jhuneidi@deploy2002> rebuilt and synchronized wikiversions files: group2 to 1.46.0-wmf.1 refs T408271 [production]
19:06 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wdqs1013.eqiad.wmnet with reason: C/D Migration [production]
19:05 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on wcqs1003.eqiad.wmnet with reason: C/D Migration [production]
19:05 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1239.eqiad.wmnet with reason: Maintenance [production]
19:05 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1235 (T407997)', diff saved to https://phabricator.wikimedia.org/P85045 and previous config saved to /var/cache/conftool/dbconfig/20251106-190506-marostegui.json [production]
19:02 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on puppetserver1001.eqiad.wmnet with reason: C/D Migration [production]
18:57 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on an-test-worker1002.eqiad.wmnet with reason: C/D Migration [production]
18:55 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on sessionstore1005.eqiad.wmnet with reason: C/D Migration [production]
18:53 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on es1045.eqiad.wmnet with reason: C/D Migration [production]
18:52 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host ms-be2093.codfw.wmnet with OS bullseye [production]
18:52 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host ms-be2092.codfw.wmnet with OS bullseye [production]
18:52 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host ms-be2091.codfw.wmnet with OS bullseye [production]
18:51 <jhancock@cumin1003> START - Cookbook sre.hosts.reimage for host ms-be2090.codfw.wmnet with OS bullseye [production]
18:51 <robh@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:10:00 on db1262.eqiad.wmnet with reason: C/D Migration [production]
18:51 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-be2093'] [production]
18:51 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-be2092'] [production]
18:51 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-be2091'] [production]
18:50 <jhancock@cumin1003> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-be2093'] [production]
18:50 <jhancock@cumin1003> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['ms-be2090'] [production]
18:50 <jhancock@cumin1003> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-be2092'] [production]
18:50 <jhancock@cumin1003> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['ms-be2091'] [production]