2023-10-02 §
20:27 <ottomata> mw-page-content-change-enrich - increase replicas to 12 to process backlog - T347676 [production]
20:27 <kindrobot@deploy2002> Finished scap: Backport for [[gerrit:962105|Undeploy Reader Demographics 2 pilot survey (T345951)]], [[gerrit:962629|DiscussionTools: Disable timestamp links in production initially]] (duration: 08m 49s) [production]
20:27 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host restbase1029.eqiad.wmnet with OS bullseye [production]
20:22 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1003.eqiad.wmnet with OS bullseye [production]
20:21 <kindrobot@deploy2002> esanders and dani and kindrobot: Continuing with sync [production]
20:19 <kindrobot@deploy2002> esanders and dani and kindrobot: Backport for [[gerrit:962105|Undeploy Reader Demographics 2 pilot survey (T345951)]], [[gerrit:962629|DiscussionTools: Disable timestamp links in production initially]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:18 <kindrobot@deploy2002> Started scap: Backport for [[gerrit:962105|Undeploy Reader Demographics 2 pilot survey (T345951)]], [[gerrit:962629|DiscussionTools: Disable timestamp links in production initially]] [production]
20:13 <otto@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:13 <otto@deploy2002> helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:12 <otto@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:12 <eileen> process control revision changed from b370644b to 9760851c [production]
20:12 <otto@deploy2002> helmfile [codfw] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
20:12 <eileen> revision changed from b370644b to 9760851c [production]
20:11 <eevans@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1029.eqiad.wmnet with OS bullseye [production]
20:11 <kindrobot@deploy2002> Backport cancelled. [production]
20:01 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host restbase1029.eqiad.wmnet with OS bullseye [production]
20:01 <eevans@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1029.eqiad.wmnet with OS bullseye [production]
19:54 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host restbase1029.eqiad.wmnet with OS bullseye [production]
19:53 <eevans@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts restbase1029.eqiad.wmnet [production]
19:53 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1029.eqiad.wmnet [production]
19:53 <moritzm> installing libvpx security updates [production]
19:41 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase1029.eqiad.wmnet [production]
19:40 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts restbase1029.eqiad.wmnet [production]
19:40 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase1024.eqiad.wmnet with OS bullseye [production]
19:38 <eileen> civicrm upgraded from 7406cdf3 to c1b28287 [production]
19:19 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase1024.eqiad.wmnet with reason: host reimage [production]
19:16 <eevans@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on restbase1024.eqiad.wmnet with reason: host reimage [production]
19:13 <jclark@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['an-master1003'] [production]
19:13 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-master1003'] [production]
19:11 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host an-master1004.eqiad.wmnet with OS bullseye [production]
19:02 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host restbase1024.eqiad.wmnet with OS bullseye [production]
19:02 <eevans@cumin1001> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts restbase1029.eqiad.wmnet [production]
19:02 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts restbase1029.eqiad.wmnet [production]
19:01 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host an-master1003.eqiad.wmnet with OS bullseye [production]
19:00 <jclark@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['an-master1003.eqiad.wmnet'] [production]
19:00 <jclark@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-master1003.eqiad.wmnet'] [production]
19:00 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1004.eqiad.wmnet with OS bullseye [production]
19:00 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-master1003.eqiad.wmnet with OS bullseye [production]
18:56 <eevans@cumin1001> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts restbase1024.eqiad.wmnet [production]
18:56 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase1024.eqiad.wmnet [production]
18:44 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase1024.eqiad.wmnet [production]
18:44 <eevans@cumin1001> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts restbase1024.eqiad.wmnet [production]
18:42 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for restbase1023.eqiad.wmnet [production]
18:42 <eevans@cumin1001> START - Cookbook sre.hosts.remove-downtime for restbase1023.eqiad.wmnet [production]
18:40 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase1023.eqiad.wmnet with OS bullseye [production]
18:16 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase1023.eqiad.wmnet with reason: host reimage [production]
18:13 <eevans@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on restbase1023.eqiad.wmnet with reason: host reimage [production]
17:59 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host restbase1023.eqiad.wmnet with OS bullseye [production]
17:59 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for restbase1022.eqiad.wmnet [production]
17:59 <eevans@cumin1001> START - Cookbook sre.hosts.remove-downtime for restbase1022.eqiad.wmnet [production]