2022-12-14
23:50 <cwhite@cumin2002> START - Cookbook sre.hosts.reimage for host logstash2026.codfw.wmnet with OS bullseye [production]
23:48 <cwhite@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['logstash2026'] [production]
23:44 <ejegg> civicrm upgraded from a1c2630a to 98b48b9a [production]
23:41 <cwhite@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logstash2026'] [production]
23:40 <cwhite@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['logstash2026'] [production]
23:33 <cwhite@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logstash2026'] [production]
23:29 <ryankemper> [WDQS] Downtimed wdqs20[09-12] for the next 7 days [production]
23:28 <ryankemper> T301167 wdqs2011/2012 were not visible in pybal (oversight from when I added the other hosts with conftool last week). Fixed that, so now all of the new hosts are showing up properly. [production]
23:27 <ryankemper@puppetmaster1001> conftool action : set/weight=10:pooled=no; selector: name=wdqs2012.* [production]
23:27 <ryankemper@puppetmaster1001> conftool action : set/weight=10:pooled=no; selector: name=wdqs2011.* [production]
23:14 <bd808> Toolhub: rebuilding search indices following app update [production]
23:12 <bd808@deploy1002> helmfile [eqiad] DONE helmfile.d/services/toolhub: apply [production]
23:10 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 0:10:00 on alert2001.wikimedia.org with reason: kernel update [production]
23:10 <bd808@deploy1002> helmfile [eqiad] START helmfile.d/services/toolhub: apply [production]
23:10 <denisse@cumin1001> START - Cookbook sre.hosts.downtime for 0:10:00 on alert2001.wikimedia.org with reason: kernel update [production]
23:04 <bd808@deploy1002> helmfile [codfw] DONE helmfile.d/services/toolhub: apply [production]
23:03 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert2001.wikimedia.org [production]
23:03 <denisse@cumin1001> START - Cookbook sre.hosts.reboot-single for host alert2001.wikimedia.org [production]
23:03 <bd808@deploy1002> helmfile [codfw] START helmfile.d/services/toolhub: apply [production]
23:01 <bd808@deploy1002> helmfile [staging] DONE helmfile.d/services/toolhub: apply [production]
22:59 <bd808@deploy1002> helmfile [staging] START helmfile.d/services/toolhub: apply [production]
22:56 <tgr> doing the last backport by hand due to T325252 [production]
22:49 <tgr@deploy1002> Finished scap: Backport for [[gerrit:868047|NewImpact: Add log event for clicking suggested edits button (T325041)]], [[gerrit:868051|UserEditTracker: Allow querying primary DB for edit timestamp]] (duration: 11m 37s) [production]
22:46 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host alert1001.wikimedia.org [production]
22:39 <tgr@deploy1002> tgr and kharlan and tgr: Backport for [[gerrit:868047|NewImpact: Add log event for clicking suggested edits button (T325041)]], [[gerrit:868051|UserEditTracker: Allow querying primary DB for edit timestamp]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
22:37 <tgr@deploy1002> Started scap: Backport for [[gerrit:868047|NewImpact: Add log event for clicking suggested edits button (T325041)]], [[gerrit:868051|UserEditTracker: Allow querying primary DB for edit timestamp]] [production]
22:36 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 1 day, 0:00:00 on wdqs2009.codfw.wmnet with reason: NFS troubleshooting [production]
22:36 <bking@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wdqs2009.codfw.wmnet with reason: NFS troubleshooting [production]
22:32 <denisse@cumin1001> START - Cookbook sre.hosts.reboot-single for host alert1001.wikimedia.org [production]
22:12 <samtar@deploy1002> Finished scap: Backport for [[gerrit:867311|Deployment of DiscussionTools reply visual enhancements for more wikis (T323537)]] (duration: 08m 12s) [production]
22:06 <samtar@deploy1002> samtar and kemayo: Backport for [[gerrit:867311|Deployment of DiscussionTools reply visual enhancements for more wikis (T323537)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
22:04 <samtar@deploy1002> Started scap: Backport for [[gerrit:867311|Deployment of DiscussionTools reply visual enhancements for more wikis (T323537)]] [production]
22:03 <samtar@deploy1002> Finished scap: Backport for [[gerrit:867619|VisualEnhancements: in some languages put an arrow by the reply button (T323537)]], [[gerrit:867620|VisualEnhancements: in some languages put an arrow by the reply button (T323537)]] (duration: 08m 55s) [production]
21:56 <samtar@deploy1002> samtar and kemayo: Backport for [[gerrit:867619|VisualEnhancements: in some languages put an arrow by the reply button (T323537)]], [[gerrit:867620|VisualEnhancements: in some languages put an arrow by the reply button (T323537)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
21:55 <eileen> civicrm upgraded from a65bc00a to a1c2630a [production]
21:54 <samtar@deploy1002> Started scap: Backport for [[gerrit:867619|VisualEnhancements: in some languages put an arrow by the reply button (T323537)]], [[gerrit:867620|VisualEnhancements: in some languages put an arrow by the reply button (T323537)]] [production]
21:53 <samtar@deploy1002> Finished scap: Backport for [[gerrit:868049|Parsoid: don't bypass ParserCache when using Title]] (duration: 11m 13s) [production]
21:44 <samtar@deploy1002> samtar and daniel: Backport for [[gerrit:868049|Parsoid: don't bypass ParserCache when using Title]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet [production]
21:42 <samtar@deploy1002> Started scap: Backport for [[gerrit:868049|Parsoid: don't bypass ParserCache when using Title]] [production]
21:39 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1055.eqiad.wmnet with OS bullseye [production]
21:35 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1054.eqiad.wmnet with OS bullseye [production]
21:26 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1060.eqiad.wmnet with OS bullseye [production]
21:24 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1059.eqiad.wmnet with OS bullseye [production]
21:23 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1058.eqiad.wmnet with OS bullseye [production]
21:22 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1061.eqiad.wmnet with OS bullseye [production]
21:19 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=1) for host cloudvirt1057.eqiad.wmnet with OS bullseye [production]
21:19 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=1) for host cloudvirt1056.eqiad.wmnet with OS bullseye [production]
21:17 <samtar@deploy1002> backport aborted: (duration: 15m 35s) [production]
21:14 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1054.eqiad.wmnet with reason: host reimage [production]
21:12 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1055.eqiad.wmnet with reason: host reimage [production]