1251-1300 of 10000 results (90ms)
2023-11-28 ยง
23:13 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase2030.codfw.wmnet with reason: host reimage [production]
23:10 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on restbase2030.codfw.wmnet with reason: host reimage [production]
23:07 <bblack> cp4052 - repool [production]
23:05 <bblack> cp4052 - depool temporarily [production]
23:01 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host restbase2032.codfw.wmnet with OS bullseye [production]
22:51 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host restbase2030.codfw.wmnet with OS bullseye [production]
22:51 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase2031.codfw.wmnet with OS bullseye [production]
22:51 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" [production]
22:49 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" [production]
22:33 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host restbase2030.codfw.wmnet with OS bullseye [production]
22:33 <bblack> cp4052 - disabling puppet to experiment on how we gather prometheus stats from ATS... [production]
22:33 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase2031.codfw.wmnet with reason: host reimage [production]
22:27 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on restbase2031.codfw.wmnet with reason: host reimage [production]
22:23 <urbanecm@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifeeds: apply [production]
22:22 <urbanecm@deploy2002> helmfile [codfw] START helmfile.d/services/wikifeeds: apply [production]
22:22 <urbanecm@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifeeds: apply [production]
22:22 <urbanecm@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifeeds: apply [production]
22:20 <urbanecm@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifeeds: apply [production]
22:19 <urbanecm@deploy2002> helmfile [staging] START helmfile.d/services/wikifeeds: apply [production]
22:12 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:978072|Fixes: Duplicate events for radio buttons (T352075)]], [[gerrit:978071|Fixes: Duplicate events for radio buttons (T352075)]], [[gerrit:978067|Work around Parsoid's messy handling of some extensions (T351461)]] (duration: 13m 02s) [production]
22:09 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host restbase2031.codfw.wmnet with OS bullseye [production]
22:04 <urbanecm@deploy2002> urbanecm and ssastry and jdlrobson: Continuing with sync [production]
22:01 <urbanecm@deploy2002> urbanecm and ssastry and jdlrobson: Backport for [[gerrit:978072|Fixes: Duplicate events for radio buttons (T352075)]], [[gerrit:978071|Fixes: Duplicate events for radio buttons (T352075)]], [[gerrit:978067|Work around Parsoid's messy handling of some extensions (T351461)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:59 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:978072|Fixes: Duplicate events for radio buttons (T352075)]], [[gerrit:978071|Fixes: Duplicate events for radio buttons (T352075)]], [[gerrit:978067|Work around Parsoid's messy handling of some extensions (T351461)]] [production]
21:58 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:973876|Increase coverage of Reader Demographics 2 surveys (T344393)]], [[gerrit:978068|DefaultOutputTransform::deduplicateStyles: don't match inside an attribute]] (duration: 31m 09s) [production]
21:52 <urbanecm@deploy2002> cscott and urbanecm and dani: Continuing with sync [production]
21:49 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase2029.codfw.wmnet with OS bullseye [production]
21:49 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" [production]
21:47 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002" [production]
21:42 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1161.eqiad.wmnet with OS bullseye [production]
21:30 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on restbase2029.codfw.wmnet with reason: host reimage [production]
21:29 <urbanecm@deploy2002> cscott and urbanecm and dani: Backport for [[gerrit:973876|Increase coverage of Reader Demographics 2 surveys (T344393)]], [[gerrit:978068|DefaultOutputTransform::deduplicateStyles: don't match inside an attribute]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:27 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:973876|Increase coverage of Reader Demographics 2 surveys (T344393)]], [[gerrit:978068|DefaultOutputTransform::deduplicateStyles: don't match inside an attribute]] [production]
21:26 <pt1979@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on restbase2029.codfw.wmnet with reason: host reimage [production]
21:18 <jclark@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001" [production]
21:08 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host restbase2029.codfw.wmnet with OS bullseye [production]
21:07 <pt1979@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host restbase2029.codfw.wmnet with OS bullseye [production]
21:04 <jclark@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1158.eqiad.wmnet with reason: host reimage [production]
21:00 <jclark@cumin1001> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host an-worker1159.mgmt.eqiad.wmnet with reboot policy FORCED [production]
21:00 <jclark@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1158.eqiad.wmnet with reason: host reimage [production]
20:46 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host an-worker1158.eqiad.wmnet with OS bullseye [production]
20:40 <jclark@cumin1001> START - Cookbook sre.hosts.provision for host an-worker1159.mgmt.eqiad.wmnet with reboot policy FORCED [production]
20:39 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:977250|Disable VipsScaler in group0 (T290759)]] (duration: 10m 08s) [production]
20:36 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host an-worker1158.eqiad.wmnet with OS bullseye [production]
20:32 <ladsgroup@deploy2002> ladsgroup: Continuing with sync [production]
20:30 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:977250|Disable VipsScaler in group0 (T290759)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:29 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:977250|Disable VipsScaler in group0 (T290759)]] [production]
20:21 <jclark@cumin1001> START - Cookbook sre.hosts.reimage for host an-worker1161.eqiad.wmnet with OS bullseye [production]
20:13 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on planet2003.codfw.wmnet with reason: maintenance [production]
20:13 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on planet2003.codfw.wmnet with reason: maintenance [production]