2023-11-20
22:17 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
22:15 <catrope@deploy2002> Finished scap: Backport for [[gerrit:975366|Revert "mw.notify: Limit width of overlay to max-width-page-container" (T349622)]] (duration: 17m 40s) [production]
22:09 <catrope@deploy2002> jdlrobson and catrope: Continuing with sync [production]
21:59 <catrope@deploy2002> jdlrobson and catrope: Backport for [[gerrit:975366|Revert "mw.notify: Limit width of overlay to max-width-page-container" (T349622)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:58 <catrope@deploy2002> Started scap: Backport for [[gerrit:975366|Revert "mw.notify: Limit width of overlay to max-width-page-container" (T349622)]] [production]
21:38 <catrope@deploy2002> Finished scap: Backport for [[gerrit:975879|Disable MobileFrontend AMC drawer temporarily while erroring (T351669)]] (duration: 22m 11s) [production]
21:32 <catrope@deploy2002> catrope and jdlrobson: Continuing with sync [production]
21:17 <catrope@deploy2002> catrope and jdlrobson: Backport for [[gerrit:975879|Disable MobileFrontend AMC drawer temporarily while erroring (T351669)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:16 <catrope@deploy2002> Started scap: Backport for [[gerrit:975879|Disable MobileFrontend AMC drawer temporarily while erroring (T351669)]] [production]
21:12 <catrope@deploy2002> Finished scap: Backport for [[gerrit:973795|Enable action blocks in ruwiki (T351048)]] (duration: 08m 52s) [production]
21:06 <catrope@deploy2002> catrope and stjn: Continuing with sync [production]
21:05 <catrope@deploy2002> catrope and stjn: Backport for [[gerrit:973795|Enable action blocks in ruwiki (T351048)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:03 <catrope@deploy2002> Started scap: Backport for [[gerrit:973795|Enable action blocks in ruwiki (T351048)]] [production]
21:02 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for aqs1014.eqiad.wmnet [production]
21:02 <eevans@cumin1001> START - Cookbook sre.hosts.remove-downtime for aqs1014.eqiad.wmnet [production]
21:02 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aqs1014.eqiad.wmnet with OS bullseye [production]
20:40 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1014.eqiad.wmnet with reason: host reimage [production]
20:37 <eevans@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1014.eqiad.wmnet with reason: host reimage [production]
20:34 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2139.codfw.wmnet with reason: Maintenance [production]
20:33 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2139.codfw.wmnet with reason: Maintenance [production]
20:33 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53645 and previous config saved to /var/cache/conftool/dbconfig/20231120-203337-arnaudb.json [production]
20:21 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1014.eqiad.wmnet with OS bullseye [production]
20:21 <eevans@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1014.eqiad.wmnet with OS bullseye [production]
20:18 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3314', diff saved to https://phabricator.wikimedia.org/P53644 and previous config saved to /var/cache/conftool/dbconfig/20231120-201831-arnaudb.json [production]
20:10 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1014.eqiad.wmnet with OS bullseye [production]
20:08 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aqs1013.eqiad.wmnet with OS bullseye [production]
20:03 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3314', diff saved to https://phabricator.wikimedia.org/P53643 and previous config saved to /var/cache/conftool/dbconfig/20231120-200324-arnaudb.json [production]
19:59 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for acmechief2001.codfw.wmnet [production]
19:59 <brett@cumin2002> START - Cookbook sre.hosts.remove-downtime for acmechief2001.codfw.wmnet [production]
19:50 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1013.eqiad.wmnet with reason: host reimage [production]
19:48 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2138:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53642 and previous config saved to /var/cache/conftool/dbconfig/20231120-194818-arnaudb.json [production]
19:48 <eevans@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1013.eqiad.wmnet with reason: host reimage [production]
19:36 <eevans@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1013.eqiad.wmnet with OS bullseye [production]
19:21 <sukhe> pool cp4045.ulsfo.wmnet post reboot and puppet 7 upgrade [production]
19:16 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp4045.ulsfo.wmnet [production]
19:05 <sukhe@cumin2002> START - Cookbook sre.hosts.reboot-single for host cp4045.ulsfo.wmnet [production]
19:04 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
19:03 <brett@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host acmechief2001.codfw.wmnet with OS bookworm [production]
19:03 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
19:02 <sukhe> depool cp4045 for reboot [production]
18:59 <cmooney@cumin1001> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox [production]
18:59 <cmooney@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox [production]
18:59 <cmooney@cumin1001> END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary [production]
18:59 <cmooney@cumin1001> START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary [production]
18:57 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cp4045.ulsfo.wmnet [production]
18:48 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host cp4045.ulsfo.wmnet [production]
18:44 <brett@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on acmechief2001.codfw.wmnet with reason: host reimage [production]
18:41 <brett@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on acmechief2001.codfw.wmnet with reason: host reimage [production]
18:39 <bking@cumin1001> START - Cookbook sre.wdqs.data-reload [production]
18:38 <bking@cumin1001> END (ERROR) - Cookbook sre.wdqs.data-reload (exit_code=97) [production]