2023-11-13
16:09 <fnegri@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1031.eqiad.wmnet with reason: host reimage [production]
16:06 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2111', diff saved to https://phabricator.wikimedia.org/P53344 and previous config saved to /var/cache/conftool/dbconfig/20231113-160656-arnaudb.json [production]
15:55 <fnegri@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1031.eqiad.wmnet with OS bookworm [production]
15:54 <oblivian@deploy2002> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply [production]
15:52 <oblivian@deploy2002> helmfile [codfw] START helmfile.d/services/mobileapps: apply [production]
15:51 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2111 (T348183)', diff saved to https://phabricator.wikimedia.org/P53343 and previous config saved to /var/cache/conftool/dbconfig/20231113-155149-arnaudb.json [production]
15:46 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2111 (T348183)', diff saved to https://phabricator.wikimedia.org/P53342 and previous config saved to /var/cache/conftool/dbconfig/20231113-154641-arnaudb.json [production]
15:46 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2111.codfw.wmnet with reason: Maintenance [production]
15:46 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2111.codfw.wmnet with reason: Maintenance [production]
15:43 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2101.codfw.wmnet with reason: Maintenance [production]
15:42 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2101.codfw.wmnet with reason: Maintenance [production]
15:41 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
15:40 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
15:40 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1230 (T348183)', diff saved to https://phabricator.wikimedia.org/P53341 and previous config saved to /var/cache/conftool/dbconfig/20231113-154044-arnaudb.json [production]
15:39 <oblivian@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
15:38 <oblivian@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
15:31 <oblivian@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
15:31 <oblivian@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
15:25 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1230', diff saved to https://phabricator.wikimedia.org/P53340 and previous config saved to /var/cache/conftool/dbconfig/20231113-152537-arnaudb.json [production]
15:14 <fabfur> swapped cp1103 <-> cp1078 (T349244) [production]
15:14 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb1020.eqiad.wmnet [production]
15:13 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp1103.eqiad.wmnet [production]
15:13 <fabfur@cumin1001> START - Cookbook sre.hosts.remove-downtime for cp1103.eqiad.wmnet [production]
15:10 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1230', diff saved to https://phabricator.wikimedia.org/P53339 and previous config saved to /var/cache/conftool/dbconfig/20231113-151031-arnaudb.json [production]
15:08 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host clouddb1020.eqiad.wmnet [production]
15:07 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb1019.eqiad.wmnet [production]
15:07 <fabfur> swapped cp1102 <-> cp1077 (T349244) [production]
15:04 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp1102.eqiad.wmnet [production]
15:04 <fabfur@cumin1001> START - Cookbook sre.hosts.remove-downtime for cp1102.eqiad.wmnet [production]
15:00 <kamila@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
15:00 <kamila@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
15:00 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host clouddb1019.eqiad.wmnet [production]
14:59 <oblivian@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
14:59 <oblivian@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
14:58 <oblivian@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
14:58 <oblivian@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
14:57 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb1018.eqiad.wmnet [production]
14:56 <kamila@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:56 <kamila@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
14:55 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1230 (T348183)', diff saved to https://phabricator.wikimedia.org/P53338 and previous config saved to /var/cache/conftool/dbconfig/20231113-145524-arnaudb.json [production]
14:52 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1230 (T348183)', diff saved to https://phabricator.wikimedia.org/P53337 and previous config saved to /var/cache/conftool/dbconfig/20231113-145223-arnaudb.json [production]
14:52 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1230.eqiad.wmnet with reason: Maintenance [production]
14:52 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1230.eqiad.wmnet with reason: Maintenance [production]
14:51 <urbanecm> mwmaint2002: stop `extensions/DiscussionTools/maintenance/persistRevisionThreadItems.php --wiki frwiki` again, memory leak didn't stop (T315510) [production]
14:50 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance [production]
14:49 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance [production]
14:49 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1213:3315 (T348183)', diff saved to https://phabricator.wikimedia.org/P53336 and previous config saved to /var/cache/conftool/dbconfig/20231113-144947-arnaudb.json [production]
14:46 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host clouddb1018.eqiad.wmnet [production]
14:43 <urbanecm> mwmaint2002: foreachwiki extensions/WikimediaMaintenance/createExtensionTables.php MediaModeration (T350321) [production]
14:41 <bblack> cp2027: varnish-frontend-restart to test tcp listen port changes [production]