5101-5150 of 10000 results (82ms)
2023-11-13 ยง
15:43 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2101.codfw.wmnet with reason: Maintenance [production]
15:42 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2101.codfw.wmnet with reason: Maintenance [production]
15:41 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
15:40 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dbstore1003.eqiad.wmnet with reason: Maintenance [production]
15:40 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1230 (T348183)', diff saved to https://phabricator.wikimedia.org/P53341 and previous config saved to /var/cache/conftool/dbconfig/20231113-154044-arnaudb.json [production]
15:39 <oblivian@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
15:38 <oblivian@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
15:31 <oblivian@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
15:31 <oblivian@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
15:25 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1230', diff saved to https://phabricator.wikimedia.org/P53340 and previous config saved to /var/cache/conftool/dbconfig/20231113-152537-arnaudb.json [production]
15:14 <fabfur> swapped cp1103 <-> cp1078 (T349244) [production]
15:14 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb1020.eqiad.wmnet [production]
15:13 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp1103.eqiad.wmnet [production]
15:13 <fabfur@cumin1001> START - Cookbook sre.hosts.remove-downtime for cp1103.eqiad.wmnet [production]
15:10 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1230', diff saved to https://phabricator.wikimedia.org/P53339 and previous config saved to /var/cache/conftool/dbconfig/20231113-151031-arnaudb.json [production]
15:08 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host clouddb1020.eqiad.wmnet [production]
15:07 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb1019.eqiad.wmnet [production]
15:07 <fabfur> swapped cp1102 <-> cp1077 (T349244) [production]
15:04 <fabfur@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp1102.eqiad.wmnet [production]
15:04 <fabfur@cumin1001> START - Cookbook sre.hosts.remove-downtime for cp1102.eqiad.wmnet [production]
15:00 <kamila@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
15:00 <kamila@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
15:00 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host clouddb1019.eqiad.wmnet [production]
14:59 <oblivian@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
14:59 <oblivian@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
14:58 <oblivian@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
14:58 <oblivian@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
14:57 <btullis@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb1018.eqiad.wmnet [production]
14:56 <kamila@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
14:56 <kamila@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
14:55 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1230 (T348183)', diff saved to https://phabricator.wikimedia.org/P53338 and previous config saved to /var/cache/conftool/dbconfig/20231113-145524-arnaudb.json [production]
14:52 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db1230 (T348183)', diff saved to https://phabricator.wikimedia.org/P53337 and previous config saved to /var/cache/conftool/dbconfig/20231113-145223-arnaudb.json [production]
14:52 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1230.eqiad.wmnet with reason: Maintenance [production]
14:52 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1230.eqiad.wmnet with reason: Maintenance [production]
14:51 <urbanecm> mwmaint2002: stop `extensions/DiscussionTools/maintenance/persistRevisionThreadItems.php --wiki frwiki` again, memory leak didn't stop (T315510) [production]
14:50 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance [production]
14:49 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1216.eqiad.wmnet with reason: Maintenance [production]
14:49 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1213:3315 (T348183)', diff saved to https://phabricator.wikimedia.org/P53336 and previous config saved to /var/cache/conftool/dbconfig/20231113-144947-arnaudb.json [production]
14:46 <btullis@cumin1001> START - Cookbook sre.hosts.reboot-single for host clouddb1018.eqiad.wmnet [production]
14:43 <urbanecm> mwmaint2002: foreachwiki extensions/WikimediaMaintenance/createExtensionTables.php MediaModeration (T350321) [production]
14:41 <bblack> cp2027: varnish-frontend-restart to test tcp listen port changes [production]
14:40 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:973784|Deploy Reader Demographics 2 survey (T345951)]], [[gerrit:973788|Add mediamoderation_scan table (T350321)]] (duration: 09m 13s) [production]
14:38 <urbanecm> mwmaint2002: Start several instances of `extensions/DiscussionTools/maintenance/persistRevisionThreadItems.php` (T315510) [production]
14:35 <urbanecm@deploy2002> urbanecm and dani: Continuing with sync [production]
14:34 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1213:3315', diff saved to https://phabricator.wikimedia.org/P53335 and previous config saved to /var/cache/conftool/dbconfig/20231113-143440-arnaudb.json [production]
14:34 <btullis@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host clouddb1017.eqiad.wmnet [production]
14:32 <urbanecm@deploy2002> urbanecm and dani: Backport for [[gerrit:973784|Deploy Reader Demographics 2 survey (T345951)]], [[gerrit:973788|Add mediamoderation_scan table (T350321)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
14:31 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:973784|Deploy Reader Demographics 2 survey (T345951)]], [[gerrit:973788|Add mediamoderation_scan table (T350321)]] [production]
14:30 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:973339|ParserOutputAccess: Limit local cache size (T315510)]] (duration: 06m 42s) [production]
14:30 <moritzm> installing debianutils bugfix updates from Bookworm point release [production]