351-400 of 10000 results (76ms)
2023-09-28 ยง
14:08 <cdanis> repooling cp5030 after haproxy upgrade & config deploy T317799 [production]
14:02 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1228.eqiad.wmnet with OS bullseye [production]
14:02 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host db1228.eqiad.wmnet with OS bullseye [production]
14:02 <cdanis> depooling cp5030 for haproxy upgrade & testing T317799 [production]
14:01 <moritzm> installing gsl security updates [production]
14:00 <klausman> restarted pybal on lvs1020 and lvs2014 (LVS low-traffic backups) for T347278 (ORES turndown) [production]
13:57 <taavi@deploy2002> Finished scap: Backport for [[gerrit:961237|Set WRITE_BOTH for CA wikis on OATHAuth multiple devices (T242031)]] (duration: 11m 02s) [production]
13:57 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2158', diff saved to https://phabricator.wikimedia.org/P52720 and previous config saved to /var/cache/conftool/dbconfig/20230928-135612-arnaudb.json [production]
13:52 <bking@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
13:52 <bking@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
13:52 <moritzm> installing flac security updates [production]
13:50 <taavi@deploy2002> taavi: Continuing with sync [production]
13:49 <jhancock@cumin2002> START - Cookbook sre.hosts.reimage for host db1229.eqiad.wmnet with OS bullseye [production]
13:47 <taavi@deploy2002> taavi: Backport for [[gerrit:961237|Set WRITE_BOTH for CA wikis on OATHAuth multiple devices (T242031)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:47 <bking@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
13:47 <bking@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
13:45 <taavi@deploy2002> Started scap: Backport for [[gerrit:961237|Set WRITE_BOTH for CA wikis on OATHAuth multiple devices (T242031)]] [production]
13:43 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:957842|Enable WikiLove on arwikisource (T346391)]] (duration: 11m 10s) [production]
13:41 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2158', diff saved to https://phabricator.wikimedia.org/P52719 and previous config saved to /var/cache/conftool/dbconfig/20230928-134105-arnaudb.json [production]
13:37 <urbanecm@deploy2002> zoranzoki21 and urbanecm: Continuing with sync [production]
13:33 <urbanecm@deploy2002> zoranzoki21 and urbanecm: Backport for [[gerrit:957842|Enable WikiLove on arwikisource (T346391)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:31 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:957842|Enable WikiLove on arwikisource (T346391)]] [production]
13:31 <jmm@cumin2002> END (PASS) - Cookbook sre.maps.roll-restart-reboot-master (exit_code=0) rolling reboot on A:maps-master-eqiad [production]
13:31 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:961742|wikifunctionswiki: Disable NearbyPages (T345459)]] (duration: 11m 07s) [production]
13:28 <urbanecm> mwscript extensions/WikimediaMaintenance/createExtensionTables.php --wiki=arwikisource wikilove # T346391 [production]
13:26 <arnaudb@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db2158 (T343198)', diff saved to https://phabricator.wikimedia.org/P52718 and previous config saved to /var/cache/conftool/dbconfig/20230928-132559-arnaudb.json [production]
13:25 <bking@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
13:25 <urbanecm@deploy2002> ammarpad and urbanecm: Continuing with sync [production]
13:25 <bking@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
13:24 <jmm@cumin2002> START - Cookbook sre.maps.roll-restart-reboot-master rolling reboot on A:maps-master-eqiad [production]
13:21 <urbanecm@deploy2002> ammarpad and urbanecm: Backport for [[gerrit:961742|wikifunctionswiki: Disable NearbyPages (T345459)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:20 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:961742|wikifunctionswiki: Disable NearbyPages (T345459)]] [production]
13:19 <urbanecm@deploy2002> Finished scap: Backport for [[gerrit:960559|Enable Campaigns email on test wiki (T347065)]] (duration: 12m 31s) [production]
13:13 <urbanecm@deploy2002> urbanecm and mhorsey: Continuing with sync [production]
13:08 <urbanecm@deploy2002> urbanecm and mhorsey: Backport for [[gerrit:960559|Enable Campaigns email on test wiki (T347065)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
13:07 <fabfur@cumin1001> START - Cookbook sre.cdn.roll-restart-varnish rolling restart of Varnish on 7 hosts matching query A:cp-upload_ulsfo and not P{cp4052*} [production]
13:07 <fabfur@cumin1001> START - Cookbook sre.cdn.roll-restart-varnish rolling restart of Varnish on 8 hosts matching query A:cp-text_ulsfo [production]
13:06 <urbanecm@deploy2002> Started scap: Backport for [[gerrit:960559|Enable Campaigns email on test wiki (T347065)]] [production]
13:04 <elukey@deploy2002> helmfile [ml-serve-eqiad] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
13:03 <elukey@deploy2002> helmfile [ml-serve-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
13:03 <elukey@deploy2002> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
12:47 <elukey> restart thanos-query on titan1002 [production]
12:44 <elukey> restart thanos-query on titan1001 [production]
12:41 <jmm@cumin2002> END (PASS) - Cookbook sre.maps.roll-restart-reboot-master (exit_code=0) rolling reboot on A:maps-master-codfw [production]
12:31 <jmm@cumin2002> START - Cookbook sre.maps.roll-restart-reboot-master rolling reboot on A:maps-master-codfw [production]
11:56 <arnaudb@cumin1001> dbctl commit (dc=all): 'Depooling db2106 (T343198)', diff saved to https://phabricator.wikimedia.org/P52717 and previous config saved to /var/cache/conftool/dbconfig/20230928-115619-arnaudb.json [production]
11:56 <arnaudb@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2106.codfw.wmnet with reason: Maintenance [production]
11:56 <arnaudb@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2106.codfw.wmnet with reason: Maintenance [production]
11:30 <jayme@deploy1002> helmfile [eqiad] DONE helmfile.d/services/machinetranslation: apply [production]
11:26 <jayme@deploy1002> helmfile [eqiad] START helmfile.d/services/machinetranslation: apply [production]