2023-05-17
ยง
|
15:52 |
<jelto@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/miscweb: apply |
[production] |
15:50 |
<jelto@deploy1002> |
helmfile [codfw] START helmfile.d/services/miscweb: apply |
[production] |
15:49 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es2032', diff saved to https://phabricator.wikimedia.org/P48351 and previous config saved to /var/cache/conftool/dbconfig/20230517-154916-ladsgroup.json |
[production] |
15:46 |
<elukey@deploy1002> |
helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
15:39 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es1032 (T335845)', diff saved to https://phabricator.wikimedia.org/P48350 and previous config saved to /var/cache/conftool/dbconfig/20230517-153925-ladsgroup.json |
[production] |
15:38 |
<elukey@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
15:34 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es2032 (T335845)', diff saved to https://phabricator.wikimedia.org/P48349 and previous config saved to /var/cache/conftool/dbconfig/20230517-153410-ladsgroup.json |
[production] |
15:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling es1032 (T335845)', diff saved to https://phabricator.wikimedia.org/P48348 and previous config saved to /var/cache/conftool/dbconfig/20230517-153042-ladsgroup.json |
[production] |
15:30 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1032.eqiad.wmnet with reason: Maintenance |
[production] |
15:30 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1032.eqiad.wmnet with reason: Maintenance |
[production] |
15:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling es2032 (T335845)', diff saved to https://phabricator.wikimedia.org/P48347 and previous config saved to /var/cache/conftool/dbconfig/20230517-153010-ladsgroup.json |
[production] |
15:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es1027 (T335845)', diff saved to https://phabricator.wikimedia.org/P48346 and previous config saved to /var/cache/conftool/dbconfig/20230517-153004-ladsgroup.json |
[production] |
15:30 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2032.codfw.wmnet with reason: Maintenance |
[production] |
15:29 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es2032.codfw.wmnet with reason: Maintenance |
[production] |
15:29 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es2028 (T335845)', diff saved to https://phabricator.wikimedia.org/P48345 and previous config saved to /var/cache/conftool/dbconfig/20230517-152945-ladsgroup.json |
[production] |
15:29 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host irc2002.wikimedia.org |
[production] |
15:25 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host irc2002.wikimedia.org |
[production] |
15:18 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host irc1002.wikimedia.org |
[production] |
15:14 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es1027', diff saved to https://phabricator.wikimedia.org/P48344 and previous config saved to /var/cache/conftool/dbconfig/20230517-151458-ladsgroup.json |
[production] |
15:14 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host irc1002.wikimedia.org |
[production] |
15:14 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es2028', diff saved to https://phabricator.wikimedia.org/P48343 and previous config saved to /var/cache/conftool/dbconfig/20230517-151438-ladsgroup.json |
[production] |
15:07 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host zookeeper-test1002.eqiad.wmnet |
[production] |
15:07 |
<aikochou@deploy1002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . |
[production] |
15:01 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host zookeeper-test1002.eqiad.wmnet |
[production] |
14:59 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es1027', diff saved to https://phabricator.wikimedia.org/P48342 and previous config saved to /var/cache/conftool/dbconfig/20230517-145952-ladsgroup.json |
[production] |
14:59 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es2028', diff saved to https://phabricator.wikimedia.org/P48341 and previous config saved to /var/cache/conftool/dbconfig/20230517-145932-ladsgroup.json |
[production] |
14:48 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.aqs.roll-restart-reboot (exit_code=0) rolling reboot on P{aqs101[6-9]*} and A:aqs |
[production] |
14:44 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es1027 (T335845)', diff saved to https://phabricator.wikimedia.org/P48340 and previous config saved to /var/cache/conftool/dbconfig/20230517-144446-ladsgroup.json |
[production] |
14:44 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance es2028 (T335845)', diff saved to https://phabricator.wikimedia.org/P48339 and previous config saved to /var/cache/conftool/dbconfig/20230517-144425-ladsgroup.json |
[production] |
14:40 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling es2028 (T335845)', diff saved to https://phabricator.wikimedia.org/P48338 and previous config saved to /var/cache/conftool/dbconfig/20230517-144025-ladsgroup.json |
[production] |
14:40 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es2028.codfw.wmnet with reason: Maintenance |
[production] |
14:40 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es2028.codfw.wmnet with reason: Maintenance |
[production] |
14:39 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling es1027 (T335845)', diff saved to https://phabricator.wikimedia.org/P48337 and previous config saved to /var/cache/conftool/dbconfig/20230517-143949-ladsgroup.json |
[production] |
14:39 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1027.eqiad.wmnet with reason: Maintenance |
[production] |
14:39 |
<otto@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: wgEventStreams - EventBus: produce to mediawiki.page_change.v1 stream - T336817 (duration: 06m 20s) |
[production] |
14:39 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1027.eqiad.wmnet with reason: Maintenance |
[production] |
14:38 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on A:dse-k8s-worker |
[production] |
14:36 |
<moritzm> |
installing jackson-databind security updates |
[production] |
14:34 |
<xcollazo@deploy1002> |
Finished deploy [airflow-dags/platform_eng@ad1cc7c]: deploying hotfix for T336800 (duration: 00m 09s) |
[production] |
14:34 |
<xcollazo@deploy1002> |
Started deploy [airflow-dags/platform_eng@ad1cc7c]: deploying hotfix for T336800 |
[production] |
14:33 |
<ottomata> |
EventBus: produce to mediawiki.page_change.v1 stream - T336817 |
[production] |
14:30 |
<otto@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-main: sync |
[production] |
14:30 |
<otto@deploy1002> |
helmfile [eqiad] START helmfile.d/services/eventgate-main: sync |
[production] |
14:28 |
<otto@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/eventgate-main: sync |
[production] |
14:28 |
<otto@deploy1002> |
helmfile [codfw] START helmfile.d/services/eventgate-main: sync |
[production] |
14:27 |
<otto@deploy1002> |
helmfile [staging] DONE helmfile.d/services/eventgate-main: sync |
[production] |
14:27 |
<otto@deploy1002> |
helmfile [staging] START helmfile.d/services/eventgate-main: sync |
[production] |
14:27 |
<ottomata> |
rolling restart of eventgate-main to pick up new mediawiki.page_change.v1 stream config - T336817 |
[production] |
14:17 |
<elukey> |
run authdns-update for new ml-serve/ores discovery endpoints - T336726 |
[production] |
14:15 |
<jmm@cumin2002> |
START - Cookbook sre.aqs.roll-restart-reboot rolling reboot on P{aqs101[6-9]*} and A:aqs |
[production] |