2024-05-20
ยง
|
22:02 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2198.codfw.wmnet with reason: Maintenance |
[production] |
22:02 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T352010)', diff saved to https://phabricator.wikimedia.org/P62722 and previous config saved to /var/cache/conftool/dbconfig/20240520-220247-ladsgroup.json |
[production] |
22:00 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:989216|Add account_conversion event streams. (T363815)]] |
[production] |
21:47 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P62721 and previous config saved to /var/cache/conftool/dbconfig/20240520-214739-ladsgroup.json |
[production] |
21:38 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons. |
[production] |
21:32 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182', diff saved to https://phabricator.wikimedia.org/P62720 and previous config saved to /var/cache/conftool/dbconfig/20240520-213230-ladsgroup.json |
[production] |
21:32 |
<bking@cumin2002> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-flink-eqiad cluster: Roll restart of jvm daemons. |
[production] |
21:29 |
<bking@cumin2002> |
END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-flink-codfw cluster: Roll restart of jvm daemons. |
[production] |
21:22 |
<bking@cumin2002> |
START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-flink-codfw cluster: Roll restart of jvm daemons. |
[production] |
21:17 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2182 (T352010)', diff saved to https://phabricator.wikimedia.org/P62719 and previous config saved to /var/cache/conftool/dbconfig/20240520-211721-ladsgroup.json |
[production] |
20:57 |
<ebernhardson@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
20:57 |
<ebernhardson@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
20:51 |
<ebernhardson@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
20:51 |
<ebernhardson@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
20:44 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:980063|Remove readability survey tool (T349337)]], [[gerrit:1034149|wgVectorShareUserScripts should be false now (T301212)]] (duration: 18m 34s) |
[production] |
20:38 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1185 (T352010)', diff saved to https://phabricator.wikimedia.org/P62718 and previous config saved to /var/cache/conftool/dbconfig/20240520-203811-ladsgroup.json |
[production] |
20:38 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1185.eqiad.wmnet with reason: Maintenance |
[production] |
20:37 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1185.eqiad.wmnet with reason: Maintenance |
[production] |
20:37 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1183 (T352010)', diff saved to https://phabricator.wikimedia.org/P62717 and previous config saved to /var/cache/conftool/dbconfig/20240520-203748-ladsgroup.json |
[production] |
20:30 |
<urbanecm@deploy1002> |
ksarabia and jdlrobson and urbanecm: Continuing with sync |
[production] |
20:28 |
<urbanecm@deploy1002> |
ksarabia and jdlrobson and urbanecm: Backport for [[gerrit:980063|Remove readability survey tool (T349337)]], [[gerrit:1034149|wgVectorShareUserScripts should be false now (T301212)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:25 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:980063|Remove readability survey tool (T349337)]], [[gerrit:1034149|wgVectorShareUserScripts should be false now (T301212)]] |
[production] |
20:25 |
<urbanecm@deploy1002> |
Finished scap: Backport for [[gerrit:1024813|Introduce sample overrides to web_ui_actions (T361962)]], [[gerrit:1031610|Disable wgParserEnableLegacyMediaDOM (T363597)]], [[gerrit:1031458|Disable last remaining projects using share user scripts (T301212)]] (duration: 18m 18s) |
[production] |
20:24 |
<eileen> |
config revision changed from 21dba21a to 22106526 |
[production] |
20:22 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1183', diff saved to https://phabricator.wikimedia.org/P62716 and previous config saved to /var/cache/conftool/dbconfig/20240520-202240-ladsgroup.json |
[production] |
20:11 |
<urbanecm@deploy1002> |
urbanecm and jdlrobson and ksarabia: Continuing with sync |
[production] |
20:09 |
<urbanecm@deploy1002> |
urbanecm and jdlrobson and ksarabia: Backport for [[gerrit:1024813|Introduce sample overrides to web_ui_actions (T361962)]], [[gerrit:1031610|Disable wgParserEnableLegacyMediaDOM (T363597)]], [[gerrit:1031458|Disable last remaining projects using share user scripts (T301212)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
20:07 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1183', diff saved to https://phabricator.wikimedia.org/P62715 and previous config saved to /var/cache/conftool/dbconfig/20240520-200732-ladsgroup.json |
[production] |
20:06 |
<urbanecm@deploy1002> |
Started scap: Backport for [[gerrit:1024813|Introduce sample overrides to web_ui_actions (T361962)]], [[gerrit:1031610|Disable wgParserEnableLegacyMediaDOM (T363597)]], [[gerrit:1031458|Disable last remaining projects using share user scripts (T301212)]] |
[production] |
19:52 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1183 (T352010)', diff saved to https://phabricator.wikimedia.org/P62714 and previous config saved to /var/cache/conftool/dbconfig/20240520-195224-ladsgroup.json |
[production] |
19:46 |
<swfrench@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cxserver: apply |
[production] |
19:45 |
<swfrench@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cxserver: apply |
[production] |
19:33 |
<swfrench@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/cxserver: apply |
[production] |
19:32 |
<swfrench@deploy1002> |
helmfile [codfw] START helmfile.d/services/cxserver: apply |
[production] |
19:31 |
<swfrench@deploy1002> |
helmfile [staging] DONE helmfile.d/services/cxserver: apply |
[production] |
19:31 |
<swfrench@deploy1002> |
helmfile [staging] START helmfile.d/services/cxserver: apply |
[production] |
19:23 |
<logmsgbot> |
@deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
19:23 |
<logmsgbot> |
@deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
19:20 |
<akosiaris@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-main1010.eqiad.wmnet with OS bullseye |
[production] |
19:09 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2161 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P62713 and previous config saved to /var/cache/conftool/dbconfig/20240520-190908-root.json |
[production] |
19:02 |
<ebernhardson@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
19:02 |
<ebernhardson@deploy1002> |
helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
18:54 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2161 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P62712 and previous config saved to /var/cache/conftool/dbconfig/20240520-185402-root.json |
[production] |
18:43 |
<logmsgbot> |
@deploy1002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
18:43 |
<logmsgbot> |
@deploy1002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply |
[production] |
18:38 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2161 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P62711 and previous config saved to /var/cache/conftool/dbconfig/20240520-183856-root.json |
[production] |
18:29 |
<swfrench@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/citoid: apply |
[production] |
18:28 |
<swfrench@deploy1002> |
helmfile [eqiad] START helmfile.d/services/citoid: apply |
[production] |
18:23 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2161 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P62710 and previous config saved to /var/cache/conftool/dbconfig/20240520-182350-root.json |
[production] |
18:16 |
<swfrench@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/citoid: apply |
[production] |