2024-10-17
§
|
14:28 |
<jayme@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster2005.codfw.wmnet with reason: host reimage |
[production] |
14:26 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2155 (T367781)', diff saved to https://phabricator.wikimedia.org/P70265 and previous config saved to /var/cache/conftool/dbconfig/20241017-142643-arnaudb.json |
[production] |
14:09 |
<jayme@cumin1002> |
START - Cookbook sre.hosts.reimage for host kubestagemaster2005.codfw.wmnet with OS bookworm |
[production] |
14:08 |
<urbanecm@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1080770|Bump wikimedia/parsoid to 0.20.0-a26 (T377287)]], [[gerrit:1080773|Bump wikimedia/parsoid to 0.20.0-a26 (T377287)]] (duration: 09m 41s) |
[production] |
14:03 |
<urbanecm@deploy2002> |
cscott, urbanecm: Continuing with sync |
[production] |
14:00 |
<urbanecm@deploy2002> |
cscott, urbanecm: Backport for [[gerrit:1080770|Bump wikimedia/parsoid to 0.20.0-a26 (T377287)]], [[gerrit:1080773|Bump wikimedia/parsoid to 0.20.0-a26 (T377287)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
14:00 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
13:59 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-analytics-test: apply |
[production] |
13:58 |
<urbanecm@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1080770|Bump wikimedia/parsoid to 0.20.0-a26 (T377287)]], [[gerrit:1080773|Bump wikimedia/parsoid to 0.20.0-a26 (T377287)]] |
[production] |
13:56 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
13:54 |
<bking@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
13:47 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db2172 (T376905)', diff saved to https://phabricator.wikimedia.org/P70264 and previous config saved to /var/cache/conftool/dbconfig/20241017-134651-ladsgroup.json |
[production] |
13:47 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2172.codfw.wmnet with reason: Maintenance |
[production] |
13:46 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2172.codfw.wmnet with reason: Maintenance |
[production] |
13:46 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2155 (T376905)', diff saved to https://phabricator.wikimedia.org/P70263 and previous config saved to /var/cache/conftool/dbconfig/20241017-134636-ladsgroup.json |
[production] |
13:35 |
<urbanecm@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1080805|Set $wgAllowRawHtmlCopyrightMessages = false (T375789)]], [[gerrit:1080828|tests: ensure maintenance base class has always been requierd (T377391 T357535)]] (duration: 08m 07s) |
[production] |
13:31 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P70261 and previous config saved to /var/cache/conftool/dbconfig/20241017-133129-ladsgroup.json |
[production] |
13:30 |
<urbanecm@deploy2002> |
cscott, urbanecm, matmarex: Continuing with sync |
[production] |
13:29 |
<urbanecm@deploy2002> |
cscott, urbanecm, matmarex: Backport for [[gerrit:1080805|Set $wgAllowRawHtmlCopyrightMessages = false (T375789)]], [[gerrit:1080828|tests: ensure maintenance base class has always been requierd (T377391 T357535)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:29 |
<urbanecm> |
[urbanecm@mwmaint2002 ~]$ mwscript updateCollation.php --wiki=cswikivoyage --previous-collation=uppercase # T377446 |
[production] |
13:27 |
<wmbot~melos@tools-bastion-13> |
Restarted StewardBot/SULWatcher because of a connection loss |
[tools.stewardbots] |
13:27 |
<urbanecm@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1080805|Set $wgAllowRawHtmlCopyrightMessages = false (T375789)]], [[gerrit:1080828|tests: ensure maintenance base class has always been requierd (T377391 T357535)]] |
[production] |
13:26 |
<arnaudb@cumin1002> |
dbctl commit (dc=all): 'Depooling db2155 (T367781)', diff saved to https://phabricator.wikimedia.org/P70260 and previous config saved to /var/cache/conftool/dbconfig/20241017-132617-arnaudb.json |
[production] |
13:26 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2187.codfw.wmnet with reason: Maintenance |
[production] |
13:26 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2187.codfw.wmnet with reason: Maintenance |
[production] |
13:26 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2155.codfw.wmnet with reason: Maintenance |
[production] |
13:26 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2155.codfw.wmnet with reason: Maintenance |
[production] |
13:24 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2204.codfw.wmnet with reason: Maintenance |
[production] |
13:24 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2204.codfw.wmnet with reason: Maintenance |
[production] |
13:23 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1222.eqiad.wmnet with reason: Maintenance |
[production] |
13:23 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db1222.eqiad.wmnet with reason: Maintenance |
[production] |
13:22 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
13:22 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
13:18 |
<inflatador> |
bking@wdqs1015 depooling to catch up on lag |
[production] |
13:16 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2155', diff saved to https://phabricator.wikimedia.org/P70258 and previous config saved to /var/cache/conftool/dbconfig/20241017-131622-ladsgroup.json |
[production] |
13:14 |
<urbanecm@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1081134|cswikivoyage: Set category collation to uca-cs-u-kn (T377446)]], [[gerrit:1081124|QuickSurveys: Update safety survey coverage (T376517)]] (duration: 07m 23s) |
[production] |
13:10 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1166 (T376905)', diff saved to https://phabricator.wikimedia.org/P70257 and previous config saved to /var/cache/conftool/dbconfig/20241017-131012-ladsgroup.json |
[production] |
13:10 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1166.eqiad.wmnet with reason: Maintenance |
[production] |
13:10 |
<urbanecm@deploy2002> |
kharlan, urbanecm: Continuing with sync |
[production] |
13:09 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1166.eqiad.wmnet with reason: Maintenance |
[production] |
13:09 |
<urbanecm@deploy2002> |
kharlan, urbanecm: Backport for [[gerrit:1081134|cswikivoyage: Set category collation to uca-cs-u-kn (T377446)]], [[gerrit:1081124|QuickSurveys: Update safety survey coverage (T376517)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
13:09 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1157 (T376905)', diff saved to https://phabricator.wikimedia.org/P70256 and previous config saved to /var/cache/conftool/dbconfig/20241017-130947-ladsgroup.json |
[production] |
13:09 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
13:07 |
<urbanecm@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1081134|cswikivoyage: Set category collation to uca-cs-u-kn (T377446)]], [[gerrit:1081124|QuickSurveys: Update safety survey coverage (T376517)]] |
[production] |
13:01 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db2155 (T376905)', diff saved to https://phabricator.wikimedia.org/P70255 and previous config saved to /var/cache/conftool/dbconfig/20241017-130115-ladsgroup.json |
[production] |
13:00 |
<jayme@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestagemaster2005.codfw.wmnet with OS bookworm |
[production] |
12:59 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
12:58 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2209.codfw.wmnet with reason: Maintenance |
[production] |
12:58 |
<arnaudb@cumin1002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on db2209.codfw.wmnet with reason: Maintenance |
[production] |
12:54 |
<arnaudb@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1189.eqiad.wmnet with reason: Maintenance |
[production] |