2023-10-31
§
|
10:50 |
<fnegri@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcontrol1007.eqiad.wmnet with reason: host reimage |
[production] |
10:48 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db1230 (re)pooling @ 20%: db1230 host warmup', diff saved to https://phabricator.wikimedia.org/P53094 and previous config saved to /var/cache/conftool/dbconfig/20231031-104839-arnaudb.json |
[production] |
10:38 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db1227 (re)pooling @ 10%: dh1227 host warmup', diff saved to https://phabricator.wikimedia.org/P53093 and previous config saved to /var/cache/conftool/dbconfig/20231031-103804-arnaudb.json |
[production] |
10:37 |
<fnegri@cumin1001> |
START - Cookbook sre.hosts.reimage for host cloudcontrol1007.eqiad.wmnet with OS bookworm |
[production] |
10:33 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db1230 (re)pooling @ 10%: db1230 host warmup', diff saved to https://phabricator.wikimedia.org/P53092 and previous config saved to /var/cache/conftool/dbconfig/20231031-103334-arnaudb.json |
[production] |
10:22 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db1227 (re)pooling @ 5%: dh1227 host warmup', diff saved to https://phabricator.wikimedia.org/P53091 and previous config saved to /var/cache/conftool/dbconfig/20231031-102259-arnaudb.json |
[production] |
10:18 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db1230 (re)pooling @ 5%: db1230 host warmup', diff saved to https://phabricator.wikimedia.org/P53090 and previous config saved to /var/cache/conftool/dbconfig/20231031-101829-arnaudb.json |
[production] |
10:17 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'set db1230 as a depooled host', diff saved to https://phabricator.wikimedia.org/P53089 and previous config saved to /var/cache/conftool/dbconfig/20231031-101750-arnaudb.json |
[production] |
09:50 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Depooling db2179 (T343198)', diff saved to https://phabricator.wikimedia.org/P53088 and previous config saved to /var/cache/conftool/dbconfig/20231031-095054-arnaudb.json |
[production] |
09:50 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2179.codfw.wmnet with reason: Maintenance |
[production] |
09:50 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2179.codfw.wmnet with reason: Maintenance |
[production] |
09:47 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'set db1230 as a depooled host', diff saved to https://phabricator.wikimedia.org/P53087 and previous config saved to /var/cache/conftool/dbconfig/20231031-094737-arnaudb.json |
[production] |
09:39 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'set db1230 as a depooled host', diff saved to https://phabricator.wikimedia.org/P53086 and previous config saved to /var/cache/conftool/dbconfig/20231031-093919-arnaudb.json |
[production] |
09:34 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 (re)pooling @ 100%: Host warmup', diff saved to https://phabricator.wikimedia.org/P53085 and previous config saved to /var/cache/conftool/dbconfig/20231031-093457-arnaudb.json |
[production] |
09:34 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Set ', diff saved to https://phabricator.wikimedia.org/P53084 and previous config saved to /var/cache/conftool/dbconfig/20231031-093448-arnaudb.json |
[production] |
09:01 |
<elukey@deploy2002> |
helmfile [staging] DONE helmfile.d/services/changeprop: sync |
[production] |
09:00 |
<elukey@deploy2002> |
helmfile [staging] START helmfile.d/services/changeprop: sync |
[production] |
08:57 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db1230 (re)pooling @ 5%: db1230 host warmup', diff saved to https://phabricator.wikimedia.org/P53083 and previous config saved to /var/cache/conftool/dbconfig/20231031-085740-arnaudb.json |
[production] |
08:56 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db1230 config append', diff saved to https://phabricator.wikimedia.org/P53082 and previous config saved to /var/cache/conftool/dbconfig/20231031-085615-arnaudb.json |
[production] |
08:53 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 (re)pooling @ 90%: Host warmup', diff saved to https://phabricator.wikimedia.org/P53081 and previous config saved to /var/cache/conftool/dbconfig/20231031-085346-arnaudb.json |
[production] |
08:38 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 (re)pooling @ 75%: Host warmup', diff saved to https://phabricator.wikimedia.org/P53080 and previous config saved to /var/cache/conftool/dbconfig/20231031-083841-arnaudb.json |
[production] |
08:23 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 (re)pooling @ 60%: Host warmup', diff saved to https://phabricator.wikimedia.org/P53079 and previous config saved to /var/cache/conftool/dbconfig/20231031-082336-arnaudb.json |
[production] |
08:08 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 (re)pooling @ 45%: Host warmup', diff saved to https://phabricator.wikimedia.org/P53078 and previous config saved to /var/cache/conftool/dbconfig/20231031-080832-arnaudb.json |
[production] |
07:53 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 (re)pooling @ 30%: Host warmup', diff saved to https://phabricator.wikimedia.org/P53077 and previous config saved to /var/cache/conftool/dbconfig/20231031-075327-arnaudb.json |
[production] |
07:38 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 (re)pooling @ 15%: Host warmup', diff saved to https://phabricator.wikimedia.org/P53076 and previous config saved to /var/cache/conftool/dbconfig/20231031-073822-arnaudb.json |
[production] |
07:36 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 weight rebalancing - depooled', diff saved to https://phabricator.wikimedia.org/P53075 and previous config saved to /var/cache/conftool/dbconfig/20231031-073652-arnaudb.json |
[production] |
07:33 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 weight rebalancing', diff saved to https://phabricator.wikimedia.org/P53074 and previous config saved to /var/cache/conftool/dbconfig/20231031-073312-arnaudb.json |
[production] |
07:30 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 depooling from API and pooling in db2140', diff saved to https://phabricator.wikimedia.org/P53073 and previous config saved to /var/cache/conftool/dbconfig/20231031-073023-arnaudb.json |
[production] |
07:19 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'db2179 weight mimic old db2140', diff saved to https://phabricator.wikimedia.org/P53072 and previous config saved to /var/cache/conftool/dbconfig/20231031-071938-arnaudb.json |
[production] |
07:05 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Promote db2140 to s4 primary and set section read-write T349820', diff saved to https://phabricator.wikimedia.org/P53071 and previous config saved to /var/cache/conftool/dbconfig/20231031-070549-arnaudb.json |
[production] |
07:04 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Set s4 codfw as read-only for maintenance - T349820', diff saved to https://phabricator.wikimedia.org/P53070 and previous config saved to /var/cache/conftool/dbconfig/20231031-070405-arnaudb.json |
[production] |
07:02 |
<arnaudb> |
Starting s4 codfw failover from db2179 to db2140 - T349820 |
[production] |
06:49 |
<marostegui@deploy2002> |
Finished scap: Backport for [[gerrit:969772|Revert "ProductionServices.php: Promote pc2014 to pc1 master"]] (duration: 07m 12s) |
[production] |
06:44 |
<marostegui@deploy2002> |
marostegui: Continuing with sync |
[production] |
06:43 |
<marostegui@deploy2002> |
marostegui: Backport for [[gerrit:969772|Revert "ProductionServices.php: Promote pc2014 to pc1 master"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
06:42 |
<marostegui@deploy2002> |
Started scap: Backport for [[gerrit:969772|Revert "ProductionServices.php: Promote pc2014 to pc1 master"]] |
[production] |
06:36 |
<arnaudb@cumin1001> |
dbctl commit (dc=all): 'Set db2140 with weight 0 T349820', diff saved to https://phabricator.wikimedia.org/P53068 and previous config saved to /var/cache/conftool/dbconfig/20231031-063647-arnaudb.json |
[production] |
06:33 |
<arnaudb@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 34 hosts with reason: Primary switchover s4 T349820 |
[production] |
06:33 |
<arnaudb@cumin1001> |
START - Cookbook sre.hosts.downtime for 2:00:00 on 34 hosts with reason: Primary switchover s4 T349820 |
[production] |
06:31 |
<marostegui@deploy2002> |
Finished scap: Backport for [[gerrit:970033|ProductionServices.php: Promote pc2014 to pc1 master]] (duration: 06m 50s) |
[production] |
06:26 |
<marostegui@deploy2002> |
marostegui: Continuing with sync |
[production] |
06:25 |
<marostegui@deploy2002> |
marostegui: Backport for [[gerrit:970033|ProductionServices.php: Promote pc2014 to pc1 master]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
06:24 |
<marostegui@deploy2002> |
Started scap: Backport for [[gerrit:970033|ProductionServices.php: Promote pc2014 to pc1 master]] |
[production] |
03:55 |
<mwpresync@deploy2002> |
Pruned MediaWiki: 1.42.0-wmf.1 (duration: 02m 14s) |
[production] |
03:53 |
<mwpresync@deploy2002> |
Finished scap: testwikis wikis to 1.42.0-wmf.3 refs T348356 (duration: 50m 44s) |
[production] |
03:02 |
<mwpresync@deploy2002> |
Started scap: testwikis wikis to 1.42.0-wmf.3 refs T348356 |
[production] |
00:46 |
<sukhe@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1103.eqiad.wmnet with OS bullseye |
[production] |
00:29 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp1103.eqiad.wmnet with OS bullseye |
[production] |
00:19 |
<sukhe@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp1103.eqiad.wmnet with OS bullseye |
[production] |