2025-03-24
ยง
|
12:22 |
<ladsgroup@deploy1003> |
Synchronized portals/wikipedia.org/assets: Minor wikimedia.org mobile fixes (T373204) (duration: 11m 37s) |
[production] |
12:19 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2205.codfw.wmnet |
[production] |
12:12 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depool db2205 T389377', diff saved to https://phabricator.wikimedia.org/P74335 and previous config saved to /var/cache/conftool/dbconfig/20250324-121227-fceratto.json |
[production] |
12:10 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2214 (re)pooling @ 50%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74334 and previous config saved to /var/cache/conftool/dbconfig/20250324-121030-root.json |
[production] |
12:09 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Promote db2209 to s3 primary T389377', diff saved to https://phabricator.wikimedia.org/P74333 and previous config saved to /var/cache/conftool/dbconfig/20250324-120947-fceratto.json |
[production] |
12:09 |
<elukey> |
revert rate-limit replicas from 6 -> 3 on Wikikube eqiad |
[production] |
12:08 |
<federico3> |
Starting s3 codfw failover from db2205 to db2209 - T389377 |
[production] |
12:03 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host maps-test2002.codfw.wmnet with OS bookworm |
[production] |
12:02 |
<ladsgroup@deploy1003> |
Finished scap sync-world: Backport for [[gerrit:1130558|Enable dataRedundancy for mainstash (T383327)]] (duration: 15m 43s) |
[production] |
11:58 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2169 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P74332 and previous config saved to /var/cache/conftool/dbconfig/20250324-115843-root.json |
[production] |
11:56 |
<elukey> |
temporarily bump rate-limit replicas from 3 -> 6 on Wikikube eqiad |
[production] |
11:55 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2214 (re)pooling @ 25%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74331 and previous config saved to /var/cache/conftool/dbconfig/20250324-115524-root.json |
[production] |
11:55 |
<ladsgroup@deploy1003> |
ladsgroup: Continuing with sync |
[production] |
11:54 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Set db2209 with weight 0 T389377', diff saved to https://phabricator.wikimedia.org/P74330 and previous config saved to /var/cache/conftool/dbconfig/20250324-115457-fceratto.json |
[production] |
11:54 |
<fceratto@cumin1002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 24 hosts with reason: Primary switchover s3 T389377 |
[production] |
11:51 |
<ladsgroup@deploy1003> |
ladsgroup: Backport for [[gerrit:1130558|Enable dataRedundancy for mainstash (T383327)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
11:46 |
<ladsgroup@deploy1003> |
Started scap sync-world: Backport for [[gerrit:1130558|Enable dataRedundancy for mainstash (T383327)]] |
[production] |
11:46 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db2212 - Depool db2188.codfw.wmnet to then clone it to db2212.codfw.wmnet - fceratto@cumin1002 |
[production] |
11:46 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.depool db2212 - Depool db2188.codfw.wmnet to then clone it to db2212.codfw.wmnet - fceratto@cumin1002 |
[production] |
11:46 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.clone of db2188.codfw.wmnet onto db2212.codfw.wmnet |
[production] |
11:45 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on maps-test2002.codfw.wmnet with reason: host reimage |
[production] |
11:44 |
<moritzm> |
installing subversion security updates |
[production] |
11:43 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2169 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P74328 and previous config saved to /var/cache/conftool/dbconfig/20250324-114338-root.json |
[production] |
11:43 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on maps-test2002.codfw.wmnet with reason: host reimage |
[production] |
11:42 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps-test2006.codfw.wmnet |
[production] |
11:40 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2214 (re)pooling @ 10%: Repooling after rebuild index', diff saved to https://phabricator.wikimedia.org/P74327 and previous config saved to /var/cache/conftool/dbconfig/20250324-114019-root.json |
[production] |
11:38 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.opensearch.roll-restart-reboot (exit_code=0) rolling restart_daemons on A:datahubsearch |
[production] |
11:34 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host maps-test2006.codfw.wmnet |
[production] |
11:31 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps-test2005.codfw.wmnet |
[production] |
11:31 |
<fceratto@cumin1002> |
END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2212.codfw.wmnet |
[production] |
11:28 |
<btullis@cumin1002> |
START - Cookbook sre.opensearch.roll-restart-reboot rolling restart_daemons on A:datahubsearch |
[production] |
11:28 |
<moritzm> |
installing busybox security updates |
[production] |
11:28 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2169 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P74326 and previous config saved to /var/cache/conftool/dbconfig/20250324-112833-root.json |
[production] |
11:26 |
<fceratto@cumin1002> |
START - Cookbook sre.mysql.upgrade for db2212.codfw.wmnet |
[production] |
11:24 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host maps-test2002.codfw.wmnet with OS bookworm |
[production] |
11:23 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host maps-test2005.codfw.wmnet |
[production] |
11:22 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host maps-test2001.codfw.wmnet with OS bookworm |
[production] |
11:19 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ml-serve2005.codfw.wmnet with OS bookworm |
[production] |
11:13 |
<marostegui@cumin1002> |
dbctl commit (dc=all): 'db2169 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P74325 and previous config saved to /var/cache/conftool/dbconfig/20250324-111327-root.json |
[production] |
11:11 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Depool db2212 T389373', diff saved to https://phabricator.wikimedia.org/P74324 and previous config saved to /var/cache/conftool/dbconfig/20250324-111157-fceratto.json |
[production] |
11:11 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-mariadb1001.eqiad.wmnet |
[production] |
11:10 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host maps-test2004.codfw.wmnet |
[production] |
11:06 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage |
[production] |
11:04 |
<btullis> |
rebooting an-mariadb1001 for T376800 |
[analytics] |
11:04 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reboot-single for host an-mariadb1001.eqiad.wmnet |
[production] |
11:03 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host maps-test2004.codfw.wmnet |
[production] |
11:03 |
<fceratto@cumin1002> |
dbctl commit (dc=all): 'Promote db2203 to s1 primary T389373', diff saved to https://phabricator.wikimedia.org/P74323 and previous config saved to /var/cache/conftool/dbconfig/20250324-110321-fceratto.json |
[production] |
11:03 |
<elukey@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve2005.codfw.wmnet with reason: host reimage |
[production] |
11:01 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on maps-test2001.codfw.wmnet with reason: host reimage |
[production] |
11:01 |
<aborrero@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for main branch |
[admin] |