2025-10-14
|
05:16 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1221 (re)pooling @ 50%: 10', diff saved to https://phabricator.wikimedia.org/P83828 and previous config saved to /var/cache/conftool/dbconfig/20251014-051619-root.json |
[production] |
05:14 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on es[1031-1032].eqiad.wmnet with reason: Cloning |
[production] |
05:01 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'db1221 (re)pooling @ 25%: 10', diff saved to https://phabricator.wikimedia.org/P83826 and previous config saved to /var/cache/conftool/dbconfig/20251014-050113-root.json |
[production] |
04:53 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depool db1221 for migration to mariadb 10.11', diff saved to https://phabricator.wikimedia.org/P83824 and previous config saved to /var/cache/conftool/dbconfig/20251014-045305-marostegui.json |
[production] |
04:53 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1221.eqiad.wmnet with reason: Maintenance |
[production] |
04:52 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 14 hosts with reason: Upgrading |
[production] |
04:41 |
<marostegui@cumin1003> |
START - Cookbook sre.mysql.pool es1033 gradually with 4 steps - Pool es1033.eqiad.wmnet in after cloning |
[production] |
04:02 |
<mwpresync@deploy2002> |
Pruned MediaWiki: 1.45.0-wmf.20 (duration: 02m 42s) |
[production] |
03:48 |
<mwpresync@deploy2002> |
Finished scap sync-world: testwikis to 1.45.0-wmf.23 refs T405679 (duration: 45m 02s) |
[production] |
03:03 |
<mwpresync@deploy2002> |
Started scap sync-world: testwikis to 1.45.0-wmf.23 refs T405679 |
[production] |
02:24 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf2003.codfw.wmnet |
[production] |
02:20 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host webperf2003.codfw.wmnet |
[production] |
02:09 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf1003.eqiad.wmnet |
[production] |
02:05 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host webperf1003.eqiad.wmnet |
[production] |
01:58 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mwlog2002.codfw.wmnet |
[production] |
01:52 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host mwlog2002.codfw.wmnet |
[production] |
01:45 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mwlog1002.eqiad.wmnet |
[production] |
01:39 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host mwlog1002.eqiad.wmnet |
[production] |
01:14 |
<mwpresync@deploy2002> |
Finished scap build-images: Publishing wmf/next image (duration: 13m 20s) |
[production] |
01:00 |
<mwpresync@deploy2002> |
Started scap build-images: Publishing wmf/next image |
[production] |
2025-10-13
|
23:50 |
<musikanimal@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1195756|Add 'accepted' status (T406674)]] (duration: 40m 01s) |
[production] |
23:38 |
<musikanimal@deploy2002> |
musikanimal: Continuing with sync |
[production] |
23:36 |
<musikanimal@deploy2002> |
musikanimal: Backport for [[gerrit:1195756|Add 'accepted' status (T406674)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
23:29 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.presto.reboot-workers (exit_code=0) for Presto an-presto cluster: Reboot Presto nodes |
[production] |
23:10 |
<musikanimal@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1195756|Add 'accepted' status (T406674)]] |
[production] |
22:34 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon2003.codfw.wmnet |
[production] |
22:30 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host kafkamon2003.codfw.wmnet |
[production] |
22:01 |
<btullis@cumin1003> |
START - Cookbook sre.presto.reboot-workers for Presto an-presto cluster: Reboot Presto nodes |
[production] |
22:01 |
<btullis@deploy2002> |
helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
22:01 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.druid.reboot-workers (exit_code=0) for Druid analytics cluster: Reboot Druid nodes |
[production] |
22:00 |
<btullis@deploy2002> |
helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
21:52 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kafkamon1003.eqiad.wmnet |
[production] |
21:48 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host kafkamon1003.eqiad.wmnet |
[production] |
21:05 |
<btullis@deploy2002> |
helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. |
[production] |
21:05 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host graphite2004.codfw.wmnet |
[production] |
21:03 |
<btullis@deploy2002> |
helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. |
[production] |
20:57 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host graphite2004.codfw.wmnet |
[production] |
20:56 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host graphite1005.eqiad.wmnet |
[production] |
20:52 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host graphite1005.eqiad.wmnet |
[production] |
20:52 |
<btullis@cumin1003> |
START - Cookbook sre.druid.reboot-workers for Druid analytics cluster: Reboot Druid nodes |
[production] |
20:45 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp2001.codfw.wmnet |
[production] |
20:39 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host arclamp2001.codfw.wmnet |
[production] |
20:34 |
<eileen> |
civicrm upgraded from 385f00d8 to 9393addf |
[production] |
20:25 |
<denisse@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp1001.eqiad.wmnet |
[production] |
20:22 |
<dani@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1191688|Undeploy Design Research participant recruitment survey on jawiki (T405577)]] (duration: 09m 01s) |
[production] |
20:19 |
<denisse@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host arclamp1001.eqiad.wmnet |
[production] |
20:18 |
<dani@deploy2002> |
dani: Continuing with sync |
[production] |
20:17 |
<dani@deploy2002> |
dani: Backport for [[gerrit:1191688|Undeploy Design Research participant recruitment survey on jawiki (T405577)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
20:13 |
<dani@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1191688|Undeploy Design Research participant recruitment survey on jawiki (T405577)]] |
[production] |
19:44 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.mysql.clone_es (exit_code=0) of es1027.eqiad.wmnet onto es1050.eqiad.wmnet |
[production] |