|
2025-11-24
ยง
|
| 18:20 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2190', diff saved to https://phabricator.wikimedia.org/P85541 and previous config saved to /var/cache/conftool/dbconfig/20251124-182011-marostegui.json |
[production] |
| 18:19 |
<jclark@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1030.eqiad.wmnet with reason: host reimage |
[production] |
| 18:16 |
<swfrench-wmf> |
silenced EtcdReplicationDown. f75c71c9-62d3-449f-860a-9b5e4570717a - T405950 |
[production] |
| 18:11 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1030.eqiad.wmnet with reason: host reimage |
[production] |
| 18:09 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wdqs1028.eqiad.wmnet with OS bookworm |
[production] |
| 18:05 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2190 (T410531)', diff saved to https://phabricator.wikimedia.org/P85540 and previous config saved to /var/cache/conftool/dbconfig/20251124-180503-marostegui.json |
[production] |
| 18:05 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wdqs1031.eqiad.wmnet with OS trixie |
[production] |
| 17:55 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wdqs1030.eqiad.wmnet with OS trixie |
[production] |
| 17:51 |
<jclark@cumin1003> |
START - Cookbook sre.hosts.reimage for host wdqs1029.eqiad.wmnet with OS bookworm |
[production] |
| 17:45 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2190 (T410531)', diff saved to https://phabricator.wikimedia.org/P85539 and previous config saved to /var/cache/conftool/dbconfig/20251124-174501-marostegui.json |
[production] |
| 17:44 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2190.codfw.wmnet with reason: Maintenance |
[production] |
| 17:44 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2177 (T410531)', diff saved to https://phabricator.wikimedia.org/P85538 and previous config saved to /var/cache/conftool/dbconfig/20251124-174437-marostegui.json |
[production] |
| 17:29 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P85537 and previous config saved to /var/cache/conftool/dbconfig/20251124-172929-marostegui.json |
[production] |
| 17:27 |
<cgoubert@deploy2002> |
helmfile [staging] DONE helmfile.d/services/rest-gateway: apply |
[production] |
| 17:26 |
<cgoubert@deploy2002> |
helmfile [staging] START helmfile.d/services/rest-gateway: apply |
[production] |
| 17:24 |
<urbanecm@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply |
[production] |
| 17:23 |
<urbanecm@deploy2002> |
helmfile [codfw] START helmfile.d/services/mw-experimental: apply |
[production] |
| 17:14 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P85536 and previous config saved to /var/cache/conftool/dbconfig/20251124-171418-marostegui.json |
[production] |
| 17:11 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1032.eqiad.wmnet with OS trixie |
[production] |
| 17:10 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1030.eqiad.wmnet with OS trixie |
[production] |
| 17:10 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1029.eqiad.wmnet with OS trixie |
[production] |
| 17:09 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1028.eqiad.wmnet with OS trixie |
[production] |
| 17:09 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1019.eqiad.wmnet |
[production] |
| 17:03 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1019.eqiad.wmnet |
[production] |
| 17:02 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1013.eqiad.wmnet |
[production] |
| 17:01 |
<bking@cumin2002> |
START - Cookbook sre.hosts.reimage for host wdqs1032.eqiad.wmnet with OS trixie |
[production] |
| 17:00 |
<bking@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1031.eqiad.wmnet with OS trixie |
[production] |
| 16:59 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2177 (T410531)', diff saved to https://phabricator.wikimedia.org/P85535 and previous config saved to /var/cache/conftool/dbconfig/20251124-165910-marostegui.json |
[production] |
| 16:56 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1013.eqiad.wmnet |
[production] |
| 16:43 |
<jdrewniak@deploy2002> |
Synchronized portals: Wikimedia Portals Update: [[gerrit:1210618| Bumping portals to master (T128546)]] (duration: 01m 59s) |
[production] |
| 16:41 |
<jdrewniak@deploy2002> |
Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:1210618| Bumping portals to master (T128546)]] (duration: 08m 44s) |
[production] |
| 16:36 |
<btullis@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-master1004.eqiad.wmnet,an-redacteddb1001.eqiad.wmnet,an-test-coord1001.eqiad.wmnet with reason: Prepping for switch swap |
[production] |
| 16:34 |
<btullis@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on stat1011.eqiad.wmnet with reason: Prepping for switch swap |
[production] |
| 16:34 |
<btullis@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-test-master1002.eqiad.wmnet with reason: Prepping for switch swap |
[production] |
| 16:33 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Depooling db2177 (T410531)', diff saved to https://phabricator.wikimedia.org/P85534 and previous config saved to /var/cache/conftool/dbconfig/20251124-163345-marostegui.json |
[production] |
| 16:33 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2177.codfw.wmnet with reason: Maintenance |
[production] |
| 16:33 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2156 (T410531)', diff saved to https://phabricator.wikimedia.org/P85533 and previous config saved to /var/cache/conftool/dbconfig/20251124-163320-marostegui.json |
[production] |
| 16:33 |
<jmm@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1169.eqiad.wmnet with OS bookworm |
[production] |
| 16:32 |
<btullis@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dse-k8s-worker[1011,1013,1019].eqiad.wmnet with reason: Prepping for switch swap |
[production] |
| 16:30 |
<moritzm> |
installing usb.ids updates from Bookworm point release |
[production] |
| 16:28 |
<mwpresync@deploy2002> |
Finished scap build-images: Publishing wmf/next image (duration: 13m 52s) |
[production] |
| 16:23 |
<urbanecm> |
Delete job/growthexperiments-refreshlinkrecommendations-s2-29399967 and job/growthexperiments-refreshlinkrecommendations-s3-29399607 (T407818) |
[production] |
| 16:23 |
<jmm@dns1004> |
END - running authdns-update |
[production] |
| 16:22 |
<jmm@dns1004> |
START - running authdns-update |
[production] |
| 16:18 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P85532 and previous config saved to /var/cache/conftool/dbconfig/20251124-161813-marostegui.json |
[production] |
| 16:16 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply |
[production] |
| 16:15 |
<btullis@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply |
[production] |
| 16:14 |
<mwpresync@deploy2002> |
Started scap build-images: Publishing wmf/next image |
[production] |
| 16:14 |
<hnowlan@deploy2002> |
helmfile [staging] DONE helmfile.d/services/thumbor: apply |
[production] |
| 16:14 |
<hnowlan@deploy2002> |
helmfile [staging] START helmfile.d/services/thumbor: apply |
[production] |