101-150 of 10000 results (89ms)
2025-11-24 ยง
18:20 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2190', diff saved to https://phabricator.wikimedia.org/P85541 and previous config saved to /var/cache/conftool/dbconfig/20251124-182011-marostegui.json [production]
18:19 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1030.eqiad.wmnet with reason: host reimage [production]
18:16 <swfrench-wmf> silenced EtcdReplicationDown. f75c71c9-62d3-449f-860a-9b5e4570717a - T405950 [production]
18:11 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1030.eqiad.wmnet with reason: host reimage [production]
18:09 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host wdqs1028.eqiad.wmnet with OS bookworm [production]
18:05 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2190 (T410531)', diff saved to https://phabricator.wikimedia.org/P85540 and previous config saved to /var/cache/conftool/dbconfig/20251124-180503-marostegui.json [production]
18:05 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host wdqs1031.eqiad.wmnet with OS trixie [production]
17:55 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host wdqs1030.eqiad.wmnet with OS trixie [production]
17:51 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host wdqs1029.eqiad.wmnet with OS bookworm [production]
17:45 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2190 (T410531)', diff saved to https://phabricator.wikimedia.org/P85539 and previous config saved to /var/cache/conftool/dbconfig/20251124-174501-marostegui.json [production]
17:44 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2190.codfw.wmnet with reason: Maintenance [production]
17:44 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2177 (T410531)', diff saved to https://phabricator.wikimedia.org/P85538 and previous config saved to /var/cache/conftool/dbconfig/20251124-174437-marostegui.json [production]
17:29 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P85537 and previous config saved to /var/cache/conftool/dbconfig/20251124-172929-marostegui.json [production]
17:27 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
17:26 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
17:24 <urbanecm@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-experimental: apply [production]
17:23 <urbanecm@deploy2002> helmfile [codfw] START helmfile.d/services/mw-experimental: apply [production]
17:14 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2177', diff saved to https://phabricator.wikimedia.org/P85536 and previous config saved to /var/cache/conftool/dbconfig/20251124-171418-marostegui.json [production]
17:11 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1032.eqiad.wmnet with OS trixie [production]
17:10 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1030.eqiad.wmnet with OS trixie [production]
17:10 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1029.eqiad.wmnet with OS trixie [production]
17:09 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1028.eqiad.wmnet with OS trixie [production]
17:09 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1019.eqiad.wmnet [production]
17:03 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1019.eqiad.wmnet [production]
17:02 <btullis@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-worker1013.eqiad.wmnet [production]
17:01 <bking@cumin2002> START - Cookbook sre.hosts.reimage for host wdqs1032.eqiad.wmnet with OS trixie [production]
17:00 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1031.eqiad.wmnet with OS trixie [production]
16:59 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2177 (T410531)', diff saved to https://phabricator.wikimedia.org/P85535 and previous config saved to /var/cache/conftool/dbconfig/20251124-165910-marostegui.json [production]
16:56 <btullis@cumin1003> START - Cookbook sre.hosts.reboot-single for host dse-k8s-worker1013.eqiad.wmnet [production]
16:43 <jdrewniak@deploy2002> Synchronized portals: Wikimedia Portals Update: [[gerrit:1210618| Bumping portals to master (T128546)]] (duration: 01m 59s) [production]
16:41 <jdrewniak@deploy2002> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:1210618| Bumping portals to master (T128546)]] (duration: 08m 44s) [production]
16:36 <btullis@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-master1004.eqiad.wmnet,an-redacteddb1001.eqiad.wmnet,an-test-coord1001.eqiad.wmnet with reason: Prepping for switch swap [production]
16:34 <btullis@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on stat1011.eqiad.wmnet with reason: Prepping for switch swap [production]
16:34 <btullis@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-test-master1002.eqiad.wmnet with reason: Prepping for switch swap [production]
16:33 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2177 (T410531)', diff saved to https://phabricator.wikimedia.org/P85534 and previous config saved to /var/cache/conftool/dbconfig/20251124-163345-marostegui.json [production]
16:33 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2177.codfw.wmnet with reason: Maintenance [production]
16:33 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2156 (T410531)', diff saved to https://phabricator.wikimedia.org/P85533 and previous config saved to /var/cache/conftool/dbconfig/20251124-163320-marostegui.json [production]
16:33 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1169.eqiad.wmnet with OS bookworm [production]
16:32 <btullis@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on dse-k8s-worker[1011,1013,1019].eqiad.wmnet with reason: Prepping for switch swap [production]
16:30 <moritzm> installing usb.ids updates from Bookworm point release [production]
16:28 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 13m 52s) [production]
16:23 <urbanecm> Delete job/growthexperiments-refreshlinkrecommendations-s2-29399967 and job/growthexperiments-refreshlinkrecommendations-s3-29399607 (T407818) [production]
16:23 <jmm@dns1004> END - running authdns-update [production]
16:22 <jmm@dns1004> START - running authdns-update [production]
16:18 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2156', diff saved to https://phabricator.wikimedia.org/P85532 and previous config saved to /var/cache/conftool/dbconfig/20251124-161813-marostegui.json [production]
16:16 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply [production]
16:15 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply [production]
16:14 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
16:14 <hnowlan@deploy2002> helmfile [staging] DONE helmfile.d/services/thumbor: apply [production]
16:14 <hnowlan@deploy2002> helmfile [staging] START helmfile.d/services/thumbor: apply [production]