151-200 of 10000 results (93ms)
2025-09-30 ยง
15:04 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab1004.eqiad.wmnet with reason: phab deploy [production]
15:04 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on phab2002.codfw.wmnet with reason: phab deploy [production]
15:03 <urbanecm@deploy2002> helmfile [codfw] START helmfile.d/services/mw-experimental: apply [production]
15:03 <dzahn@cumin2002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 0:30:00 on phab.wmfusercontent.org with reason: version upgrade [production]
15:02 <brouberol@deploy2002> helmfile [dse-k8s-codfw] DONE helmfile.d/dse-k8s-services/airflow-main: apply [production]
15:02 <brouberol@deploy2002> helmfile [dse-k8s-codfw] START helmfile.d/dse-k8s-services/airflow-main: apply [production]
14:59 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:58 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:58 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-main: apply [production]
14:58 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-main: apply [production]
14:57 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:49 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P83486 and previous config saved to /var/cache/conftool/dbconfig/20250930-144940-fceratto.json [production]
14:48 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
14:45 <dancy@deploy2002> Installation of scap version "4.213.0" completed for 2 hosts [production]
14:43 <dancy@deploy2002> Installing scap version "4.213.0" for 2 host(s) [production]
14:42 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:41 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:41 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:40 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:40 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:40 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:39 <bking@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:38 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:34 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159', diff saved to https://phabricator.wikimedia.org/P83485 and previous config saved to /var/cache/conftool/dbconfig/20250930-143433-fceratto.json [production]
14:30 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:29 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:29 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:29 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:29 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['wdqs2017.codfw.wmnet'] [production]
14:27 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
14:27 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host wdqs2016.codfw.wmnet [production]
14:26 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts wdqs2016.codfw.wmnet [production]
14:25 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:19 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1159 (T401906)', diff saved to https://phabricator.wikimedia.org/P83484 and previous config saved to /var/cache/conftool/dbconfig/20250930-141925-fceratto.json [production]
14:18 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1159 (T401906)', diff saved to https://phabricator.wikimedia.org/P83483 and previous config saved to /var/cache/conftool/dbconfig/20250930-141816-fceratto.json [production]
14:18 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1159.eqiad.wmnet with reason: Maintenance [production]
14:15 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
13:55 <Lucas_WMDE> UTC afternoon backport+config window done [production]
13:53 <lucaswerkmeister-wmde@deploy2002> Finished scap sync-world: Backport for [[gerrit:1187779|session: Enable MultiBackendSessionStore on `group1` wikis (T402808)]] (duration: 14m 40s) [production]
13:48 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, d3r1ck01: Continuing with sync [production]
13:47 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dbprov1007.eqiad.wmnet with OS bookworm [production]
13:45 <lucaswerkmeister-wmde@deploy2002> lucaswerkmeister-wmde, d3r1ck01: Backport for [[gerrit:1187779|session: Enable MultiBackendSessionStore on `group1` wikis (T402808)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:38 <lucaswerkmeister-wmde@deploy2002> Started scap sync-world: Backport for [[gerrit:1187779|session: Enable MultiBackendSessionStore on `group1` wikis (T402808)]] [production]
13:36 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
13:33 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-main: apply [production]
13:33 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-main: apply [production]
13:33 <jforrester@deploy2002> Finished scap sync-world: Backport for [[gerrit:1192303|Wikifunctions clients: Enable rich text (HTML) output in embedded calls (T397402)]] (duration: 12m 15s) [production]
13:28 <jforrester@deploy2002> jforrester: Continuing with sync [production]
13:27 <jforrester@deploy2002> jforrester: Backport for [[gerrit:1192303|Wikifunctions clients: Enable rich text (HTML) output in embedded calls (T397402)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:25 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]