151-200 of 10000 results (24ms)
2026-04-16 §
14:56 <root@cumin2002> DONE (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 1:00:00 on mr1-codfw IPv6,mr1-codfw.oob,mr-codfw with reason: router upgrade [production]
14:56 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:56 <jelto@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host gerrit2002.wikimedia.org [production]
14:56 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
14:54 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
14:54 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
14:52 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
14:45 <jelto@cumin1003> START - Cookbook sre.hosts.reboot-single for host gerrit2002.wikimedia.org [production]
14:44 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:30 <wmftkbot> Test Kitchen mw-user experiment (poll 119279) - adds: none; removes: growthexperiments-revise-tone; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [analytics]
14:29 <jforrester@deploy1003> Finished scap sync-world: Backport for [[gerrit:1271895|mc: Use MCROUTER_SERVER values rather than local sidepod for WF cache (T423311)]] (duration: 09m 36s) [production]
14:25 <jforrester@deploy1003> jforrester: Continuing with sync [production]
14:25 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:21 <jforrester@deploy1003> jforrester: Backport for [[gerrit:1271895|mc: Use MCROUTER_SERVER values rather than local sidepod for WF cache (T423311)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:20 <larssandergreen> civicrm upgraded from 801847a7 to 90c0ccd9 [fundraising]
14:20 <jforrester@deploy1003> Started scap sync-world: Backport for [[gerrit:1271895|mc: Use MCROUTER_SERVER values rather than local sidepod for WF cache (T423311)]] [production]
14:19 <larssandergreen> tools upgraded from 9bff5f07 to f14a814e [fundraising]
14:18 <mlitn@deploy1003> Finished scap sync-world: Backport for [[gerrit:1272712|fix: add missing hook registration for create account stats (T422283)]] (duration: 06m 07s) [production]
14:18 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:15 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1248 (T419635)', diff saved to https://phabricator.wikimedia.org/P90945 and previous config saved to /var/cache/conftool/dbconfig/20260416-141515-fceratto.json [production]
14:15 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1248.eqiad.wmnet with reason: Maintenance [production]
14:14 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1247 (T419635)', diff saved to https://phabricator.wikimedia.org/P90944 and previous config saved to /var/cache/conftool/dbconfig/20260416-141450-fceratto.json [production]
14:14 <mlitn@deploy1003> mlitn, migr: Continuing with sync [production]
14:14 <mlitn@deploy1003> mlitn, migr: Backport for [[gerrit:1272712|fix: add missing hook registration for create account stats (T422283)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:12 <mlitn@deploy1003> Started scap sync-world: Backport for [[gerrit:1272712|fix: add missing hook registration for create account stats (T422283)]] [production]
14:07 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host testvm2002.codfw.wmnet with OS trixie [production]
14:05 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:04 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P90943 and previous config saved to /var/cache/conftool/dbconfig/20260416-140442-fceratto.json [production]
14:04 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:01 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:01 <mlitn@deploy1003> Finished scap sync-world: Backport for [[gerrit:1140748|siwikitionary: update logo to localised svg version. (T342173)]] (duration: 07m 11s) [production]
14:01 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
14:00 <dpogorzelski@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
13:58 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1245.eqiad.wmnet with reason: Maintenance [production]
13:57 <mlitn@deploy1003> mlitn, robertsky: Continuing with sync [production]
13:56 <mlitn@deploy1003> mlitn, robertsky: Backport for [[gerrit:1140748|siwikitionary: update logo to localised svg version. (T342173)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
13:55 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1210 (T419961)', diff saved to https://phabricator.wikimedia.org/P90942 and previous config saved to /var/cache/conftool/dbconfig/20260416-135549-fceratto.json [production]
13:54 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1247', diff saved to https://phabricator.wikimedia.org/P90941 and previous config saved to /var/cache/conftool/dbconfig/20260416-135434-fceratto.json [production]
13:54 <mlitn@deploy1003> Started scap sync-world: Backport for [[gerrit:1140748|siwikitionary: update logo to localised svg version. (T342173)]] [production]
13:54 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on testvm2002.codfw.wmnet with reason: host reimage [production]
13:51 <mlitn@deploy1003> Finished scap sync-world: Backport for [[gerrit:1272527|Squashed diff to master]] (duration: 30m 21s) [production]
13:51 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply [production]
13:49 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-debug: apply [production]
13:48 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on testvm2002.codfw.wmnet with reason: host reimage [production]
13:47 <wmftkbot> Test Kitchen mw-user experiment (poll 119153) - adds: ab-test-email-confirmation-banner; removes: none; fields: none - xLab/MPIC/TK tips at https://w.wiki/FwuD [analytics]
13:45 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1210', diff saved to https://phabricator.wikimedia.org/P90940 and previous config saved to /var/cache/conftool/dbconfig/20260416-134541-fceratto.json [production]
13:44 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1247 (T419635)', diff saved to https://phabricator.wikimedia.org/P90939 and previous config saved to /var/cache/conftool/dbconfig/20260416-134426-fceratto.json [production]
13:41 <urandom> decommissioning Cassandra [a,b] on aqs1010 — T412830 [production]
13:39 <mlitn@deploy1003> mlitn: Continuing with sync [production]
13:38 <mlitn@deploy1003> mlitn: Backport for [[gerrit:1272527|Squashed diff to master]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]