151-200 of 10000 results (16ms)
2025-12-04 ยง
17:06 <brett@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs1019.eqiad.wmnet with reason: move primary uplink from move primary uplink from asw2-c7-eqiad to lsw1-c7-eqiad and remove link to asw2-d2-eqiad - T405628 [production]
15:55 <hashar@deploy2002> Finished deploy [gerrit/gerrit@121bd1c]: Remove duplicate [DISMISS] button (duration: 00m 11s) [production]
15:55 <hashar@deploy2002> Started deploy [gerrit/gerrit@121bd1c]: Remove duplicate [DISMISS] button [production]
15:51 <dpogorzelski@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on ml-lab1001.eqiad.wmnet with reason: decomission [production]
15:50 <dpogorzelski@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on ml-lab1001.eqiad.wmnet with reason: decomission [production]
15:50 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host conf2005.codfw.wmnet [production]
15:48 <andrew@cloudcumin1001> END (FAIL) - Cookbook wmcs.toolforge.k8s.etcd.depool_and_remove_node (exit_code=99) [toolsbeta]
15:45 <bking@cumin2002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host dse-k8s-worker2003.codfw.wmnet [production]
15:45 <bking@cumin2002> START - Cookbook sre.k8s.pool-depool-node pool for host dse-k8s-worker2003.codfw.wmnet [production]
15:44 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host conf2005.codfw.wmnet [production]
15:43 <hashar@deploy2002> Finished deploy [gerrit/gerrit@774e2ff]: Ease configuration of the motd banner && Add banner for the 2025 developer survey (duration: 00m 15s) [production]
15:43 <hashar@deploy2002> Started deploy [gerrit/gerrit@774e2ff]: Ease configuration of the motd banner && Add banner for the 2025 developer survey [production]
15:42 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.etcd.depool_and_remove_node (T361237) [toolsbeta]
15:41 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host conf2004.codfw.wmnet [production]
15:41 <andrewbogott> deleting toolsbeta-test-k8s-etcd-27 and replacing with a Bullseye node for cluster consistency T361237 [toolsbeta]
15:38 <bking@deploy2002> helmfile [dse-k8s-codfw] DONE helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
15:38 <bking@deploy2002> helmfile [dse-k8s-codfw] START helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
15:36 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host conf2004.codfw.wmnet [production]
15:35 <bking@cumin2002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host dse-k8s-worker2003.codfw.wmnet [production]
15:33 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host conf1009.eqiad.wmnet [production]
15:30 <bking@cumin2002> START - Cookbook sre.k8s.pool-depool-node depool for host dse-k8s-worker2003.codfw.wmnet [production]
15:28 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host conf1009.eqiad.wmnet [production]
15:26 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host conf1008.eqiad.wmnet [production]
15:20 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host conf1008.eqiad.wmnet [production]
15:15 <jmm@cumin2002> END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host conf1007.eqiad.wmnet [production]
15:09 <jmm@cumin2002> START - Cookbook sre.puppet.migrate-host for host conf1007.eqiad.wmnet [production]
15:08 <cgoubert@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
15:06 <bking@deploy2002> helmfile [dse-k8s-codfw] DONE helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
15:06 <bking@deploy2002> helmfile [dse-k8s-codfw] START helmfile.d/dse-k8s-services/opensearch-ipoid-test: apply [production]
15:06 <cgoubert@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
15:05 <cgoubert@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
15:03 <Lucas_WMDE> UTC afternoon backport+config window done [production]
15:03 <cgoubert@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
15:03 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
15:02 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
15:02 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
15:02 <ladsgroup@deploy2002> Finished scap sync-world: Backport for [[gerrit:1215164|RevisionStore: Catch ParameterAssertionException too (T351953)]] (duration: 09m 26s) [production]
15:01 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
14:59 <cgoubert@deploy2002> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
14:59 <cgoubert@deploy2002> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
14:59 <cgoubert@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
14:58 <cgoubert@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
14:55 <ladsgroup@deploy2002> jforrester, ladsgroup: Continuing with sync [production]
14:54 <ladsgroup@deploy2002> jforrester, ladsgroup: Backport for [[gerrit:1215164|RevisionStore: Catch ParameterAssertionException too (T351953)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:52 <ladsgroup@deploy2002> Started scap sync-world: Backport for [[gerrit:1215164|RevisionStore: Catch ParameterAssertionException too (T351953)]] [production]
14:50 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/mw-cron: apply [production]
14:49 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-cron: apply [production]
14:37 <derick@deploy2002> Finished scap sync-world: Backport for [[gerrit:1215165|Revert "User: Log where the data was loaded when CAS update failed" (T410652)]], [[gerrit:1215166|Revert "User: Log where the data was loaded when CAS update failed" (T410652)]], [[gerrit:1215167|Fetch user object from primary DB (for writes) not replica DB (T410652)]] (duration: 13m 24s) [production]
14:27 <derick@deploy2002> d3r1ck01, derick: Continuing with sync [production]
14:26 <derick@deploy2002> d3r1ck01, derick: Backport for [[gerrit:1215165|Revert "User: Log where the data was loaded when CAS update failed" (T410652)]], [[gerrit:1215166|Revert "User: Log where the data was loaded when CAS update failed" (T410652)]], [[gerrit:1215167|Fetch user object from primary DB (for writes) not replica DB (T410652)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes [production]