2026-02-18
15:08 <jforrester@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
15:02 <jmm@deploy2002> helmfile [staging] DONE helmfile.d/services/proton: apply [production]
14:59 <jmm@deploy2002> helmfile [staging] START helmfile.d/services/proton: apply [production]
14:54 <vgutierrez> uploaded tcp-mss-clamper 0.6+deb13u1 to trixie-wikimedia (apt.wm.o) - T401832 [production]
14:48 <vgutierrez> upload golang-gitlab-wikimedia-sre-qemutest-dev 0.1.0+deb13u1 to trixie-wikimedia (apt.wm.o) - T401832 [production]
14:39 <vgutierrez> upload golang-github-u-root-u-root 0.12.0-1 to trixie-wikimedia (apt.wm.o) - T401832 [production]
14:15 <mszwarc@deploy2002> Finished scap sync-world: Backport for [[gerrit:1240277|ruwikisource: EnableProtectionIndicators (T417590)]], [[gerrit:1240270|Add '(oathauth-recover-for-user)' to 'wmf-supportsafety' (T415883)]] (duration: 08m 10s) [production]
14:11 <mszwarc@deploy2002> anzx, mszwarc: Continuing with sync [production]
14:09 <mszwarc@deploy2002> anzx, mszwarc: Backport for [[gerrit:1240277|ruwikisource: EnableProtectionIndicators (T417590)]], [[gerrit:1240270|Add '(oathauth-recover-for-user)' to 'wmf-supportsafety' (T415883)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:07 <mszwarc@deploy2002> Started scap sync-world: Backport for [[gerrit:1240277|ruwikisource: EnableProtectionIndicators (T417590)]], [[gerrit:1240270|Add '(oathauth-recover-for-user)' to 'wmf-supportsafety' (T415883)]] [production]
13:24 <arnaudb@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on gerrit1003.wikimedia.org with reason: T417246 [production]
12:52 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1028.eqiad.wmnet with OS bookworm [production]
12:45 <jayme@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host kubestage1003.eqiad.wmnet [production]
12:45 <jayme@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host kubestage1003.eqiad.wmnet [production]
12:43 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestage1003.eqiad.wmnet with OS trixie [production]
12:24 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestage1003.eqiad.wmnet with reason: host reimage [production]
12:16 <jayme@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestage1003.eqiad.wmnet with reason: host reimage [production]
12:12 <logmsgbot> dreamyjazz Deployed security patch for T411366 [production]
12:06 <logmsgbot> dreamyjazz Deployed security patch for T411366 [production]
12:01 <jayme@cumin1003> START - Cookbook sre.hosts.reimage for host kubestage1003.eqiad.wmnet with OS trixie [production]
11:59 <jayme@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host kubestage1003.eqiad.wmnet [production]
11:57 <vgutierrez> upload golang-github-intel-go-cpuid 0.0~git20210602.5747e5c-2+deb13u1 to trixie-wikimedia (apt.wm.o) - T401832 [production]
11:54 <jayme@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host kubestage1003.eqiad.wmnet [production]
11:49 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
11:48 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
11:19 <arnaudb@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
11:14 <arnaudb@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
11:12 <kevinbazira@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
11:09 <kevinbazira@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
11:07 <fabfur@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "New scope bots - fabfur@cumin1003" [production]
11:07 <fabfur@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: New scope bots - fabfur@cumin1003 [production]
11:06 <fabfur@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: New scope bots - fabfur@cumin1003 [production]
11:06 <fabfur@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "New scope bots - fabfur@cumin1003" [production]
10:56 <arnaudb@cumin1003> START - Cookbook sre.hosts.reimage for host gerrit1003.wikimedia.org with OS bookworm [production]
10:51 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea] (thin): Regular analytics weekly train THIN [analytics/refinery@28fa1eac] (duration: 01m 56s) [production]
10:49 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea] (thin): Regular analytics weekly train THIN [analytics/refinery@28fa1eac] [production]
10:49 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea]: Regular analytics weekly train [analytics/refinery@28fa1eac] (duration: 04m 06s) [production]
10:44 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea]: Regular analytics weekly train [analytics/refinery@28fa1eac] [production]
10:44 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28fa1eac] (duration: 01m 57s) [production]
10:42 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28fa1eac] [production]
10:41 <arnaudb@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host gerrit1003.wikimedia.org with OS bookworm [production]
10:26 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cumin2003.codfw.wmnet [production]
10:20 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host cumin2003.codfw.wmnet [production]
09:53 <arnaudb@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
09:46 <arnaudb@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
09:28 <arnaudb@cumin1003> START - Cookbook sre.hosts.reimage for host gerrit1003.wikimedia.org with OS bookworm [production]
08:58 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1029.eqiad.wmnet with OS trixie [production]
08:38 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1029.eqiad.wmnet with reason: host reimage [production]
08:32 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1029.eqiad.wmnet with reason: host reimage [production]
08:19 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host dbproxy1029.eqiad.wmnet with OS trixie [production]