2026-02-18
14:09 <mszwarc@deploy2002> anzx, mszwarc: Backport for [[gerrit:1240277|ruwikisource: EnableProtectionIndicators (T417590)]], [[gerrit:1240270|Add '(oathauth-recover-for-user)' to 'wmf-supportsafety' (T415883)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
14:07 <mszwarc@deploy2002> Started scap sync-world: Backport for [[gerrit:1240277|ruwikisource: EnableProtectionIndicators (T417590)]], [[gerrit:1240270|Add '(oathauth-recover-for-user)' to 'wmf-supportsafety' (T415883)]] [production]
13:24 <arnaudb@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on gerrit1003.wikimedia.org with reason: T417246 [production]
12:52 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1028.eqiad.wmnet with OS bookworm [production]
12:45 <jayme@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host kubestage1003.eqiad.wmnet [production]
12:45 <jayme@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host kubestage1003.eqiad.wmnet [production]
12:43 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubestage1003.eqiad.wmnet with OS trixie [production]
12:24 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestage1003.eqiad.wmnet with reason: host reimage [production]
12:16 <jayme@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestage1003.eqiad.wmnet with reason: host reimage [production]
12:12 <logmsgbot> dreamyjazz Deployed security patch for T411366 [production]
12:06 <logmsgbot> dreamyjazz Deployed security patch for T411366 [production]
12:01 <jayme@cumin1003> START - Cookbook sre.hosts.reimage for host kubestage1003.eqiad.wmnet with OS trixie [production]
11:59 <jayme@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host kubestage1003.eqiad.wmnet [production]
11:57 <vgutierrez> upload golang-github-intel-go-cpuid 0.0~git20210602.5747e5c-2+deb13u1 to trixie-wikimedia (apt.wm.o) - T401832 [production]
11:54 <jayme@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host kubestage1003.eqiad.wmnet [production]
11:49 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
11:48 <brouberol@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
11:19 <arnaudb@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
11:14 <arnaudb@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
11:12 <kevinbazira@deploy2002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
11:09 <kevinbazira@deploy2002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
11:07 <fabfur@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "New scope bots - fabfur@cumin1003" [production]
11:07 <fabfur@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: New scope bots - fabfur@cumin1003 [production]
11:06 <fabfur@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: New scope bots - fabfur@cumin1003 [production]
11:06 <fabfur@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "New scope bots - fabfur@cumin1003" [production]
10:56 <arnaudb@cumin1003> START - Cookbook sre.hosts.reimage for host gerrit1003.wikimedia.org with OS bookworm [production]
10:51 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea] (thin): Regular analytics weekly train THIN [analytics/refinery@28fa1eac] (duration: 01m 56s) [production]
10:49 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea] (thin): Regular analytics weekly train THIN [analytics/refinery@28fa1eac] [production]
10:49 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea]: Regular analytics weekly train [analytics/refinery@28fa1eac] (duration: 04m 06s) [production]
10:44 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea]: Regular analytics weekly train [analytics/refinery@28fa1eac] [production]
10:44 <joal@deploy2002> Finished deploy [analytics/refinery@28fa1ea] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28fa1eac] (duration: 01m 57s) [production]
10:42 <joal@deploy2002> Started deploy [analytics/refinery@28fa1ea] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@28fa1eac] [production]
10:41 <arnaudb@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host gerrit1003.wikimedia.org with OS bookworm [production]
10:26 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cumin2003.codfw.wmnet [production]
10:20 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host cumin2003.codfw.wmnet [production]
09:53 <arnaudb@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
09:46 <arnaudb@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on gerrit1003.wikimedia.org with reason: host reimage [production]
09:28 <arnaudb@cumin1003> START - Cookbook sre.hosts.reimage for host gerrit1003.wikimedia.org with OS bookworm [production]
08:58 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1029.eqiad.wmnet with OS trixie [production]
08:38 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1029.eqiad.wmnet with reason: host reimage [production]
08:32 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1029.eqiad.wmnet with reason: host reimage [production]
08:19 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host dbproxy1029.eqiad.wmnet with OS trixie [production]
05:32 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db2179 (T415786)', diff saved to https://phabricator.wikimedia.org/P88861 and previous config saved to /var/cache/conftool/dbconfig/20260218-053229-marostegui.json [production]
05:32 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2179.codfw.wmnet with reason: Maintenance [production]
05:32 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2172 (T415786)', diff saved to https://phabricator.wikimedia.org/P88860 and previous config saved to /var/cache/conftool/dbconfig/20260218-053204-marostegui.json [production]
05:28 <kart_> Updated cxserver to 2026-01-20-115813-production (T415038, T415046, T414558) [production]
05:25 <kartik@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cxserver: apply [production]
05:25 <kartik@deploy2002> helmfile [eqiad] START helmfile.d/services/cxserver: apply [production]
05:24 <kartik@deploy2002> helmfile [codfw] DONE helmfile.d/services/cxserver: apply [production]
05:24 <kartik@deploy2002> helmfile [codfw] START helmfile.d/services/cxserver: apply [production]