601-650 of 10000 results (20ms)
2026-06-03 ยง
08:52 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet [production]
08:52 <cwilliams@cumin1003> START - Cookbook sre.hosts.remove-downtime for clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet [production]
08:51 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for clouddb[1022-1023].eqiad.wmnet [production]
08:51 <cwilliams@cumin1003> START - Cookbook sre.hosts.remove-downtime for clouddb[1022-1023].eqiad.wmnet [production]
08:50 <kharlan@deploy1003> kharlan: Rolling back deployment [production]
08:48 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1166 (T426633)', diff saved to https://phabricator.wikimedia.org/P93652 and previous config saved to /var/cache/conftool/dbconfig/20260603-084846-fceratto.json [production]
08:48 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1166.eqiad.wmnet with reason: Maintenance [production]
08:48 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1157 (T426633)', diff saved to https://phabricator.wikimedia.org/P93651 and previous config saved to /var/cache/conftool/dbconfig/20260603-084819-fceratto.json [production]
08:47 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1296635|Revert^2 "hCaptcha: Load self-hosted secure-api.js on group0 wikis" (T403829)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:45 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2215.codfw.wmnet with OS trixie [production]
08:45 <jiji@cumin1003> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) check docker-registry: maintenance [production]
08:45 <jiji@cumin1003> START - Cookbook sre.discovery.service-route check docker-registry: maintenance [production]
08:43 <cwilliams@cumin1003> START - Cookbook sre.mysql.pool pool db1211: Migration of db1211.eqiad.wmnet completed [production]
08:41 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1296635|Revert^2 "hCaptcha: Load self-hosted secure-api.js on group0 wikis" (T403829)]] [production]
08:41 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1211.eqiad.wmnet with OS trixie [production]
08:38 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1157', diff saved to https://phabricator.wikimedia.org/P93649 and previous config saved to /var/cache/conftool/dbconfig/20260603-083811-fceratto.json [production]
08:37 <mszwarc@deploy1003> Finished scap sync-world: Backport for [[gerrit:1296632|Image Browsing: add accessible labels to carousel elements (T407793)]] (duration: 32m 11s) [production]
08:36 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool es2054: repool after upgrade [production]
08:35 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.pool (exit_code=99) pool es2054.codfw.wmnet: After reimage [production]
08:35 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool es2054.codfw.wmnet: After reimage [production]
08:35 <jiji@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
08:34 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.major-upgrade (exit_code=99) [production]
08:34 <jiji@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
08:33 <jiji@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
08:33 <jiji@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
08:31 <jiji@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
08:31 <jiji@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
08:31 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host es2054.codfw.wmnet with OS trixie [production]
08:30 <jiji@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
08:29 <jiji@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/admin 'apply'. [production]
08:28 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2215.codfw.wmnet with reason: host reimage [production]
08:28 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1157', diff saved to https://phabricator.wikimedia.org/P93647 and previous config saved to /var/cache/conftool/dbconfig/20260603-082804-fceratto.json [production]
08:26 <hashar> Reloaded Zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/1296559 "inference-services: Add LLM generated editing suggestions CI/CD pipelines." # T427794 [releng]
08:25 <mszwarc@deploy1003> mlitn, mszwarc: Continuing with deployment [production]
08:24 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1211.eqiad.wmnet with reason: host reimage [production]
08:23 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool es1049: repool after upgrade [production]
08:22 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db2215.codfw.wmnet with reason: host reimage [production]
08:22 <mszwarc@deploy1003> mlitn, mszwarc: Backport for [[gerrit:1296632|Image Browsing: add accessible labels to carousel elements (T407793)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:18 <cwilliams@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1211.eqiad.wmnet with reason: host reimage [production]
08:18 <jiji@deploy1003> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
08:17 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1157 (T426633)', diff saved to https://phabricator.wikimedia.org/P93645 and previous config saved to /var/cache/conftool/dbconfig/20260603-081756-fceratto.json [production]
08:17 <jiji@deploy1003> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
08:17 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
08:16 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
08:14 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2054.codfw.wmnet with reason: host reimage [production]
08:08 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on es2054.codfw.wmnet with reason: host reimage [production]
08:05 <mszwarc@deploy1003> Started scap sync-world: Backport for [[gerrit:1296632|Image Browsing: add accessible labels to carousel elements (T407793)]] [production]
08:03 <mszwarc@deploy1003> Finished scap sync-world: Backport for [[gerrit:1296580|Add kha to wmgExtraLanguageNames (T427917)]], [[gerrit:1296703|jawiki: lift IP caps for workshop (T427912)]], [[gerrit:1296713|conductwiki: add sitename and logo (T426984 T427541)]], [[gerrit:1296627|Add missing lazy img to carousel (T427821)]], [[gerrit:1295968|MultimediaViewer: enable image carousel as a beta feature on Wikipedias (T426799)] [production]
08:03 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1157 (T426633)', diff saved to https://phabricator.wikimedia.org/P93643 and previous config saved to /var/cache/conftool/dbconfig/20260603-080346-fceratto.json [production]
08:03 <cwilliams@cumin1003> START - Cookbook sre.hosts.reimage for host db1211.eqiad.wmnet with OS trixie [production]