2026-06-03
09:22 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
09:21 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1297069|hCaptcha: Collect risk score for blocked account creations (T427784)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
09:21 <jiji@deploy1003> helmfile [aux-k8s-codfw] DONE helmfile.d/aux-k8s-services/redioscope: apply [production]
09:21 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool es2054: repool after upgrade [production]
09:21 <jiji@deploy1003> helmfile [aux-k8s-codfw] START helmfile.d/aux-k8s-services/redioscope: apply [production]
09:21 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1166', diff saved to https://phabricator.wikimedia.org/P93661 and previous config saved to /var/cache/conftool/dbconfig/20260603-092111-fceratto.json [production]
09:20 <ayounsi@cumin1003> START - Cookbook sre.dns.netbox [production]
09:19 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1297069|hCaptcha: Collect risk score for blocked account creations (T427784)]] [production]
09:14 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1297065|Revert^4 "hCaptcha: Load self-hosted secure-api.js on group0 wikis"]] (duration: 07m 06s) [production]
09:11 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1166', diff saved to https://phabricator.wikimedia.org/P93659 and previous config saved to /var/cache/conftool/dbconfig/20260603-091104-fceratto.json [production]
09:10 <kharlan@deploy1003> kharlan: Continuing with deployment [production]
09:09 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1297065|Revert^4 "hCaptcha: Load self-hosted secure-api.js on group0 wikis"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
09:07 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1297065|Revert^4 "hCaptcha: Load self-hosted secure-api.js on group0 wikis"]] [production]
09:06 <jiji@deploy1003> helmfile [codfw] DONE helmfile.d/services/ratelimit: apply [production]
09:05 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1297064|Revert^3 "hCaptcha: Load self-hosted secure-api.js on group0 wikis" (T403829)]] (duration: 10m 54s) [production]
09:05 <jiji@deploy1003> helmfile [codfw] START helmfile.d/services/ratelimit: apply [production]
09:04 <bwojtowicz@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'articletopic-outlink' for release 'main' . [production]
09:01 <ayounsi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "new eqiad/codfw public vlans - ayounsi@cumin1003 - T422043" [production]
09:00 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1166 (T426633)', diff saved to https://phabricator.wikimedia.org/P93656 and previous config saved to /var/cache/conftool/dbconfig/20260603-090056-fceratto.json [production]
09:00 <ayounsi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "new eqiad/codfw public vlans - ayounsi@cumin1003 - T422043" [production]
09:00 <ayounsi@cumin1003> END (ERROR) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=97) generate netbox hiera data: "new eqiad/codfw public vlans - ayounsi@cumin1003" [production]
09:00 <ayounsi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "new eqiad/codfw public vlans - ayounsi@cumin1003" [production]
08:59 <kharlan@deploy1003> kharlan: Continuing with deployment [production]
08:59 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1297064|Revert^3 "hCaptcha: Load self-hosted secure-api.js on group0 wikis" (T403829)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:55 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1297064|Revert^3 "hCaptcha: Load self-hosted secure-api.js on group0 wikis" (T403829)]] [production]
08:53 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1296635|Revert^2 "hCaptcha: Load self-hosted secure-api.js on group0 wikis" (T403829)]] (duration: 11m 43s) [production]
08:52 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool db2215: Migration of db2215.codfw.wmnet completed [production]
08:52 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet [production]
08:52 <cwilliams@cumin1003> START - Cookbook sre.hosts.remove-downtime for clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet [production]
08:51 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for clouddb[1022-1023].eqiad.wmnet [production]
08:51 <cwilliams@cumin1003> START - Cookbook sre.hosts.remove-downtime for clouddb[1022-1023].eqiad.wmnet [production]
08:50 <kharlan@deploy1003> kharlan: Rolling back deployment [production]
08:48 <fceratto@cumin1003> dbctl commit (dc=all): 'Depooling db1166 (T426633)', diff saved to https://phabricator.wikimedia.org/P93652 and previous config saved to /var/cache/conftool/dbconfig/20260603-084846-fceratto.json [production]
08:48 <fceratto@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1166.eqiad.wmnet with reason: Maintenance [production]
08:48 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1157 (T426633)', diff saved to https://phabricator.wikimedia.org/P93651 and previous config saved to /var/cache/conftool/dbconfig/20260603-084819-fceratto.json [production]
08:47 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1296635|Revert^2 "hCaptcha: Load self-hosted secure-api.js on group0 wikis" (T403829)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
08:45 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2215.codfw.wmnet with OS trixie [production]
08:45 <jiji@cumin1003> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) check docker-registry: maintenance [production]
08:45 <jiji@cumin1003> START - Cookbook sre.discovery.service-route check docker-registry: maintenance [production]
08:43 <cwilliams@cumin1003> START - Cookbook sre.mysql.pool pool db1211: Migration of db1211.eqiad.wmnet completed [production]
08:41 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1296635|Revert^2 "hCaptcha: Load self-hosted secure-api.js on group0 wikis" (T403829)]] [production]
08:41 <cwilliams@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1211.eqiad.wmnet with OS trixie [production]
08:38 <fceratto@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1157', diff saved to https://phabricator.wikimedia.org/P93649 and previous config saved to /var/cache/conftool/dbconfig/20260603-083811-fceratto.json [production]
08:37 <mszwarc@deploy1003> Finished scap sync-world: Backport for [[gerrit:1296632|Image Browsing: add accessible labels to carousel elements (T407793)]] (duration: 32m 11s) [production]
08:36 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool es2054: repool after upgrade [production]
08:35 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.pool (exit_code=99) pool es2054.codfw.wmnet: After reimage [production]
08:35 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool es2054.codfw.wmnet: After reimage [production]
08:35 <jiji@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
08:34 <marostegui@cumin1003> END (FAIL) - Cookbook sre.mysql.major-upgrade (exit_code=99) [production]
08:34 <jiji@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]