101-150 of 10000 results (143ms)
2026-06-04 ยง
09:54 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on es2057.codfw.wmnet with reason: host reimage [production]
09:53 <ozge@deploy1003> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
09:49 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on es2057.codfw.wmnet with reason: host reimage [production]
09:39 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.major-upgrade (exit_code=0) [production]
09:39 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1224: Migration of db1224.eqiad.wmnet completed [production]
09:38 <brouberol@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/dse-k8s-services/kafka-ui: apply [production]
09:37 <brouberol@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/dse-k8s-services/kafka-ui: apply [production]
09:36 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/kafka-ui: apply [production]
09:35 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/kafka-ui: apply [production]
09:33 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host es2057.codfw.wmnet with OS trixie [production]
09:32 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool es2057: Upgrading es2057.codfw.wmnet [production]
09:32 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool es2057: Upgrading es2057.codfw.wmnet [production]
09:31 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
09:26 <Dreamy_Jazz> Running `mwscript-k8s extensions/MediaModeration/maintenance/scanFilesInScanTable.php --wiki="commonswiki" --use-jobqueue --poll-sleep=30 --sleep=60 --verbose` [production]
09:25 <Dreamy_Jazz> Running `/usr/local/bin/foreachwikiindblist "group0.dblist + group1.dblist - mediamoderation-continuous-scan.dblist" extensions/MediaModeration/maintenance/scanFilesInScanTable.php --use-jobqueue --sleep=1 --poll-sleep=10 --verbose` [production]
08:54 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.hiddenparma (exit_code=0) Hiddenparma deployment to the alerting hosts with reason: "Introduce pluggable authentication - oblivian@cumin1003" [production]
08:54 <oblivian@cumin1003> END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) hiddenparma to alert[1002,2002].wikimedia.org with reason: Introduce pluggable authentication - oblivian@cumin1003 [production]
08:53 <marostegui@cumin1003> START - Cookbook sre.mysql.pool pool db1224: Migration of db1224.eqiad.wmnet completed [production]
08:53 <oblivian@cumin1003> START - Cookbook sre.deploy.python-code hiddenparma to alert[1002,2002].wikimedia.org with reason: Introduce pluggable authentication - oblivian@cumin1003 [production]
08:53 <oblivian@cumin1003> START - Cookbook sre.deploy.hiddenparma Hiddenparma deployment to the alerting hosts with reason: "Introduce pluggable authentication - oblivian@cumin1003" [production]
08:29 <daniel@deploy1003> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
08:29 <daniel@deploy1003> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
08:24 <daniel@deploy1003> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
08:24 <daniel@deploy1003> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
08:21 <daniel@deploy1003> helmfile [staging] DONE helmfile.d/services/rest-gateway: apply [production]
08:21 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1224.eqiad.wmnet with OS trixie [production]
08:21 <daniel@deploy1003> helmfile [staging] START helmfile.d/services/rest-gateway: apply [production]
08:04 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1224.eqiad.wmnet with reason: host reimage [production]
08:02 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2249.codfw.wmnet with reason: upgrade [production]
08:00 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1224.eqiad.wmnet with reason: host reimage [production]
07:53 <marostegui> Install mariadb 10.11.17 on db2249 T427345 [production]
07:43 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host db1224.eqiad.wmnet with OS trixie [production]
07:42 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1224: Upgrading db1224.eqiad.wmnet [production]
07:41 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool db1224: Upgrading db1224.eqiad.wmnet [production]
07:41 <marostegui@cumin1003> START - Cookbook sre.mysql.major-upgrade [production]
07:39 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.major-upgrade (exit_code=0) [production]
07:39 <cwilliams@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1255: Migration of db1255.eqiad.wmnet completed [production]
07:34 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1297536|hCaptcha risk scores: VE plugin to collect risk scores for block notices (T426943)]], [[gerrit:1297200|hCaptcha: Render a fresh mobile widget for each captcha attempt (T425929)]], [[gerrit:1297173|hCaptcha: Enable risk-score collection for users blocked by IP blocks (T424629)]] (duration: 08m 56s) [production]
07:29 <kharlan@deploy1003> kharlan, harroyo-wmf: Continuing with deployment [production]
07:27 <kharlan@deploy1003> kharlan, harroyo-wmf: Backport for [[gerrit:1297536|hCaptcha risk scores: VE plugin to collect risk scores for block notices (T426943)]], [[gerrit:1297200|hCaptcha: Render a fresh mobile widget for each captcha attempt (T425929)]], [[gerrit:1297173|hCaptcha: Enable risk-score collection for users blocked by IP blocks (T424629)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwd [production]
07:25 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1297536|hCaptcha risk scores: VE plugin to collect risk scores for block notices (T426943)]], [[gerrit:1297200|hCaptcha: Render a fresh mobile widget for each captcha attempt (T425929)]], [[gerrit:1297173|hCaptcha: Enable risk-score collection for users blocked by IP blocks (T424629)]] [production]
07:24 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.major-upgrade (exit_code=0) [production]
07:24 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db2191: Migration of db2191.codfw.wmnet completed [production]
07:12 <kharlan@deploy1003> Finished scap sync-world: Backport for [[gerrit:1297550|Revert "EventStreamConfig - webrequest.dumps.dev0 - enable canary events for hive ingestion"]] (duration: 06m 45s) [production]
07:08 <kharlan@deploy1003> kharlan: Continuing with deployment [production]
07:08 <kharlan@deploy1003> kharlan: Backport for [[gerrit:1297550|Revert "EventStreamConfig - webrequest.dumps.dev0 - enable canary events for hive ingestion"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
07:06 <kharlan@deploy1003> Started scap sync-world: Backport for [[gerrit:1297550|Revert "EventStreamConfig - webrequest.dumps.dev0 - enable canary events for hive ingestion"]] [production]
07:04 <otto@deploy1003> Finished scap sync-world: Backport for [[gerrit:1297260|EventStreamConfig - webrequest.dumps.dev0 - enable canary events for hive ingestion (T425087)]] (duration: 399m 30s) [production]
07:03 <otto@deploy1003> otto: Rolling back deployment [production]
06:53 <cwilliams@cumin1003> START - Cookbook sre.mysql.pool pool db1255: Migration of db1255.eqiad.wmnet completed [production]