|
2026-03-25
ยง
|
| 14:00 |
<blake@cumin1003> |
START - Cookbook sre.switchdc.mediawiki.00-lock-scap for datacenter switchover from codfw to eqiad |
[production] |
| 13:49 |
<otto@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1260091|EventStreamConfig - Increase spark_job_ingestion_scale for larger event streams (T360794 T351225)]] (duration: 07m 48s) |
[production] |
| 13:45 |
<otto@deploy2002> |
otto: Continuing with sync |
[production] |
| 13:45 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
| 13:44 |
<otto@deploy2002> |
otto: Backport for [[gerrit:1260091|EventStreamConfig - Increase spark_job_ingestion_scale for larger event streams (T360794 T351225)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 13:42 |
<otto@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1260091|EventStreamConfig - Increase spark_job_ingestion_scale for larger event streams (T360794 T351225)]] |
[production] |
| 13:32 |
<awight@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1260614|[beta] Kill synthetic refs with feature flag (T421055)]], [[gerrit:1251193|idwiki: Remove unused user groups on Indonesian Wikipedia (T419105)]], [[gerrit:1251200|ptwiki: Enable block action for the abuse filter (T419312)]], [[gerrit:1256748|ptwiki: Add suppressredirect to autoreviewer and rollbacker user groups (T420704)]] (duration: 11m 33s) |
[production] |
| 13:29 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
| 13:29 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 13:27 |
<awight@deploy2002> |
codenamenoreste, awight, gerrit-patch-uploader: Continuing with sync |
[production] |
| 13:23 |
<awight@deploy2002> |
codenamenoreste, awight, gerrit-patch-uploader: Backport for [[gerrit:1260614|[beta] Kill synthetic refs with feature flag (T421055)]], [[gerrit:1251193|idwiki: Remove unused user groups on Indonesian Wikipedia (T419105)]], [[gerrit:1251200|ptwiki: Enable block action for the abuse filter (T419312)]], [[gerrit:1256748|ptwiki: Add suppressredirect to autoreviewer and rollbacker user groups (T420704)] |
[production] |
| 13:20 |
<awight@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1260614|[beta] Kill synthetic refs with feature flag (T421055)]], [[gerrit:1251193|idwiki: Remove unused user groups on Indonesian Wikipedia (T419105)]], [[gerrit:1251200|ptwiki: Enable block action for the abuse filter (T419312)]], [[gerrit:1256748|ptwiki: Add suppressredirect to autoreviewer and rollbacker user groups (T420704)]] |
[production] |
| 13:17 |
<dcausse@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1260045|Revert^2 "search: use the discovery ns record for the semanticsearch cluster"]] (duration: 10m 20s) |
[production] |
| 13:12 |
<dcausse@deploy2002> |
dcausse: Continuing with sync |
[production] |
| 13:09 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
| 13:09 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 13:09 |
<dcausse@deploy2002> |
dcausse: Backport for [[gerrit:1260045|Revert^2 "search: use the discovery ns record for the semanticsearch cluster"]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 13:07 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
| 13:06 |
<dcausse@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1260045|Revert^2 "search: use the discovery ns record for the semanticsearch cluster"]] |
[production] |
| 13:06 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 13:02 |
<XioNoX> |
Inter.Link - DDoS - Activation of automatic reroute |
[production] |
| 12:56 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. |
[production] |
| 12:55 |
<dpogorzelski@deploy2002> |
helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. |
[production] |
| 12:51 |
<marostegui@cumin1003> |
conftool action : set/pooled=yes; selector: name=clouddb1022.eqiad.wmnet,service=s3 |
[production] |
| 12:43 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on clouddb1022.eqiad.wmnet with reason: Downgrade clouddb1022 to 10.11.15 |
[production] |
| 12:41 |
<marostegui@cumin1003> |
conftool action : set/pooled=no; selector: name=clouddb1022.eqiad.wmnet,service=s3 |
[production] |
| 12:40 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-mariadb1002.eqiad.wmnet |
[production] |
| 12:40 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-test-coord1002.eqiad.wmnet |
[production] |
| 12:38 |
<mszwarc@deploy2002> |
mwscript-k8s job started: foreachwikiindblist all demoteIneligibleUsers.php --relay-log checkuser=metawiki --relay-log suppress=metawiki # T418580 |
[production] |
| 12:34 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host an-test-coord1002.eqiad.wmnet |
[production] |
| 12:34 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host an-mariadb1002.eqiad.wmnet |
[production] |
| 12:33 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
| 12:32 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wdqs1028.eqiad.wmnet |
[production] |
| 12:25 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host wdqs1028.eqiad.wmnet |
[production] |
| 12:24 |
<kevinbazira@deploy2002> |
helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
| 12:19 |
<mszwarc@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1260617|Allow for demoting 2FA-less members of further 6 groups (T418580)]] (duration: 10m 23s) |
[production] |
| 12:13 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host wdqs2009.codfw.wmnet |
[production] |
| 12:12 |
<mszwarc@deploy2002> |
mszwarc: Continuing with sync |
[production] |
| 12:11 |
<mszwarc@deploy2002> |
mszwarc: Backport for [[gerrit:1260617|Allow for demoting 2FA-less members of further 6 groups (T418580)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 12:08 |
<mszwarc@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1260617|Allow for demoting 2FA-less members of further 6 groups (T418580)]] |
[production] |
| 12:07 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host wdqs2009.codfw.wmnet |
[production] |
| 12:07 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-ctrl2002.codfw.wmnet |
[production] |
| 12:02 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-ctrl2002.codfw.wmnet |
[production] |
| 11:56 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host dse-k8s-ctrl2001.codfw.wmnet |
[production] |
| 11:52 |
<marostegui> |
Restart clouddb1022:s3 to enable error_log T420177 |
[production] |
| 11:51 |
<btullis@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host dse-k8s-ctrl2001.codfw.wmnet |
[production] |
| 11:51 |
<jayme> |
migrated wikikube apiservers (eqiad and codfw) to IPIP - T420436 |
[production] |
| 11:49 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for alias: wikikube-master-codfw@codfw |
[production] |
| 11:49 |
<jayme@cumin1003> |
END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs |
[production] |
| 11:48 |
<jayme@cumin1003> |
START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs |
[production] |