2026-02-26
09:57 <jmm@puppetserver1001> conftool action : set/pooled=true; selector: dnsdisc=pki,name=codfw [production]
09:51 <elukey@deploy2002> Started scap sync-world: Test new Docker Registry backend [production]
09:47 <elukey> move the Docker Registry's /v2/restricted (MediaWiki Docker image prefix) to s3/apus - T390251 [production]
09:44 <jmm@puppetserver1001> conftool action : set/pooled=false; selector: dnsdisc=pki,name=codfw [production]
09:43 <urbanecm@deploy2002> mwscript-k8s job started: foreachwikiindblist growthexperiments WikimediaMaintenance:createExtensionTables.php growthexperiments [production]
09:41 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1028.eqiad.wmnet with OS trixie [production]
09:18 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1028.eqiad.wmnet with reason: host reimage [production]
09:17 <mvernon@cumin2002> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-eqiad [production]
09:15 <hashar@deploy2002> Finished deploy [gerrit/gerrit@74473c2]: wm-checks-api: add Rerun command for codehealth + inline documentation (duration: 00m 14s) [production]
09:15 <hashar@deploy2002> Started deploy [gerrit/gerrit@74473c2]: wm-checks-api: add Rerun command for codehealth + inline documentation [production]
09:13 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1028.eqiad.wmnet with reason: host reimage [production]
09:10 <aikochou@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
09:09 <mvernon@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad [production]
09:06 <mvernon@cumin2002> END (FAIL) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=1) rolling restart_daemons on A:swift-fe [production]
09:05 <mvernon@cumin1003> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe [production]
09:01 <mvernon@cumin1003> START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe [production]
09:01 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host dbproxy1028.eqiad.wmnet with OS trixie [production]
09:00 <moritzm> restart FPM on Phabricator hosts to pick up OpenSSL updates [production]
09:00 <mvernon@cumin2002> START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe [production]
07:50 <root@cumin2002> DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Hokwelum out of all services on: 2432 hosts [production]
07:40 <moritzm> installing openssl security updates [production]
06:16 <moritzm> updated thirdparty/node22 to node 20.20.0 [production]
06:08 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1263 (T415786)', diff saved to https://phabricator.wikimedia.org/P89032 and previous config saved to /var/cache/conftool/dbconfig/20260226-060809-marostegui.json [production]
06:08 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1263.eqiad.wmnet with reason: Maintenance [production]
06:07 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1262 (T415786)', diff saved to https://phabricator.wikimedia.org/P89031 and previous config saved to /var/cache/conftool/dbconfig/20260226-060755-marostegui.json [production]
05:52 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1262', diff saved to https://phabricator.wikimedia.org/P89030 and previous config saved to /var/cache/conftool/dbconfig/20260226-055246-marostegui.json [production]
05:37 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1262', diff saved to https://phabricator.wikimedia.org/P89029 and previous config saved to /var/cache/conftool/dbconfig/20260226-053739-marostegui.json [production]
05:22 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1262 (T415786)', diff saved to https://phabricator.wikimedia.org/P89028 and previous config saved to /var/cache/conftool/dbconfig/20260226-052230-marostegui.json [production]
02:13 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 13m 12s) [production]
02:00 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
2026-02-25
23:36 <swfrench@deploy2002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
23:35 <swfrench@deploy2002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
23:32 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1244012|SECURITY: ReassignMentees: Handle hidden users correctly (T418222)]], [[gerrit:1244011|SECURITY: ReassignMentees: Handle hidden users correctly (T418222)]] (duration: 07m 01s) [production]
23:28 <urbanecm@deploy2002> urbanecm: Continuing with sync [production]
23:27 <urbanecm@deploy2002> urbanecm: Backport for [[gerrit:1244012|SECURITY: ReassignMentees: Handle hidden users correctly (T418222)]], [[gerrit:1244011|SECURITY: ReassignMentees: Handle hidden users correctly (T418222)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
23:25 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1244012|SECURITY: ReassignMentees: Handle hidden users correctly (T418222)]], [[gerrit:1244011|SECURITY: ReassignMentees: Handle hidden users correctly (T418222)]] [production]
23:20 <swfrench@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
23:18 <swfrench@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
23:17 <swfrench@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
23:15 <swfrench@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
23:15 <swfrench@deploy2002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
23:14 <swfrench@deploy2002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
23:08 <urbanecm@deploy2002> Finished scap sync-world: Backport for [[gerrit:1243860|tests: Introduce MentorRemoverTest]] (duration: 07m 12s) [production]
23:04 <urbanecm@deploy2002> urbanecm: Continuing with sync [production]
23:03 <urbanecm@deploy2002> urbanecm: Backport for [[gerrit:1243860|tests: Introduce MentorRemoverTest]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
23:02 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host ms-fe2024.codfw.wmnet with OS bullseye [production]
23:01 <urbanecm@deploy2002> Started scap sync-world: Backport for [[gerrit:1243860|tests: Introduce MentorRemoverTest]] [production]
22:48 <cjming> end of UTC late backport window [production]
22:43 <cjming@deploy2002> Finished scap sync-world: Backport for [[gerrit:1243965|GetSecurityLogContextHandler: Add IP reputation country code (T415354)]], [[gerrit:1243966|GetSecurityLogContextHandler: Add IP reputation country code (T415354)]] (duration: 08m 11s) [production]
22:39 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]