1-50 of 10000 results (90ms)
2026-02-25 §
10:12 <btullis@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dse-k8s-worker1024.eqiad.wmnet with reason: host reimage [production]
10:09 <fceratto@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dborch1003.eqiad.wmnet with reason: host reimage [production]
10:02 <fceratto@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dborch1003.eqiad.wmnet with reason: host reimage [production]
09:57 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host dse-k8s-worker1024.eqiad.wmnet with OS bookworm [production]
09:54 <fceratto@cumin1003> START - Cookbook sre.hosts.reimage for host dborch1003.eqiad.wmnet with OS trixie [production]
09:54 <btullis@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:54 <btullis@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Updating records after renaming and moving vlan of some an-worker hosts - btullis@cumin1003" [production]
09:53 <btullis@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Updating records after renaming and moving vlan of some an-worker hosts - btullis@cumin1003" [production]
09:52 <elukey> uploaded python3-wmflib_3.0.0 to apt.wikimedia.org bullseye-wikimedia,bookworm-wikimedia,trixie-wikimedia [production]
09:48 <btullis@cumin1003> START - Cookbook sre.dns.netbox [production]
09:22 <XioNoX> push pfw policies - T418305 [production]
08:46 <ammarpad@deploy2002> mwscript-k8s job started: extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=mediawikiwiki --logwiki=metawiki Egortropeano Fortuna1992 # T418331 [production]
08:45 <ammarpad@deploy2002> mwscript-k8s job started: extensions/CentralAuth/maintenance/fixStuckGlobalRename.php --wiki=gawiki --logwiki=metawiki DroopyDoggy AlterDiegos # T418330 [production]
08:20 <slyngshede@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp2045.codfw.wmnet with reason: host reimage [production]
08:14 <slyngshede@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on cp2045.codfw.wmnet with reason: host reimage [production]
07:59 <slyngshede@cumin1003> START - Cookbook sre.hosts.reimage for host cp2045.codfw.wmnet with OS trixie [production]
06:16 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy1023.eqiad.wmnet with OS trixie [production]
05:59 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy1023.eqiad.wmnet with reason: host reimage [production]
05:54 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy1023.eqiad.wmnet with reason: host reimage [production]
05:38 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host dbproxy1023.eqiad.wmnet with OS trixie [production]
02:25 <marostegui@cumin1003> dbctl commit (dc=all): 'Depooling db1261 (T415786)', diff saved to https://phabricator.wikimedia.org/P89022 and previous config saved to /var/cache/conftool/dbconfig/20260225-022502-marostegui.json [production]
02:24 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1261.eqiad.wmnet with reason: Maintenance [production]
02:24 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1260 (T415786)', diff saved to https://phabricator.wikimedia.org/P89021 and previous config saved to /var/cache/conftool/dbconfig/20260225-022446-marostegui.json [production]
02:23 <ryankemper> [WDQS] Restart codfw wdqs-main [production]
02:13 <mwpresync@deploy2002> Finished scap build-images: Publishing wmf/next image (duration: 12m 49s) [production]
02:09 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1260', diff saved to https://phabricator.wikimedia.org/P89020 and previous config saved to /var/cache/conftool/dbconfig/20260225-020938-marostegui.json [production]
02:00 <mwpresync@deploy2002> Started scap build-images: Publishing wmf/next image [production]
01:54 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1260', diff saved to https://phabricator.wikimedia.org/P89019 and previous config saved to /var/cache/conftool/dbconfig/20260225-015430-marostegui.json [production]
01:39 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1260 (T415786)', diff saved to https://phabricator.wikimedia.org/P89018 and previous config saved to /var/cache/conftool/dbconfig/20260225-013921-marostegui.json [production]
00:25 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1243274|Start reading from new file tables on all small wikis (T416548)]] (duration: 06m 40s) [production]
00:22 <zabe@deploy2002> zabe: Continuing with sync [production]
00:21 <zabe@deploy2002> zabe: Backport for [[gerrit:1243274|Start reading from new file tables on all small wikis (T416548)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
00:19 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1243274|Start reading from new file tables on all small wikis (T416548)]] [production]
00:11 <zabe> zabe@deploy2002:~$ foreachwiki extensions/TimedMediaHandler/maintenance/migrateTranscodeStates.php # T415064 [production]
00:10 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1241167|Update documenation to reference config-schema.php]] (duration: 07m 20s) [production]
00:06 <zabe@deploy2002> zabe: Continuing with sync [production]
00:05 <zabe@deploy2002> zabe: Backport for [[gerrit:1241167|Update documenation to reference config-schema.php]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
00:02 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1241167|Update documenation to reference config-schema.php]] [production]
2026-02-24 §
23:41 <swfrench-wmf> built envoy images (1.35.7-3) - T364245 [production]
23:29 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncmonitor1001.eqiad.wmnet with OS trixie [production]
23:04 <ryankemper> [WDQS] `ryankemper@cumin2002:~$ sudo -E cumin 'A:wdqs-main AND P{wdqs2*} AND NOT P{wdqs2012*}' 'systemctl restart wdqs-blazegraph'` (2012 still seems healthy, rest are all not) [production]
22:59 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncmonitor1001.eqiad.wmnet with reason: host reimage [production]
22:58 <ryankemper> [WDQS] `ryankemper@cumin2002:~$ sudo -E cumin 'A:wdqs-main AND P{wdqs1*}' 'systemctl restart wdqs-blazegraph'` [production]
22:52 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncmonitor1001.eqiad.wmnet with reason: host reimage [production]
22:40 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host ncmonitor1001.eqiad.wmnet with OS trixie [production]
22:37 <brett> import ncmonitor 3.1.0~deb13u1 into trixie-wikimedia (T401832) [production]
22:35 <hashar> Restarted Gerrit due to a replication config issue [production]
21:23 <aaron@deploy2002> Finished scap sync-world: Backport for [[gerrit:1224253|Switch math sandbox specs to plain wikimedia.org (T418188)]], [[gerrit:1224228|Copy rest_v1-wikimedia.json to standard-docroot (T418188)]] (duration: 07m 20s) [production]
21:19 <aaron@deploy2002> aaron: Continuing with sync [production]
21:19 <aaron@deploy2002> aaron: Backport for [[gerrit:1224253|Switch math sandbox specs to plain wikimedia.org (T418188)]], [[gerrit:1224228|Copy rest_v1-wikimedia.json to standard-docroot (T418188)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]