2026-01-22
11:56 <blake@cumin1003> START - Cookbook sre.switchdc.mediawiki.09-unlock-scap for datacenter switchover from codfw to eqiad [production]
11:55 <Dreamy_Jazz> Run of script for T413868 has finished on s8 [production]
11:55 <blake@cumin1003> END (PASS) - Cookbook sre.switchdc.mediawiki.09-unlock-scap (exit_code=0) for datacenter switchover from codfw to eqiad [production]
11:55 <root@deploy2002> Unlocked for deployment [ALL REPOSITORIES]: Datacenter switchover from codfw to eqiad - T330996 (duration: 00m 39s) [production]
11:54 <root@deploy2002> Forcefully removing global lock: Datacenter switchover from codfw to eqiad - T330996 [production]
11:54 <blake@cumin1003> START - Cookbook sre.switchdc.mediawiki.09-unlock-scap for datacenter switchover from codfw to eqiad [production]
11:54 <blake@cumin1003> END (PASS) - Cookbook sre.switchdc.mediawiki.00-lock-scap (exit_code=0) for datacenter switchover from codfw to eqiad [production]
11:54 <root@deploy2002> Locking from deployment [ALL REPOSITORIES]: Datacenter switchover from codfw to eqiad - T330996 [production]
11:54 <blake@cumin1003> START - Cookbook sre.switchdc.mediawiki.00-lock-scap for datacenter switchover from codfw to eqiad [production]
11:52 <arnaudb@cumin1003> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Upgrade gitlab [production]
11:51 <arnaudb@cumin1003> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: Upgrade gitlab [production]
11:47 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be2077 [production]
11:47 <mvernon@cumin2002> START - Cookbook sre.hosts.move-vlan for host ms-be2077 [production]
11:47 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2077.codfw.wmnet with OS bullseye [production]
11:46 <mvernon@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-be2077.codfw.wmnet with OS bullseye [production]
11:42 <arnaudb@cumin1003> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Upgrade gitlab [production]
11:32 <daniel@deploy2002> helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-services/redioscope: apply [production]
11:31 <daniel@deploy2002> helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-services/redioscope: apply [production]
11:28 <mvernon@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be2077 [production]
11:28 <mvernon@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be2077 [production]
11:28 <mvernon@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host ms-be2077 [production]
11:28 <mvernon@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ms-be2077.codfw.wmnet 238.32.192.10.in-addr.arpa 8.3.2.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
11:28 <mvernon@cumin2002> START - Cookbook sre.dns.wipe-cache ms-be2077.codfw.wmnet 238.32.192.10.in-addr.arpa 8.3.2.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
11:28 <mvernon@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
11:28 <mvernon@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2077 - mvernon@cumin2002" [production]
11:28 <mvernon@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2077 - mvernon@cumin2002" [production]
11:22 <mvernon@cumin2002> START - Cookbook sre.dns.netbox [production]
11:21 <btullis@cumin1003> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on P{dse-k8s-worker1006.eqiad.wmnet} and (A:dse-k8s-master-eqiad or A:dse-k8s-worker-eqiad) [production]
11:21 <mvernon@cumin2002> START - Cookbook sre.hosts.move-vlan for host ms-be2077 [production]
11:20 <mvernon@cumin2002> START - Cookbook sre.hosts.reimage for host ms-be2077.codfw.wmnet with OS bullseye [production]
11:08 <a-pizzata@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply [production]
11:08 <a-pizzata@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply [production]
10:43 <btullis@cumin1003> START - Cookbook sre.k8s.reboot-nodes rolling reboot on P{dse-k8s-worker1006.eqiad.wmnet} and (A:dse-k8s-master-eqiad or A:dse-k8s-worker-eqiad) [production]
10:35 <moritzm> installing systemd bugfix updates from Bookworm point release [production]
10:14 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy2007.codfw.wmnet with OS trixie [production]
09:50 <marostegui@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy2007.codfw.wmnet with reason: host reimage [production]
09:48 <jgiannelos@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
09:48 <jgiannelos@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
09:46 <marostegui@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy2007.codfw.wmnet with reason: host reimage [production]
09:44 <aklapper@deploy2002> rebuilt and synchronized wikiversions files: group2 to 1.46.0-wmf.12 refs T413803 [production]
09:31 <marostegui@cumin1003> START - Cookbook sre.hosts.reimage for host dbproxy2007.codfw.wmnet with OS trixie [production]
09:26 <aklapper@deploy2002> rebuilt and synchronized wikiversions files: group1 to 1.46.0-wmf.12 refs T413803 [production]
09:18 <aklapper@deploy2002> Finished scap sync-world: Backport for [[gerrit:1229724|Revert "Fix DivisionByZeroError when calculating bitrate" (T415169)]] (duration: 07m 13s) [production]
09:13 <aklapper@deploy2002> jforrester, aklapper: Continuing with sync [production]
09:13 <aklapper@deploy2002> jforrester, aklapper: Backport for [[gerrit:1229724|Revert "Fix DivisionByZeroError when calculating bitrate" (T415169)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
09:10 <aklapper@deploy2002> Started scap sync-world: Backport for [[gerrit:1229724|Revert "Fix DivisionByZeroError when calculating bitrate" (T415169)]] [production]
08:06 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Depooling db2225 (T410589)', diff saved to https://phabricator.wikimedia.org/P87861 and previous config saved to /var/cache/conftool/dbconfig/20260122-080653-ladsgroup.json [production]
08:06 <ladsgroup@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2225.codfw.wmnet with reason: Maintenance [production]
08:06 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2207 (T410589)', diff saved to https://phabricator.wikimedia.org/P87860 and previous config saved to /var/cache/conftool/dbconfig/20260122-080629-ladsgroup.json [production]
07:56 <ladsgroup@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db2207', diff saved to https://phabricator.wikimedia.org/P87859 and previous config saved to /var/cache/conftool/dbconfig/20260122-075620-ladsgroup.json [production]