|
2026-01-22
|
| 11:51 |
<arnaudb@cumin1003> |
END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: Upgrade gitlab |
[production] |
| 11:47 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be2077 |
[production] |
| 11:47 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.move-vlan for host ms-be2077 |
[production] |
| 11:47 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-be2077.codfw.wmnet with OS bullseye |
[production] |
| 11:46 |
<mvernon@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host ms-be2077.codfw.wmnet with OS bullseye |
[production] |
| 11:42 |
<arnaudb@cumin1003> |
START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Upgrade gitlab |
[production] |
| 11:32 |
<daniel@deploy2002> |
helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-services/redioscope: apply |
[production] |
| 11:31 |
<daniel@deploy2002> |
helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-services/redioscope: apply |
[production] |
| 11:28 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host ms-be2077 |
[production] |
| 11:28 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host ms-be2077 |
[production] |
| 11:28 |
<mvernon@cumin2002> |
START - Cookbook sre.network.configure-switch-interfaces for host ms-be2077 |
[production] |
| 11:28 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ms-be2077.codfw.wmnet 238.32.192.10.in-addr.arpa 8.3.2.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors |
[production] |
| 11:28 |
<mvernon@cumin2002> |
START - Cookbook sre.dns.wipe-cache ms-be2077.codfw.wmnet 238.32.192.10.in-addr.arpa 8.3.2.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors |
[production] |
| 11:28 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
| 11:28 |
<mvernon@cumin2002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2077 - mvernon@cumin2002" |
[production] |
| 11:28 |
<mvernon@cumin2002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host ms-be2077 - mvernon@cumin2002" |
[production] |
| 11:22 |
<mvernon@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
| 11:21 |
<btullis@cumin1003> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on P{dse-k8s-worker1006.eqiad.wmnet} and (A:dse-k8s-master-eqiad or A:dse-k8s-worker-eqiad) |
[production] |
| 11:21 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.move-vlan for host ms-be2077 |
[production] |
| 11:20 |
<mvernon@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-be2077.codfw.wmnet with OS bullseye |
[production] |
| 11:08 |
<a-pizzata@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply |
[production] |
| 11:08 |
<a-pizzata@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-content-history-reconcile-enrich: apply |
[production] |
| 10:43 |
<btullis@cumin1003> |
START - Cookbook sre.k8s.reboot-nodes rolling reboot on P{dse-k8s-worker1006.eqiad.wmnet} and (A:dse-k8s-master-eqiad or A:dse-k8s-worker-eqiad) |
[production] |
| 10:35 |
<moritzm> |
installing systemd bugfix updates from Bookworm point release |
[production] |
| 10:14 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy2007.codfw.wmnet with OS trixie |
[production] |
| 09:50 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy2007.codfw.wmnet with reason: host reimage |
[production] |
| 09:48 |
<jgiannelos@deploy2002> |
helmfile [staging] DONE helmfile.d/services/mobileapps: apply |
[production] |
| 09:48 |
<jgiannelos@deploy2002> |
helmfile [staging] START helmfile.d/services/mobileapps: apply |
[production] |
| 09:46 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy2007.codfw.wmnet with reason: host reimage |
[production] |
| 09:44 |
<aklapper@deploy2002> |
rebuilt and synchronized wikiversions files: group2 to 1.46.0-wmf.12 refs T413803 |
[production] |
| 09:31 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host dbproxy2007.codfw.wmnet with OS trixie |
[production] |
| 09:26 |
<aklapper@deploy2002> |
rebuilt and synchronized wikiversions files: group1 to 1.46.0-wmf.12 refs T413803 |
[production] |
| 09:18 |
<aklapper@deploy2002> |
Finished scap sync-world: Backport for [[gerrit:1229724|Revert "Fix DivisionByZeroError when calculating bitrate" (T415169)]] (duration: 07m 13s) |
[production] |
| 09:13 |
<aklapper@deploy2002> |
jforrester, aklapper: Continuing with sync |
[production] |
| 09:13 |
<aklapper@deploy2002> |
jforrester, aklapper: Backport for [[gerrit:1229724|Revert "Fix DivisionByZeroError when calculating bitrate" (T415169)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. |
[production] |
| 09:10 |
<aklapper@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1229724|Revert "Fix DivisionByZeroError when calculating bitrate" (T415169)]] |
[production] |
| 08:06 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Depooling db2225 (T410589)', diff saved to https://phabricator.wikimedia.org/P87861 and previous config saved to /var/cache/conftool/dbconfig/20260122-080653-ladsgroup.json |
[production] |
| 08:06 |
<ladsgroup@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2225.codfw.wmnet with reason: Maintenance |
[production] |
| 08:06 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2207 (T410589)', diff saved to https://phabricator.wikimedia.org/P87860 and previous config saved to /var/cache/conftool/dbconfig/20260122-080629-ladsgroup.json |
[production] |
| 07:56 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2207', diff saved to https://phabricator.wikimedia.org/P87859 and previous config saved to /var/cache/conftool/dbconfig/20260122-075620-ladsgroup.json |
[production] |
| 07:46 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2207', diff saved to https://phabricator.wikimedia.org/P87858 and previous config saved to /var/cache/conftool/dbconfig/20260122-074612-ladsgroup.json |
[production] |
| 07:36 |
<ladsgroup@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db2207 (T410589)', diff saved to https://phabricator.wikimedia.org/P87857 and previous config saved to /var/cache/conftool/dbconfig/20260122-073604-ladsgroup.json |
[production] |
| 07:15 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host dbproxy2006.codfw.wmnet with OS trixie |
[production] |
| 06:55 |
<marostegui@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dbproxy2006.codfw.wmnet with reason: host reimage |
[production] |
| 06:51 |
<marostegui@cumin1003> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db1239.eqiad.wmnet with reason: Maintenance |
[production] |
| 06:51 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1235 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87856 and previous config saved to /var/cache/conftool/dbconfig/20260122-065138-marostegui.json |
[production] |
| 06:49 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on dbproxy2006.codfw.wmnet with reason: host reimage |
[production] |
| 06:41 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1235', diff saved to https://phabricator.wikimedia.org/P87855 and previous config saved to /var/cache/conftool/dbconfig/20260122-064131-marostegui.json |
[production] |
| 06:33 |
<marostegui@cumin1003> |
START - Cookbook sre.hosts.reimage for host dbproxy2006.codfw.wmnet with OS trixie |
[production] |
| 06:31 |
<marostegui@cumin1003> |
dbctl commit (dc=all): 'Repooling after maintenance db1235', diff saved to https://phabricator.wikimedia.org/P87854 and previous config saved to /var/cache/conftool/dbconfig/20260122-063122-marostegui.json |
[production] |