101-150 of 10000 results (27ms)
2026-01-23 ยง
13:59 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on tools-k8s-worker1001.eqiad.wmnet with reason: host reimage [production]
13:59 <jclark@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on tools-k8s-worker1004.eqiad.wmnet with reason: host reimage [production]
13:55 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host tools-k8s-worker1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:51 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host tools-k8s-worker1002.eqiad.wmnet with OS trixie [production]
13:49 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host tools-k8s-worker1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:48 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host tools-k8s-ctrl1002.eqiad.wmnet with OS trixie [production]
13:48 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host tools-k8s-worker1004.eqiad.wmnet with OS trixie [production]
13:48 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host tools-k8s-ctrl1001.eqiad.wmnet with OS trixie [production]
13:48 <jclark@cumin1003> START - Cookbook sre.hosts.reimage for host tools-k8s-worker1001.eqiad.wmnet with OS trixie [production]
13:44 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host tools-k8s-worker1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:44 <jclark@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host tools-k8s-worker1003 [production]
13:44 <jclark@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host tools-k8s-worker1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:44 <jclark@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host tools-k8s-worker1003 [production]
13:42 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host tools-k8s-worker1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:41 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host tools-k8s-worker1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:40 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host tools-k8s-worker1002.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:40 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host tools-k8s-ctrl1002.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:40 <jclark@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host tools-k8s-ctrl1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:34 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host tools-k8s-worker1004.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:33 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host tools-k8s-worker1003.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:33 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host tools-k8s-worker1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:33 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host tools-k8s-worker1002.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:32 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host tools-k8s-ctrl1002.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:32 <jclark@cumin1003> START - Cookbook sre.hosts.provision for host tools-k8s-ctrl1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
13:29 <jclark@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:29 <jclark@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt tools-k8 - jclark@cumin1003" [production]
13:29 <jclark@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: added network and mgmt tools-k8 - jclark@cumin1003" [production]
13:26 <jclark@cumin1003> START - Cookbook sre.dns.netbox [production]
13:02 <taavi> switch https://gitlab.wikimedia.org/repos/cloud/wmcs/utils to fast-forward only merging mode [admin]
12:55 <cgoubert@deploy2002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
12:53 <cgoubert@deploy2002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
12:32 <moritzm> uploaded dnsmasq 2.92-1~wmf12u to bookworm-wikimedia/main T396864 [production]
12:18 <aokoth@cumin1003> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1004.wikimedia.org with reason: Security Update [production]
11:07 <aokoth@cumin1003> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1004.wikimedia.org with reason: Security Update [production]
10:07 <jgiannelos@deploy2002> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply [production]
10:07 <jgiannelos@deploy2002> helmfile [codfw] START helmfile.d/services/mobileapps: apply [production]
10:06 <jgiannelos@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
10:06 <jgiannelos@deploy2002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
10:05 <jgiannelos@deploy2002> helmfile [staging] DONE helmfile.d/services/mobileapps: apply [production]
10:05 <jgiannelos@deploy2002> helmfile [staging] START helmfile.d/services/mobileapps: apply [production]
10:03 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti-test2003.codfw.wmnet [production]
09:56 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti-test2003.codfw.wmnet [production]
09:52 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti-test2001.codfw.wmnet [production]
09:46 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti-test2001.codfw.wmnet [production]
09:07 <moritzm> installing Linux 6.1.159 on Bookworm hosts [production]
08:12 <marostegui@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance [production]
08:12 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1251 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87887 and previous config saved to /var/cache/conftool/dbconfig/20260123-081240-marostegui.json [production]
08:02 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1251', diff saved to https://phabricator.wikimedia.org/P87886 and previous config saved to /var/cache/conftool/dbconfig/20260123-080232-marostegui.json [production]
07:52 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1251', diff saved to https://phabricator.wikimedia.org/P87885 and previous config saved to /var/cache/conftool/dbconfig/20260123-075223-marostegui.json [production]
07:42 <marostegui@cumin1003> dbctl commit (dc=all): 'Repooling after maintenance db1251 (T411163 T411164)', diff saved to https://phabricator.wikimedia.org/P87884 and previous config saved to /var/cache/conftool/dbconfig/20260123-074215-marostegui.json [production]