2026-05-14
18:40 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db1281.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:29 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1274.eqiad.wmnet with reason: host reimage [production]
18:25 <vriley@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on db1274.eqiad.wmnet with reason: host reimage [production]
18:17 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host db1281.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
18:16 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:14 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
18:09 <vriley@cumin1003> START - Cookbook sre.hosts.reimage for host db1274.eqiad.wmnet with OS bookworm [production]
17:32 <jforrester@deploy1003> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
17:31 <jforrester@deploy1003> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
17:23 <aokoth@cumin1003> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab2002.wikimedia.org with reason: Security Release - T426298 [production]
17:17 <bd808@deploy1003> helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply [production]
17:17 <bd808@deploy1003> helmfile [eqiad] START helmfile.d/services/developer-portal: apply [production]
17:16 <bd808@deploy1003> helmfile [codfw] DONE helmfile.d/services/developer-portal: apply [production]
17:16 <bd808@deploy1003> helmfile [codfw] START helmfile.d/services/developer-portal: apply [production]
17:16 <bd808@deploy1003> helmfile [staging] DONE helmfile.d/services/developer-portal: apply [production]
17:15 <bd808@deploy1003> helmfile [staging] START helmfile.d/services/developer-portal: apply [production]
17:14 <aokoth@cumin1003> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab2002.wikimedia.org with reason: Security Release - T426298 [production]
17:10 <cmooney@dns2005> END - running authdns-update [production]
17:09 <cmooney@dns2005> START - running authdns-update [production]
17:06 <aokoth@cumin1003> END (PASS) - Cookbook sre.gitlab.upgrade (exit_code=0) on GitLab host gitlab1003.wikimedia.org with reason: Security Release - T426298 [production]
16:58 <aokoth@cumin1003> START - Cookbook sre.gitlab.upgrade on GitLab host gitlab1003.wikimedia.org with reason: Security Release - T426298 [production]
16:49 <atsuko@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/dse-k8s-services/opensearch-toolhub-test: apply [production]
16:49 <atsuko@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/dse-k8s-services/opensearch-toolhub-test: apply [production]
16:36 <atsuko@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/opensearch-toolhub-test: apply [production]
16:36 <atsuko@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/opensearch-toolhub-test: apply [production]
16:35 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host kafka-logging1006.eqiad.wmnet with OS trixie [production]
16:31 <topranks> disable core router direct link at esams now that traffic is flowing via switches T424611 [production]
16:25 <topranks> disable core router direct link at drmrs now that traffic is flowing via switches T424611 [production]
16:21 <topranks> disable core router direct link at magru now that traffic is flowing via switches T424611 [production]
16:20 <rzl@deploy1003> helmfile [codfw] DONE helmfile.d/services/mw-cron: apply [production]
16:20 <rzl@deploy1003> helmfile [codfw] START helmfile.d/services/mw-cron: apply [production]
16:19 <rzl@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply [production]
16:17 <rzl@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
16:16 <vriley@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host db1289.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:15 <vriley@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host db1290.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:14 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host db1289.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
16:13 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1288.eqiad.wmnet with OS bookworm [production]
16:13 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
16:12 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
16:11 <rzl@deploy1003> helmfile [eqiad] DONE helmfile.d/services/mw-cron: apply [production]
16:07 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:07 <cmooney@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: remove records for deleted IPs esams,drmrs and magru - cmooney@cumin1003" [production]
16:07 <cmooney@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: remove records for deleted IPs esams,drmrs and magru - cmooney@cumin1003" [production]
16:06 <rzl@deploy1003> helmfile [eqiad] START helmfile.d/services/mw-cron: apply [production]
16:04 <cmooney@cumin1003> START - Cookbook sre.dns.netbox [production]
15:59 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply [production]
15:59 <vriley@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host db1289.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:59 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply [production]
15:57 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host db1290.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART and with Dell SCP reboot policy FORCED [production]
15:56 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host db1290 [production]