251-300 of 7768 results (19ms)
2025-06-09 §
21:48 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cirrussearch2115.codfw.wmnet [production]
21:41 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host cirrussearch2115.codfw.wmnet [production]
21:36 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts cirrussearch2115.codfw.wmnet [production]
21:36 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cirrussearch2115.codfw.wmnet [production]
21:21 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cirrussearch2114.codfw.wmnet [production]
21:19 <bking@cumin2002> END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts cirrussearch2113.codfw.wmnet [production]
21:19 <bking@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts cirrussearch2113.codfw.wmnet [production]
21:12 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host cirrussearch2114.codfw.wmnet [production]
21:09 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cirrussearch2115.codfw.wmnet [production]
21:01 <bking@cumin2002> START - Cookbook sre.hosts.reboot-single for host cirrussearch2115.codfw.wmnet [production]
17:21 <inflatador> bking@cumin1003 power down cirrussearch1063 to prevent logspam T394350 [production]
2025-06-06 §
21:02 <bking@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10 days, 0:00:00 on relforge[1003-1004].eqiad.wmnet with reason: downtime before decom [production]
19:11 <bking@cumin2002> END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: T383811 - bking@cumin2002 [production]
17:20 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: T383811 - bking@cumin2002 [production]
17:08 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.rolling-operation (exit_code=99) Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: T383811 - bking@cumin2002 [production]
17:06 <bking@cumin2002> START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (3 nodes at a time) for ElasticSearch cluster search_eqiad: T383811 - bking@cumin2002 [production]
2025-06-04 §
19:36 <bking@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
19:23 <bking@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
19:22 <bking@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
2025-06-02 §
21:22 <bking@cumin2002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: cirrussearch205*,cirrussearch2060* for T395855 - bking@cumin2002 [production]
21:22 <bking@cumin2002> START - Cookbook sre.elasticsearch.ban Banning hosts: cirrussearch205*,cirrussearch2060* for T395855 - bking@cumin2002 [production]
21:16 <bking@cumin2002> conftool action : set/pooled=no; selector: name=cirrussearch2055.codfw.wmnet|cirrussearch2056.codfw.wmnet|cirrussearch2057.codfw.wmnet|cirrussearch2058.codfw.wmnet|cirrussearch2059.codfw.wmnet|cirrussearch2060.codfw.wmnet|cirrussearch2091.codfw.wmnet [production]
21:13 <bking@cumin2002> conftool action : set/pooled=yes:weight=10; selector: name=cirrussearch.*.codfw.wmnet [production]
14:35 <bking@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:35 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts elastic1067.eqiad.wmnet [production]
14:35 <bking@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: elastic1067.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - bking@cumin2002" [production]
14:35 <bking@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: elastic1067.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - bking@cumin2002" [production]
13:43 <bking@cumin2002> START - Cookbook sre.dns.netbox [production]
13:38 <bking@cumin2002> START - Cookbook sre.hosts.decommission for hosts elastic1067.eqiad.wmnet [production]
13:37 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cirrussearch[1064-1066].eqiad.wmnet [production]
13:37 <bking@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cirrussearch[1064-1066].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - bking@cumin2002" [production]
13:37 <bking@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:37 <bking@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cirrussearch[1064-1066].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - bking@cumin2002" [production]
13:21 <bking@cumin2002> START - Cookbook sre.dns.netbox [production]
13:09 <bking@cumin2002> START - Cookbook sre.hosts.decommission for hosts cirrussearch[1064-1066].eqiad.wmnet [production]
2025-05-30 §
21:08 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts cirrussearch[1055-1059].eqiad.wmnet [production]
21:08 <bking@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:08 <bking@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cirrussearch[1055-1059].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - bking@cumin2002" [production]
21:08 <bking@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cirrussearch[1055-1059].eqiad.wmnet decommissioned, removing all IPs except the asset tag one - bking@cumin2002" [production]
21:04 <bking@cumin2002> START - Cookbook sre.dns.netbox [production]
20:49 <bking@cumin2002> START - Cookbook sre.hosts.decommission for hosts cirrussearch[1055-1059].eqiad.wmnet [production]
20:04 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts relforge[1003-1004].eqiad.wmnet [production]
20:00 <bking@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
20:00 <bking@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
19:51 <bking@deploy1003> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
19:51 <bking@deploy1003> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
19:45 <bking@cumin2002> START - Cookbook sre.hosts.decommission for hosts relforge[1003-1004].eqiad.wmnet [production]
2025-05-29 §
19:05 <bking@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on relforge[1003-1004,1008-1009].eqiad.wmnet with reason: noisy alerts [production]
16:25 <bking@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on cirrussearch[2112-2113].codfw.wmnet with reason: firmware update [production]
16:24 <bking@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cirrussearch[2111-2112].codfw.wmnet [production]