1-50 of 10000 results (70ms)
2025-01-10 §
22:49 <eileen> config revision changed from cf756e5f to 51a3e52e [production]
22:45 <bking@cumin2002> END (PASS) - Cookbook sre.elasticsearch.ban (exit_code=0) Banning hosts: cloudelastic1005*,cloudelastic1006* for ban hosts prior to decom - bking@cumin2002 - T380937 [production]
22:45 <bking@cumin2002> START - Cookbook sre.elasticsearch.ban Banning hosts: cloudelastic1005*,cloudelastic1006* for ban hosts prior to decom - bking@cumin2002 - T380937 [production]
22:45 <bking@cumin2002> END (FAIL) - Cookbook sre.elasticsearch.ban (exit_code=99) Banning hosts: cloudelastic1005,cloudelastic1006 for ban hosts prior to decom - bking@cumin2002 - T380937 [production]
22:45 <bking@cumin2002> START - Cookbook sre.elasticsearch.ban Banning hosts: cloudelastic1005,cloudelastic1006 for ban hosts prior to decom - bking@cumin2002 - T380937 [production]
22:21 <eileen> config revision changed from 2a572b99 to cf756e5f [production]
22:11 <eileen> config revision changed from e0866d2f to 2a572b99 [production]
20:01 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 [production]
19:43 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 [production]
19:41 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 [production]
19:26 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on eventlog1003.eqiad.wmnet with reason: Shutting down VM in preparation for decommissioning [production]
19:26 <btullis@cumin1002> START - Cookbook sre.hosts.downtime for 7 days, 0:00:00 on eventlog1003.eqiad.wmnet with reason: Shutting down VM in preparation for decommissioning [production]
19:23 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-eqiad: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 [production]
18:49 <sukhe> sudo cumin 'P:Mediawiki::Maintenance' 'run-puppet-agent': CR 1109755 [production]
16:56 <cmooney@dns2005> END - running authdns-update [production]
16:55 <cmooney@dns2005> START - running authdns-update [production]
16:50 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:50 <cmooney@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add dns names in newly assigned wmcs private ipv6 ranges - cmooney@cumin1002" [production]
16:50 <cmooney@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add dns names in newly assigned wmcs private ipv6 ranges - cmooney@cumin1002" [production]
16:47 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
16:28 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:28 <cmooney@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add dns names in newly assigned wmcs private ipv6 ranges - cmooney@cumin1002" [production]
16:28 <eevans@deploy2002> helmfile [codfw] DONE helmfile.d/services/data-gateway: apply [production]
16:28 <cmooney@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add dns names in newly assigned wmcs private ipv6 ranges - cmooney@cumin1002" [production]
16:27 <eevans@deploy2002> helmfile [codfw] START helmfile.d/services/data-gateway: apply [production]
16:26 <eevans@deploy2002> helmfile [eqiad] DONE helmfile.d/services/data-gateway: apply [production]
16:25 <eevans@deploy2002> helmfile [eqiad] START helmfile.d/services/data-gateway: apply [production]
16:23 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
16:23 <cmooney@cumin1002> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
16:19 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
15:46 <eevans@deploy2002> helmfile [staging] DONE helmfile.d/services/data-gateway: apply [production]
15:45 <eevans@deploy2002> helmfile [staging] START helmfile.d/services/data-gateway: apply [production]
15:36 <jelto@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[2199-2202].codfw.wmnet [production]
15:36 <jelto@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[2199-2202].codfw.wmnet [production]
15:34 <jelto> homer 'cr*codfw*' commit 'T377877' [production]
15:33 <jelto> homer 'lsw1-d1-codfw*' commit 'T377877' [production]
15:33 <jelto> homer 'lsw1-d5-codfw*' commit 'T377877' [production]
15:32 <jelto> homer 'lsw1-d3-codfw*' commit 'T377877' [production]
15:32 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2202.codfw.wmnet with OS bookworm [production]
15:25 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2201.codfw.wmnet with OS bookworm [production]
15:25 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db2240.codfw.wmnet with reason: maintenance [production]
15:25 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 10:00:00 on db2240.codfw.wmnet with reason: maintenance [production]
15:21 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db2240.codfw.wmnet with reason: maintenance [production]
15:20 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1:00:00 on db2240.codfw.wmnet with reason: maintenance [production]
15:20 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2240 to make it candidate master', diff saved to https://phabricator.wikimedia.org/P71984 and previous config saved to /var/cache/conftool/dbconfig/20250110-152035-marostegui.json [production]
15:12 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2202.codfw.wmnet with reason: host reimage [production]
15:08 <jelto@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2202.codfw.wmnet with reason: host reimage [production]
15:06 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2201.codfw.wmnet with reason: host reimage [production]
15:02 <jelto@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2201.codfw.wmnet with reason: host reimage [production]
14:49 <jelto@cumin1002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2202 [production]