4101-4150 of 10000 results (103ms)
2024-04-30 ยง
22:05 <fabfur@cumin1002> START - Cookbook sre.hosts.reimage for host cp7013.magru.wmnet with OS bullseye [production]
22:04 <fabfur@cumin1002> START - Cookbook sre.hosts.reimage for host cp7014.magru.wmnet with OS bullseye [production]
22:02 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cephosd1005.eqiad.wmnet with reason: host reimage [production]
21:56 <btullis@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cephosd1005.eqiad.wmnet with reason: host reimage [production]
21:50 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cephadm1001.eqiad.wmnet [production]
21:50 <btullis@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
21:50 <btullis@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cephadm1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1002" [production]
21:49 <btullis@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cephadm1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - btullis@cumin1002" [production]
21:37 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host cephosd1005.eqiad.wmnet with OS bullseye [production]
21:33 <mutante> grafana2001 - sudo -u loki /usr/bin/loki -config.file=/etc/loki/loki-local-config.yaml in an attempt to debug issue on grafana-next.wikimedia.org [production]
21:18 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cephosd1004.eqiad.wmnet with OS bullseye [production]
21:06 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir1002.eqiad.wmnet with OS bookworm [production]
21:03 <btullis@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cephosd1004.eqiad.wmnet with reason: host reimage [production]
20:58 <btullis@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cephosd1004.eqiad.wmnet with reason: host reimage [production]
20:56 <btullis@cumin1002> START - Cookbook sre.dns.netbox [production]
20:55 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp7015.magru.wmnet with OS bullseye [production]
20:55 <sukhe@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin1002" [production]
20:54 <sukhe@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin1002" [production]
20:51 <btullis@cumin1002> START - Cookbook sre.hosts.decommission for hosts cephadm1001.eqiad.wmnet [production]
20:50 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dns7002.wikimedia.org with reason: reimaged again [production]
20:50 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dns7002.wikimedia.org with reason: reimaged again [production]
20:49 <sukhe@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host dns7002.wikimedia.org with OS bookworm [production]
20:46 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir1002.eqiad.wmnet with reason: host reimage [production]
20:43 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir1002.eqiad.wmnet with reason: host reimage [production]
20:40 <aokoth@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lists1004.wikimedia.org with OS bookworm [production]
20:40 <aokoth@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - aokoth@cumin1002" [production]
20:39 <cjming> end of UTC late backport window [production]
20:39 <btullis@cumin1002> START - Cookbook sre.hosts.reimage for host cephosd1004.eqiad.wmnet with OS bullseye [production]
20:38 <cjming@deploy1002> Finished scap: Backport for [[gerrit:1025819|Deploy a11y settings to testwiki (T362147)]] (duration: 21m 00s) [production]
20:38 <aokoth@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - aokoth@cumin1002" [production]
20:37 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp7016.magru.wmnet with OS bullseye [production]
20:37 <sukhe@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin1002" [production]
20:36 <sukhe@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - sukhe@cumin1002" [production]
20:31 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7015.magru.wmnet with reason: host reimage [production]
20:30 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host ncredir1002.eqiad.wmnet with OS bookworm [production]
20:28 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir1001.eqiad.wmnet with OS bookworm [production]
20:27 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp7015.magru.wmnet with reason: host reimage [production]
20:26 <cjming@deploy1002> ksarabia and cjming: Continuing with sync [production]
20:21 <aokoth@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lists1004.wikimedia.org with reason: host reimage [production]
20:20 <cjming@deploy1002> ksarabia and cjming: Backport for [[gerrit:1025819|Deploy a11y settings to testwiki (T362147)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:18 <aokoth@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on lists1004.wikimedia.org with reason: host reimage [production]
20:17 <cjming@deploy1002> Started scap: Backport for [[gerrit:1025819|Deploy a11y settings to testwiki (T362147)]] [production]
20:16 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dns7002.wikimedia.org with reason: host reimage [production]
20:13 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp7016.magru.wmnet with reason: host reimage [production]
20:11 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on dns7002.wikimedia.org with reason: host reimage [production]
20:10 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp7016.magru.wmnet with reason: host reimage [production]
20:10 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir1001.eqiad.wmnet with reason: host reimage [production]
20:07 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir1001.eqiad.wmnet with reason: host reimage [production]
20:03 <aokoth@cumin1002> START - Cookbook sre.hosts.reimage for host lists1004.wikimedia.org with OS bookworm [production]
20:01 <aokoth@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host lists1004.wikimedia.org with OS bookworm [production]