4051-4100 of 10000 results (105ms)
2023-08-17 §
20:41 <thcipriani@deploy1002> thcipriani: Backport for [[gerrit:950011|Revert "Add newline to README for backport training"]] synced to the testservers mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
20:40 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host wdqs1011.eqiad.wmnet with OS bullseye [production]
20:40 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host wdqs1010.eqiad.wmnet with OS bullseye [production]
20:40 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:950011|Revert "Add newline to README for backport training"]] [production]
20:34 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:950036|Add newline to README for backport training]] (duration: 13m 29s) [production]
20:30 <urandom> Rolling Cassandra restart eqiad/a (RESTBase cluster) — T339298 [production]
20:28 <thcipriani@deploy1002> thcipriani: Continuing with sync [production]
20:22 <thcipriani@deploy1002> thcipriani: Backport for [[gerrit:950036|Add newline to README for backport training]] synced to the testservers mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
20:21 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:950036|Add newline to README for backport training]] [production]
20:20 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:20 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add reverses for Lumen transport esams eqiad - cmooney@cumin1001" [production]
20:20 <urandom> Rolling Cassandra restart codfw/d (RESTBase cluster) — T339298 [production]
20:19 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add reverses for Lumen transport esams eqiad - cmooney@cumin1001" [production]
20:12 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum4001.ulsfo.wmnet with OS bookworm [production]
20:06 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
20:03 <urandom> Rolling Cassandra restart codfw/c (RESTBase cluster) — T339298 [production]
19:35 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir3004.esams.wmnet with OS bullseye [production]
19:25 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum4001.ulsfo.wmnet with reason: host reimage [production]
19:22 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum4001.ulsfo.wmnet with reason: host reimage [production]
19:22 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum5002.eqsin.wmnet with OS bookworm [production]
19:20 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum5001.eqsin.wmnet with OS bookworm [production]
19:11 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir3004.esams.wmnet with reason: host reimage [production]
19:07 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir3004.esams.wmnet with reason: host reimage [production]
19:02 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host durum4001.ulsfo.wmnet with OS bookworm [production]
19:01 <brett@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host durum4001.ulsfo.wmnet with OS bookworm [production]
18:54 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum6002.drmrs.wmnet with OS bookworm [production]
18:54 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir3003.esams.wmnet with OS bullseye [production]
18:53 <urandom> Rolling Cassandra restart codfw/b — T339298 [production]
18:51 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host ncredir3004.esams.wmnet with OS bullseye [production]
18:49 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum6001.drmrs.wmnet with OS bookworm [production]
18:41 <sukhe> force agent run on A:acmechief for CR 950005 [production]
18:35 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum1002.eqiad.wmnet with OS bookworm [production]
18:35 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum5002.eqsin.wmnet with reason: host reimage [production]
18:33 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum5001.eqsin.wmnet with reason: host reimage [production]
18:30 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum5002.eqsin.wmnet with reason: host reimage [production]
18:29 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum5001.eqsin.wmnet with reason: host reimage [production]
18:26 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum4002.ulsfo.wmnet with OS bookworm [production]
18:25 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum2001.codfw.wmnet with OS bookworm [production]
18:21 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum2002.codfw.wmnet with OS bookworm [production]
18:17 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir3003.esams.wmnet with reason: host reimage [production]
18:15 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum6001.drmrs.wmnet with reason: host reimage [production]
18:13 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum1001.eqiad.wmnet with OS bookworm [production]
18:12 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum6002.drmrs.wmnet with reason: host reimage [production]
18:12 <bking@cumin1001> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) for new host flink-zk2003.codfw.wmnet [production]
18:12 <bking@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host flink-zk2003.codfw.wmnet with OS bookworm [production]
18:12 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir3003.esams.wmnet with reason: host reimage [production]
18:10 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum4002.ulsfo.wmnet with reason: host reimage [production]
18:08 <dancy@deploy1002> rebuilt and synchronized wikiversions files: group2 wikis to 1.41.0-wmf.22 refs T343724 [production]
18:07 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum6001.drmrs.wmnet with reason: host reimage [production]
18:07 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum6002.drmrs.wmnet with reason: host reimage [production]