2023-08-17
21:21 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
21:11 <ebernhardson@deploy1002> Finished deploy [airflow-dags/search@1d60a29]: make wikibase ttl imports to hdfs world readable (duration: 00m 11s) [production]
21:11 <ebernhardson@deploy1002> Started deploy [airflow-dags/search@1d60a29]: make wikibase ttl imports to hdfs world readable [production]
21:10 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
21:09 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
21:03 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
21:03 <cmooney@cumin1001> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
20:58 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1011.eqiad.wmnet with reason: host reimage [production]
20:56 <urandom> Rolling Cassandra restart eqiad/d (RESTBase cluster) — T339298 [production]
20:55 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1011.eqiad.wmnet with reason: host reimage [production]
20:50 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:950011|Revert "Add newline to README for backport training"]] (duration: 10m 43s) [production]
20:49 <urandom> Rolling Cassandra restart eqiad/b (RESTBase cluster) — T339298 [production]
20:48 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
20:46 <sukhe> restart pybal on lvs3008 [production]
20:44 <bking@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wdqs1010.eqiad.wmnet with OS bullseye [production]
20:44 <thcipriani@deploy1002> thcipriani: Continuing with sync [production]
20:41 <sukhe> restart pybal on lvs3010 [production]
20:41 <thcipriani@deploy1002> thcipriani: Backport for [[gerrit:950011|Revert "Add newline to README for backport training"]] synced to the testservers mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
20:40 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host wdqs1011.eqiad.wmnet with OS bullseye [production]
20:40 <bking@cumin1001> START - Cookbook sre.hosts.reimage for host wdqs1010.eqiad.wmnet with OS bullseye [production]
20:40 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:950011|Revert "Add newline to README for backport training"]] [production]
20:34 <thcipriani@deploy1002> Finished scap: Backport for [[gerrit:950036|Add newline to README for backport training]] (duration: 13m 29s) [production]
20:30 <urandom> Rolling Cassandra restart eqiad/a (RESTBase cluster) — T339298 [production]
20:28 <thcipriani@deploy1002> thcipriani: Continuing with sync [production]
20:22 <thcipriani@deploy1002> thcipriani: Backport for [[gerrit:950036|Add newline to README for backport training]] synced to the testservers mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) [production]
20:21 <thcipriani@deploy1002> Started scap: Backport for [[gerrit:950036|Add newline to README for backport training]] [production]
20:20 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
20:20 <cmooney@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add reverses for Lumen transport esams eqiad - cmooney@cumin1001" [production]
20:20 <urandom> Rolling Cassandra restart codfw/d (RESTBase cluster) — T339298 [production]
20:19 <cmooney@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add reverses for Lumen transport esams eqiad - cmooney@cumin1001" [production]
20:12 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum4001.ulsfo.wmnet with OS bookworm [production]
20:06 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
20:03 <urandom> Rolling Cassandra restart codfw/c (RESTBase cluster) — T339298 [production]
19:35 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir3004.esams.wmnet with OS bullseye [production]
19:25 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum4001.ulsfo.wmnet with reason: host reimage [production]
19:22 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on durum4001.ulsfo.wmnet with reason: host reimage [production]
19:22 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum5002.eqsin.wmnet with OS bookworm [production]
19:20 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum5001.eqsin.wmnet with OS bookworm [production]
19:11 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ncredir3004.esams.wmnet with reason: host reimage [production]
19:07 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on ncredir3004.esams.wmnet with reason: host reimage [production]
19:02 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host durum4001.ulsfo.wmnet with OS bookworm [production]
19:01 <brett@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host durum4001.ulsfo.wmnet with OS bookworm [production]
18:54 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum6002.drmrs.wmnet with OS bookworm [production]
18:54 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ncredir3003.esams.wmnet with OS bullseye [production]
18:53 <urandom> Rolling Cassandra restart codfw/b — T339298 [production]
18:51 <sukhe@cumin2002> START - Cookbook sre.hosts.reimage for host ncredir3004.esams.wmnet with OS bullseye [production]
18:49 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum6001.drmrs.wmnet with OS bookworm [production]
18:41 <sukhe> force agent run on A:acmechief for CR 950005 [production]
18:35 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum1002.eqiad.wmnet with OS bookworm [production]
18:35 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum5002.eqsin.wmnet with reason: host reimage [production]