2024-09-30
22:44 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
22:00 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1076760|s3: Reduce revision-slots cache expiry to 60 seconds (T183490)]] (duration: 06m 54s) [production]
21:55 <zabe@deploy2002> zabe: Continuing with sync [production]
21:55 <zabe@deploy2002> zabe: Backport for [[gerrit:1076760|s3: Reduce revision-slots cache expiry to 60 seconds (T183490)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:54 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:54 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
21:53 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1076760|s3: Reduce revision-slots cache expiry to 60 seconds (T183490)]] [production]
21:47 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
21:47 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
21:30 <zabe@deploy2002> Finished scap sync-world: Backport for [[gerrit:1076834|Make revision-slots expiry configurable (T183490)]] (duration: 07m 42s) [production]
21:26 <zabe@deploy2002> zabe: Continuing with sync [production]
21:25 <zabe@deploy2002> zabe: Backport for [[gerrit:1076834|Make revision-slots expiry configurable (T183490)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:23 <zabe@deploy2002> Started scap sync-world: Backport for [[gerrit:1076834|Make revision-slots expiry configurable (T183490)]] [production]
21:06 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
20:56 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
20:56 <sukhe@cumin1002> END (PASS) - Cookbook sre.dns.admin (exit_code=0) DNS admin: pool site ulsfo [reason: repool ulsfo as cr3-ulsfo was replaced, T375345] [production]
20:56 <sukhe@cumin1002> START - Cookbook sre.dns.admin DNS admin: pool site ulsfo [reason: repool ulsfo as cr3-ulsfo was replaced, T375345] [production]
20:45 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
20:35 <ebernhardson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1074257|ClosedWikiProvider: Support canAlwaysAutocreate option (T374987)]] (duration: 10m 40s) [production]
20:34 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
20:30 <ebernhardson@deploy2002> ebernhardson: Continuing with sync [production]
20:27 <ebernhardson@deploy2002> ebernhardson: Backport for [[gerrit:1074257|ClosedWikiProvider: Support canAlwaysAutocreate option (T374987)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:24 <ebernhardson@deploy2002> Started scap sync-world: Backport for [[gerrit:1074257|ClosedWikiProvider: Support canAlwaysAutocreate option (T374987)]] [production]
20:22 <ebernhardson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1076810|Change votewiki language back to English. (T302443)]] (duration: 06m 41s) [production]
20:18 <ebernhardson@deploy2002> ebernhardson, ahonc: Continuing with sync [production]
20:18 <ebernhardson@deploy2002> ebernhardson, ahonc: Backport for [[gerrit:1076810|Change votewiki language back to English. (T302443)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:16 <ebernhardson@deploy2002> Started scap sync-world: Backport for [[gerrit:1076810|Change votewiki language back to English. (T302443)]] [production]
20:13 <ebernhardson@deploy2002> Finished scap sync-world: Backport for [[gerrit:1070282|cirrus: Remove unused Regex pool counter (T369808)]] (duration: 07m 34s) [production]
20:12 <wmbot~dcaro@urcuchillay> END (FAIL) - Cookbook wmcs.ceph.osd.undrain_node (exit_code=99) (T372814) [admin]
20:08 <ebernhardson@deploy2002> ebernhardson: Continuing with sync [production]
20:07 <ebernhardson@deploy2002> ebernhardson: Backport for [[gerrit:1070282|cirrus: Remove unused Regex pool counter (T369808)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
20:05 <ebernhardson@deploy2002> Started scap sync-world: Backport for [[gerrit:1070282|cirrus: Remove unused Regex pool counter (T369808)]] [production]
19:56 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
19:56 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
19:08 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs[4008-4010].ulsfo.wmnet [production]
19:08 <sukhe@cumin1002> START - Cookbook sre.hosts.remove-downtime for lvs[4008-4010].ulsfo.wmnet [production]
18:41 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
18:41 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
18:25 <taavi> run striker migrations T359428 [tools]
18:04 <sukhe@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs[4008-4010].ulsfo.wmnet with reason: site is depooled, cr3-ulsfo is being replaced [production]
18:04 <sukhe@cumin1002> START - Cookbook sre.hosts.downtime for 4:00:00 on lvs[4008-4010].ulsfo.wmnet with reason: site is depooled, cr3-ulsfo is being replaced [production]
17:47 <ebernhardson@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
17:47 <ebernhardson@deploy2002> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
17:44 <ebernhardson@deploy2002> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
17:44 <ebernhardson@deploy2002> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
17:25 <ebernhardson@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
17:24 <ebernhardson@deploy2002> helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply [production]
17:22 <jhathaway@cumin1002> START - Cookbook sre.hosts.reimage for host sretest1002.eqiad.wmnet with OS bookworm [production]
17:22 <jhathaway@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host sretest1002.eqiad.wmnet with OS bookworm [production]
17:12 <ebernhardson@deploy2002> helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]