401-450 of 10000 results (17ms)
2025-01-23 §
02:06 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1012.eqiad.wmnet with reason: host reimage [production]
01:50 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
01:49 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
01:27 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1012.eqiad.wmnet with reason: host reimage [production]
01:23 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1012.eqiad.wmnet with reason: host reimage [production]
01:06 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
01:06 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
01:00 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
00:59 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
00:54 <tzatziki> removing 2 files for legal compliance [production]
00:44 <tzatziki> removing 1 file for legal complaince [production]
00:42 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
00:41 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
2025-01-22 §
23:58 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
23:57 <andrew@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
22:36 <mforns> [data lake temp accounts] re-ran DAG mediawiki_history_denormalized for 2024-12 [analytics]
22:24 <dmartin@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
22:23 <dmartin@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
22:23 <dmartin@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
22:22 <dmartin@deploy2002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
22:20 <dmartin@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
22:20 <dmartin@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
22:20 <kamila@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1135-1141].eqiad.wmnet [production]
22:20 <kamila@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1135-1141].eqiad.wmnet [production]
22:13 <dmartin@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
22:12 <dmartin@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
22:12 <dmartin@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
22:11 <dmartin@deploy2002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
22:06 <dmartin@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
22:06 <dmartin@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
22:03 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1139.eqiad.wmnet with OS bookworm [production]
21:59 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1138.eqiad.wmnet with OS bookworm [production]
21:55 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1137.eqiad.wmnet with OS bookworm [production]
21:52 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1140.eqiad.wmnet with OS bookworm [production]
21:49 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1135.eqiad.wmnet with OS bookworm [production]
21:45 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1141.eqiad.wmnet with OS bookworm [production]
21:44 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1139.eqiad.wmnet with reason: host reimage [production]
21:42 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1136.eqiad.wmnet with OS bookworm [production]
21:40 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1138.eqiad.wmnet with reason: host reimage [production]
21:37 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1137.eqiad.wmnet with reason: host reimage [production]
21:36 <dzahn@dns1004> END - running authdns-update [production]
21:34 <cjming> end of UTC late backport window [production]
21:34 <dzahn@dns1004> START - running authdns-update [production]
21:34 <cjming@deploy2002> Finished scap sync-world: Backport for [[gerrit:1113512|Add a few more contextual attributes to web base (T373715)]] (duration: 11m 41s) [production]
21:33 <cmooney@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on netflow7001.magru.wmnet with reason: disabling alerts as I'm running gnmic manually rather than with systemd [production]
21:33 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1140.eqiad.wmnet with reason: host reimage [production]
21:29 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1135.eqiad.wmnet with reason: host reimage [production]
21:27 <cjming@deploy2002> cjming: Continuing with sync [production]
21:27 <cjming@deploy2002> cjming: Backport for [[gerrit:1113512|Add a few more contextual attributes to web base (T373715)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
21:26 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]