2025-01-23
06:42 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2189 T383709', diff saved to https://phabricator.wikimedia.org/P72237 and previous config saved to /var/cache/conftool/dbconfig/20250123-064241-marostegui.json [production]
06:42 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db2189.codfw.wmnet with reason: Onsite work [production]
06:41 <marostegui> Powering off db2189 for onsite maintenance T383709 [production]
02:35 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
02:10 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1012.eqiad.wmnet with reason: host reimage [production]
02:06 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1012.eqiad.wmnet with reason: host reimage [production]
01:50 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
01:49 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
01:27 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudcephosd1012.eqiad.wmnet with reason: host reimage [production]
01:23 <andrew@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudcephosd1012.eqiad.wmnet with reason: host reimage [production]
01:06 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
01:06 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
01:00 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
00:59 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
00:54 <tzatziki> removing 2 files for legal compliance [production]
00:44 <tzatziki> removing 1 file for legal compliance [production]
00:42 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
00:41 <andrew@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
2025-01-22
23:58 <andrew@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
23:57 <andrew@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephosd1012.eqiad.wmnet with OS bullseye [production]
22:24 <dmartin@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
22:23 <dmartin@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
22:23 <dmartin@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
22:22 <dmartin@deploy2002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
22:20 <dmartin@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
22:20 <dmartin@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
22:20 <kamila@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1135-1141].eqiad.wmnet [production]
22:20 <kamila@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker[1135-1141].eqiad.wmnet [production]
22:13 <dmartin@deploy2002> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
22:12 <dmartin@deploy2002> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
22:12 <dmartin@deploy2002> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
22:11 <dmartin@deploy2002> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
22:06 <dmartin@deploy2002> helmfile [staging] DONE helmfile.d/services/wikifunctions: apply [production]
22:06 <dmartin@deploy2002> helmfile [staging] START helmfile.d/services/wikifunctions: apply [production]
22:03 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1139.eqiad.wmnet with OS bookworm [production]
21:59 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1138.eqiad.wmnet with OS bookworm [production]
21:55 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1137.eqiad.wmnet with OS bookworm [production]
21:52 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1140.eqiad.wmnet with OS bookworm [production]
21:49 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1135.eqiad.wmnet with OS bookworm [production]
21:45 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1141.eqiad.wmnet with OS bookworm [production]
21:44 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1139.eqiad.wmnet with reason: host reimage [production]
21:42 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1136.eqiad.wmnet with OS bookworm [production]
21:40 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1138.eqiad.wmnet with reason: host reimage [production]
21:37 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1137.eqiad.wmnet with reason: host reimage [production]
21:36 <dzahn@dns1004> END - running authdns-update [production]
21:34 <cjming> end of UTC late backport window [production]
21:34 <dzahn@dns1004> START - running authdns-update [production]
21:34 <cjming@deploy2002> Finished scap sync-world: Backport for [[gerrit:1113512|Add a few more contextual attributes to web base (T373715)]] (duration: 11m 41s) [production]
21:33 <cmooney@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on netflow7001.magru.wmnet with reason: disabling alerts as I'm running gnmic manually rather than with systemd [production]
21:33 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1140.eqiad.wmnet with reason: host reimage [production]