2351-2400 of 10000 results (98ms)
2024-02-13 ยง
11:32 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/mw-debug: apply [production]
11:32 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply [production]
11:31 <brouberol@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage2001.codfw.wmnet with reason: host reimage [production]
11:31 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-debug: apply [production]
11:27 <hnowlan@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mw2282.codfw.wmnet with OS bullseye [production]
11:24 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply [production]
11:24 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance [production]
11:24 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-debug: apply [production]
11:24 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance [production]
11:24 <claime> Change default maxUnavailable for mw-on-k8s to 10% [production]
11:21 <brouberol@cumin1002> START - Cookbook sre.hosts.reimage for host apifeatureusage2001.codfw.wmnet with OS bullseye [production]
11:20 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host apifeatureusage1001.eqiad.wmnet with OS bullseye [production]
11:14 <gmodena@deploy2002> helmfile [codfw] DONE helmfile.d/services/eventstreams: apply [production]
11:14 <gmodena@deploy2002> helmfile [codfw] START helmfile.d/services/eventstreams: apply [production]
11:14 <hnowlan@cumin2002> START - Cookbook sre.hosts.reimage for host mw2282.codfw.wmnet with OS bullseye [production]
11:13 <hnowlan@cumin2002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host mw2282.codfw.wmnet with OS bullseye [production]
11:12 <gmodena@deploy2002> helmfile [eqiad] DONE helmfile.d/services/eventstreams: apply [production]
11:11 <gmodena@deploy2002> helmfile [eqiad] START helmfile.d/services/eventstreams: apply [production]
11:10 <gmodena@deploy2002> helmfile [staging] DONE helmfile.d/services/eventstreams: apply [production]
11:10 <gmodena@deploy2002> helmfile [staging] START helmfile.d/services/eventstreams: apply [production]
11:04 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage [production]
11:01 <brouberol@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage [production]
11:01 <cgoubert@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2388.codfw.wmnet [production]
11:01 <cgoubert@cumin2002> START - Cookbook sre.hosts.remove-downtime for mw2388.codfw.wmnet [production]
10:57 <hnowlan@cumin2002> START - Cookbook sre.hosts.reimage for host mw2282.codfw.wmnet with OS bullseye [production]
10:49 <brouberol@cumin1002> START - Cookbook sre.hosts.reimage for host apifeatureusage1001.eqiad.wmnet with OS bullseye [production]
10:41 <brouberol@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apifeatureusage1001.eqiad.wmnet with OS bookworm [production]
10:39 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage [production]
10:36 <brouberol@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage [production]
10:25 <brouberol@cumin1002> START - Cookbook sre.hosts.reimage for host apifeatureusage1001.eqiad.wmnet with OS bookworm [production]
10:23 <brouberol@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apifeatureusage1001.eqiad.wmnet with OS bookworm [production]
10:23 <kharlan@deploy2002> helmfile [codfw] DONE helmfile.d/services/ipoid: apply [production]
10:23 <kharlan@deploy2002> helmfile [codfw] START helmfile.d/services/ipoid: apply [production]
10:22 <kharlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/ipoid: apply [production]
10:22 <kharlan@deploy2002> helmfile [eqiad] START helmfile.d/services/ipoid: apply [production]
10:22 <kharlan@deploy2002> helmfile [staging] DONE helmfile.d/services/ipoid: apply [production]
10:22 <kharlan@deploy2002> helmfile [staging] START helmfile.d/services/ipoid: apply [production]
10:09 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage [production]
10:06 <brouberol@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage [production]
10:05 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb2002-dev.codfw.wmnet [production]
09:58 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host clouddb2002-dev.codfw.wmnet [production]
09:57 <brouberol@cumin1002> START - Cookbook sre.hosts.reimage for host apifeatureusage1001.eqiad.wmnet with OS bookworm [production]
09:23 <stran@deploy2002> helmfile [codfw] DONE helmfile.d/services/ipoid: apply [production]
09:22 <stran@deploy2002> helmfile [codfw] START helmfile.d/services/ipoid: apply [production]
09:22 <stran@deploy2002> helmfile [eqiad] DONE helmfile.d/services/ipoid: apply [production]
09:22 <akosiaris> delete sessionstore pod to force rescheduling [production]
09:21 <stran@deploy2002> helmfile [eqiad] START helmfile.d/services/ipoid: apply [production]
09:20 <stran@deploy2002> helmfile [staging] DONE helmfile.d/services/ipoid: apply [production]
09:20 <brouberol@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apifeatureusage1001.eqiad.wmnet with OS bookworm [production]
09:20 <stran@deploy2002> helmfile [staging] START helmfile.d/services/ipoid: apply [production]