2022-06-17
09:55 <btullis@deploy1002> helmfile [codfw] START helmfile.d/services/datahub: apply on main [production]
09:52 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
09:52 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
09:51 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2007.codfw.wmnet [production]
09:44 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve2007.codfw.wmnet [production]
09:41 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2006.codfw.wmnet [production]
09:35 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf1004.eqiad.wmnet [production]
09:34 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve2006.codfw.wmnet [production]
09:33 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host webperf1004.eqiad.wmnet [production]
09:32 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf1003.eqiad.wmnet [production]
09:30 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2005.codfw.wmnet [production]
09:28 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host webperf1003.eqiad.wmnet [production]
09:25 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf2004.codfw.wmnet [production]
09:24 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve2005.codfw.wmnet [production]
09:23 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host webperf2004.codfw.wmnet [production]
09:23 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on ganeti4004.ulsfo.wmnet with reason: Enable virt in BIOS [production]
09:23 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on ganeti4004.ulsfo.wmnet with reason: Enable virt in BIOS [production]
09:19 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2004.codfw.wmnet [production]
09:18 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf2003.codfw.wmnet [production]
09:14 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host webperf2003.codfw.wmnet [production]
09:11 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve2004.codfw.wmnet [production]
09:09 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2003.codfw.wmnet [production]
09:01 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve2003.codfw.wmnet [production]
08:58 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2002.codfw.wmnet [production]
08:51 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve2002.codfw.wmnet [production]
08:47 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-serve2001.codfw.wmnet [production]
08:39 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-serve2001.codfw.wmnet [production]
08:21 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on ml-serve-ctrl[2001-2002].codfw.wmnet with reason: Rebooting to activate new kernel for T310483 [production]
08:21 <klausman@cumin1001> START - Cookbook sre.hosts.downtime for 0:30:00 on ml-serve-ctrl[2001-2002].codfw.wmnet with reason: Rebooting to activate new kernel for T310483 [production]
08:17 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on ganeti4004.ulsfo.wmnet with reason: Enable virt in BIOS [production]
08:17 <jmm@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on ganeti4004.ulsfo.wmnet with reason: Enable virt in BIOS [production]
08:17 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-staging2002.codfw.wmnet [production]
08:10 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-staging2002.codfw.wmnet [production]
08:08 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-staging2001.codfw.wmnet [production]
08:02 <klausman@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-staging2001.codfw.wmnet [production]
07:41 <klausman@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on ml-staging-ctrl[2001-2002].codfw.wmnet with reason: Rebooting to activate new kernel for T310483 [production]
07:41 <klausman@cumin1001> START - Cookbook sre.hosts.downtime for 0:30:00 on ml-staging-ctrl[2001-2002].codfw.wmnet with reason: Rebooting to activate new kernel for T310483 [production]
02:51 <pt1979@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aqs1018.eqiad.wmnet with OS bullseye [production]
02:39 <pt1979@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1018.eqiad.wmnet with reason: host reimage [production]
02:36 <pt1979@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1018.eqiad.wmnet with reason: host reimage [production]
02:10 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
02:09 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
02:09 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
02:08 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
02:06 <tstarling@deploy1002> Synchronized wmf-config/InitialiseSettings.php: (no justification provided) (duration: 03m 43s) [production]
02:02 <pt1979@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1018.eqiad.wmnet with OS bullseye [production]
01:54 <pt1979@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aqs1017.eqiad.wmnet with OS bullseye [production]
01:43 <pt1979@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1017.eqiad.wmnet with reason: host reimage [production]
01:39 <pt1979@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1017.eqiad.wmnet with reason: host reimage [production]
01:07 <pt1979@cumin1001> START - Cookbook sre.hosts.reimage for host aqs1017.eqiad.wmnet with OS bullseye [production]