5251-5300 of 10000 results (53ms)
2022-03-23 ยง
15:02 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1031.eqiad.wmnet with reason: host reimage [production]
15:01 <mmandere@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cp1080.eqiad.wmnet with reason: host reimage [production]
15:00 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1030.eqiad.wmnet with OS bullseye [production]
14:59 <andrew@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1031.eqiad.wmnet with reason: host reimage [production]
14:50 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.reboot (exit_code=0) [production]
14:48 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
14:47 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
14:47 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
14:46 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
14:45 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1031.eqiad.wmnet with OS bullseye [production]
14:44 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on cloudvirt1030.eqiad.wmnet with reason: host reimage [production]
14:44 <andrew@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1030.eqiad.wmnet with reason: host reimage [production]
14:44 <mmandere@cumin1001> START - Cookbook sre.hosts.reimage for host cp1080.eqiad.wmnet with OS buster [production]
14:41 <urbanecm@deploy1002> Synchronized php-1.39.0-wmf.3/extensions/WikimediaMaintenance/addWiki.php: 9a0aed0: addWiki: Create GrowthExperiment tables for all new Wikipedias (T304052) (duration: 01m 06s) [production]
14:38 <bblack@cumin1001> conftool action : set/pooled=yes; selector: name=cp1085.eqiad.wmnet [production]
14:37 <mmandere> depool cp1080 for reimage - T290005 [production]
14:33 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1030.eqiad.wmnet with OS bullseye [production]
14:31 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
14:30 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
14:30 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
14:29 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
14:28 <bking@cumin1001> START - Cookbook sre.wdqs.reboot [production]
14:27 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.reboot (exit_code=0) [production]
14:23 <bblack> reboot cp1085 (downtimed) [production]
14:20 <bking@cumin1001> START - Cookbook sre.wdqs.reboot [production]
14:19 <bking@cumin1001> conftool action : set/pooled=yes; selector: name=wcqs1002.eqiad.wmnet [production]
14:18 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1029.eqiad.wmnet with OS bullseye [production]
14:11 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.reboot (exit_code=99) [production]
14:10 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1027.eqiad.wmnet with OS bullseye [production]
14:09 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
14:08 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
14:08 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
14:06 <mmandere> pool cp1082 with HAProxy as TLS termination layer - T290005 [production]
14:04 <bking@cumin1001> START - Cookbook sre.wdqs.reboot [production]
14:04 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
14:04 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.reboot (exit_code=99) [production]
14:04 <bking@cumin1001> START - Cookbook sre.wdqs.reboot [production]
14:04 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.reboot (exit_code=99) [production]
14:04 <bking@cumin1001> START - Cookbook sre.wdqs.reboot [production]
14:00 <mmandere@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp1082.eqiad.wmnet with OS buster [production]
14:00 <bking@cumin1001> END (PASS) - Cookbook sre.wdqs.reboot (exit_code=0) [production]
13:59 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1029.eqiad.wmnet with reason: host reimage [production]
13:59 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
13:58 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
13:58 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
13:57 <bking@cumin1001> START - Cookbook sre.wdqs.reboot [production]
13:55 <andrew@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1029.eqiad.wmnet with reason: host reimage [production]
13:54 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
13:51 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1010.eqiad.wmnet with OS bullseye [production]
13:50 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1027.eqiad.wmnet with reason: host reimage [production]