2022-03-23
22:05 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1042.eqiad.wmnet with OS bullseye [production]
22:05 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1041.eqiad.wmnet with OS bullseye [production]
21:55 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1040.eqiad.wmnet with OS bullseye [production]
21:54 <wm-bot> Drained 'cloudvirt1042.eqiad.wmnet'. (T281276) - cookbook ran by andrew@buster [admin]
21:42 <bking@cumin1001> END (PASS) - Cookbook sre.elasticsearch.rolling-operation (exit_code=0) Operation.UPGRADE (1 nodes at a time) for ElasticSearch cluster cloudelastic: cloudelastic ES 6.8 upgrade - bking@cumin1001 - T301956 [production]
21:35 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1040.eqiad.wmnet with reason: host reimage [production]
21:31 <andrew@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1040.eqiad.wmnet with reason: host reimage [production]
21:27 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
21:26 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
21:26 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
21:25 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
21:24 <cjming@deploy1002> Synchronized wmf-config/InitialiseSettings-labs.php: Config: [[gerrit:773331|Enable split A/B testing on beta cluster (T301584)]] (duration: 00m 50s) [production]
21:19 <wm-bot> Set cloudvirt 'cloudvirt1042.eqiad.wmnet' maintenance. (T281276) - cookbook ran by andrew@buster [admin]
21:19 <wm-bot> Draining 'cloudvirt1042.eqiad.wmnet'. (T281276) - cookbook ran by andrew@buster [admin]
21:18 <andrew@cumin1001> START - Cookbook sre.hosts.reimage for host cloudvirt1040.eqiad.wmnet with OS bullseye [production]
21:15 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
21:15 <catrope@deploy1002> Synchronized wmf-config/CommonSettings.php: Config: [[gerrit:772408|Allow autoconfirmed users to view basic IP information (T303858)]] and [[gerrit:767216|Enable IPInfo on testwiki (T260598)]] (duration: 00m 50s) [production]
21:13 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
21:13 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
21:12 <wm-bot> Drained 'cloudvirt1040.eqiad.wmnet'. (T281276) - cookbook ran by andrew@buster [admin]
21:12 <wm-bot> Set cloudvirt 'cloudvirt1040.eqiad.wmnet' maintenance. (T281276) - cookbook ran by andrew@buster [admin]
21:11 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
21:09 <wm-bot> Draining 'cloudvirt1040.eqiad.wmnet'. (T281276) - cookbook ran by andrew@buster [admin]
21:08 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1039.eqiad.wmnet with OS bullseye [production]
21:07 <wm-bot> Set cloudvirt 'cloudvirt1040.eqiad.wmnet' maintenance. (T281276) - cookbook ran by andrew@buster [admin]
21:06 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
21:04 <wm-bot> Draining 'cloudvirt1040.eqiad.wmnet'. (T281276) - cookbook ran by andrew@buster [admin]
21:02 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
21:02 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:58 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:55 <wm-bot> Set cloudvirt 'cloudvirt1041.eqiad.wmnet' maintenance. (T281276) - cookbook ran by andrew@buster [admin]
20:54 <wm-bot> Draining 'cloudvirt1041.eqiad.wmnet'. (T281276) - cookbook ran by andrew@buster [admin]
20:53 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:53 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1037.eqiad.wmnet with OS bullseye [production]
20:52 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:52 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:52 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1038.eqiad.wmnet with OS bullseye [production]
20:51 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:48 <andrew@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1039.eqiad.wmnet with reason: host reimage [production]
20:47 <taavi> add samtar (User:TheresNoTime) as maintainer [tools.stewardbots]
20:46 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
20:46 <andrew@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1039.eqiad.wmnet with reason: host reimage [production]
20:45 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
20:45 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
20:44 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
20:44 <AntiComposite> `./stewardbots/StewardBot/manage.sh restart` timed out, did not auto rejoin [tools.stewardbots]
20:40 <catrope@deploy1002> Synchronized wmf-config/extension-list: Config: [[gerrit:771448|DynamicSidebar: remove unused extension (T304006)]] (duration: 00m 49s) [production]
20:37 <wm-bot> <bd808> Manual restart after seeing the bot drop out of #wikimedia-operations; Turns out the client rejoined before the restart, but no harm done in restarting [tools.jouncebot]
20:34 <catrope@deploy1002> Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:771447|DynamicSidebar: remove from InitialiseSettings]] (duration: 00m 51s) [production]
20:33 <andrew@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on cloudvirt1037.eqiad.wmnet with reason: host reimage [production]