2022-10-20
§
|
19:59 |
<robh@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:57 |
<robh@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
19:57 |
<herron@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thanos-fe1003.eqiad.wmnet |
[production] |
19:55 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1046.eqiad.wmnet' maintenance (downtime id: b65da7e2-9fd2-4a1e-8dd4-88ca65936ae4, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:55 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1045.eqiad.wmnet' maintenance (downtime id: cbbf114c-9a4c-4dd2-9b58-50da8bd896ca, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:55 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1044.eqiad.wmnet' maintenance (downtime id: 5969beaf-72bf-4af9-ae4d-8e5c3331e78e, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:55 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1047.eqiad.wmnet' maintenance (downtime id: b8f7389d-79b4-411d-b3d5-ec8f92eba101, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Draining 'cloudvirt1047.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Safe rebooting 'cloudvirt1047.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Draining 'cloudvirt1046.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Safe rebooting 'cloudvirt1046.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Draining 'cloudvirt1045.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Safe rebooting 'cloudvirt1045.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Draining 'cloudvirt1044.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Safe rebooting 'cloudvirt1044.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:53 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T321312)', diff saved to https://phabricator.wikimedia.org/P35731 and previous config saved to /var/cache/conftool/dbconfig/20221020-195331-ladsgroup.json |
[production] |
19:53 |
<wm-bot2> |
Safe reboot of 'cloudvirt1051.eqiad.wmnet' finished successfully. - cookbook ran by andrew@bullseye |
[admin] |
19:53 |
<wm-bot2> |
Unset cloudvirt 'cloudvirt1051.eqiad.wmnet' maintenance. - cookbook ran by andrew@bullseye |
[admin] |
19:51 |
<wm-bot2> |
Safe reboot of 'cloudvirt1049.eqiad.wmnet' finished successfully. - cookbook ran by andrew@bullseye |
[admin] |
19:51 |
<wm-bot2> |
Unset cloudvirt 'cloudvirt1049.eqiad.wmnet' maintenance. - cookbook ran by andrew@bullseye |
[admin] |
19:50 |
<wm-bot2> |
Safe reboot of 'cloudvirt1048.eqiad.wmnet' finished successfully. - cookbook ran by andrew@bullseye |
[admin] |
19:50 |
<wm-bot2> |
Unset cloudvirt 'cloudvirt1048.eqiad.wmnet' maintenance. - cookbook ran by andrew@bullseye |
[admin] |
19:50 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp4037.ulsfo.wmnet with OS buster |
[production] |
19:49 |
<wm-bot> |
<root> Restarted to rejoin channels |
[tools.stashbot] |
19:49 |
<wm-bot2> |
Drained 'cloudvirt1051.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:41 |
<dzahn@cumin2002> |
START - Cookbook sre.gitlab.reboot-runner rolling reboot on A:codfw and (A:gitlab-runner) |
[production] |
19:39 |
<herron@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thanos-be2001.codfw.wmnet |
[production] |
19:38 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1145.eqiad.wmnet with reason: Maintenance |
[production] |
19:38 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1145.eqiad.wmnet with reason: Maintenance |
[production] |
19:37 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1144:3315 (T321312)', diff saved to https://phabricator.wikimedia.org/P35727 and previous config saved to /var/cache/conftool/dbconfig/20221020-193756-ladsgroup.json |
[production] |
19:37 |
<sukhe@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4037.ulsfo.wmnet with OS buster |
[production] |
19:36 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1050.eqiad.wmnet' maintenance (downtime id: c5dbfa7b-72fc-4156-8257-af224a725b78, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:36 |
<wm-bot2> |
Draining 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:36 |
<wm-bot2> |
Safe rebooting 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:33 |
<wm-bot2> |
Draining 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:33 |
<wm-bot2> |
Safe rebooting 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:33 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-fe2001.codfw.wmnet |
[production] |
19:31 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2170:3312', diff saved to https://phabricator.wikimedia.org/P35726 and previous config saved to /var/cache/conftool/dbconfig/20221020-193130-ladsgroup.json |
[production] |
19:30 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1048.eqiad.wmnet' maintenance (downtime id: c1a3e92c-cb04-46e4-ac5d-784c79601b05, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:30 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1049.eqiad.wmnet' maintenance (downtime id: 62fdc4ee-c8d5-4892-aeb9-a681cdcbc84b, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:29 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1050.eqiad.wmnet' maintenance (downtime id: 9ec407cd-5c00-407a-a57d-794f6c68f947, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:29 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1051.eqiad.wmnet' maintenance (downtime id: 712683ea-d58f-484b-8efb-89d1c21aa0e3, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:29 |
<bking@cumin2002> |
START - Cookbook sre.elasticsearch.rolling-operation Operation.REBOOT (1 nodes at a time) for ElasticSearch cluster relforge: apply updates - bking@cumin2002 - T321310 |
[production] |
19:29 |
<wm-bot2> |
Draining 'cloudvirt1048.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:29 |
<wm-bot2> |
Safe rebooting 'cloudvirt1048.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:29 |
<wm-bot2> |
Draining 'cloudvirt1049.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:29 |
<wm-bot2> |
Safe rebooting 'cloudvirt1049.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:28 |
<wm-bot2> |
Draining 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:28 |
<wm-bot2> |
Safe rebooting 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:28 |
<wm-bot2> |
Draining 'cloudvirt1051.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |