2022-10-20
ยง
|
20:05 |
<herron@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host thanos-fe1003.eqiad.wmnet |
[production] |
20:03 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2106 (T321312)', diff saved to https://phabricator.wikimedia.org/P35734 and previous config saved to /var/cache/conftool/dbconfig/20221020-200321-ladsgroup.json |
[production] |
20:03 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2106.codfw.wmnet with reason: Maintenance |
[production] |
20:03 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2106.codfw.wmnet with reason: Maintenance |
[production] |
20:02 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depooling db2175 (T318955)', diff saved to https://phabricator.wikimedia.org/P35733 and previous config saved to /var/cache/conftool/dbconfig/20221020-200205-ladsgroup.json |
[production] |
20:01 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2175.codfw.wmnet with reason: Maintenance |
[production] |
20:01 |
<dzahn@cumin2002> |
END (PASS) - Cookbook sre.gitlab.reboot-runner (exit_code=0) rolling reboot on A:codfw and (A:gitlab-runner) |
[production] |
20:01 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 8:00:00 on db2175.codfw.wmnet with reason: Maintenance |
[production] |
20:01 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db2170:3312 (T318955)', diff saved to https://phabricator.wikimedia.org/P35732 and previous config saved to /var/cache/conftool/dbconfig/20221020-200143-ladsgroup.json |
[production] |
20:00 |
<robh@cumin2002> |
START - Cookbook sre.hosts.provision for host cp4049.mgmt.ulsfo.wmnet with reboot policy FORCED |
[production] |
20:00 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2099.codfw.wmnet with reason: Maintenance |
[production] |
20:00 |
<robh@cumin2002> |
START - Cookbook sre.hosts.provision for host cp4047.mgmt.ulsfo.wmnet with reboot policy FORCED |
[production] |
19:59 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2099.codfw.wmnet with reason: Maintenance |
[production] |
19:59 |
<wm-bot> |
<root> restart wikibugs (again) |
[tools.wikibugs] |
19:59 |
<robh@cumin2002> |
END (PASS) - Cookbook sre.dns.netbox (exit_code=0) |
[production] |
19:57 |
<robh@cumin2002> |
START - Cookbook sre.dns.netbox |
[production] |
19:57 |
<herron@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thanos-fe1003.eqiad.wmnet |
[production] |
19:55 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1046.eqiad.wmnet' maintenance (downtime id: b65da7e2-9fd2-4a1e-8dd4-88ca65936ae4, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:55 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1045.eqiad.wmnet' maintenance (downtime id: cbbf114c-9a4c-4dd2-9b58-50da8bd896ca, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:55 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1044.eqiad.wmnet' maintenance (downtime id: 5969beaf-72bf-4af9-ae4d-8e5c3331e78e, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:55 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1047.eqiad.wmnet' maintenance (downtime id: b8f7389d-79b4-411d-b3d5-ec8f92eba101, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Draining 'cloudvirt1047.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Safe rebooting 'cloudvirt1047.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Draining 'cloudvirt1046.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Safe rebooting 'cloudvirt1046.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Draining 'cloudvirt1045.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Safe rebooting 'cloudvirt1045.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Draining 'cloudvirt1044.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:54 |
<wm-bot2> |
Safe rebooting 'cloudvirt1044.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:53 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1146:3312 (T321312)', diff saved to https://phabricator.wikimedia.org/P35731 and previous config saved to /var/cache/conftool/dbconfig/20221020-195331-ladsgroup.json |
[production] |
19:53 |
<wm-bot2> |
Safe reboot of 'cloudvirt1051.eqiad.wmnet' finished successfully. - cookbook ran by andrew@bullseye |
[admin] |
19:53 |
<wm-bot2> |
Unset cloudvirt 'cloudvirt1051.eqiad.wmnet' maintenance. - cookbook ran by andrew@bullseye |
[admin] |
19:51 |
<wm-bot2> |
Safe reboot of 'cloudvirt1049.eqiad.wmnet' finished successfully. - cookbook ran by andrew@bullseye |
[admin] |
19:51 |
<wm-bot2> |
Unset cloudvirt 'cloudvirt1049.eqiad.wmnet' maintenance. - cookbook ran by andrew@bullseye |
[admin] |
19:50 |
<wm-bot2> |
Safe reboot of 'cloudvirt1048.eqiad.wmnet' finished successfully. - cookbook ran by andrew@bullseye |
[admin] |
19:50 |
<wm-bot2> |
Unset cloudvirt 'cloudvirt1048.eqiad.wmnet' maintenance. - cookbook ran by andrew@bullseye |
[admin] |
19:50 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.reimage for host cp4037.ulsfo.wmnet with OS buster |
[production] |
19:49 |
<wm-bot> |
<root> Restarted to rejoin channels |
[tools.stashbot] |
19:49 |
<wm-bot2> |
Drained 'cloudvirt1051.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:41 |
<dzahn@cumin2002> |
START - Cookbook sre.gitlab.reboot-runner rolling reboot on A:codfw and (A:gitlab-runner) |
[production] |
19:39 |
<herron@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host thanos-be2001.codfw.wmnet |
[production] |
19:38 |
<ladsgroup@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1145.eqiad.wmnet with reason: Maintenance |
[production] |
19:38 |
<ladsgroup@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1145.eqiad.wmnet with reason: Maintenance |
[production] |
19:37 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Repooling after maintenance db1144:3315 (T321312)', diff saved to https://phabricator.wikimedia.org/P35727 and previous config saved to /var/cache/conftool/dbconfig/20221020-193756-ladsgroup.json |
[production] |
19:37 |
<sukhe@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp4037.ulsfo.wmnet with OS buster |
[production] |
19:36 |
<wm-bot2> |
Set cloudvirt 'cloudvirt1050.eqiad.wmnet' maintenance (downtime id: c5dbfa7b-72fc-4156-8257-af224a725b78, use this to unset). - cookbook ran by andrew@bullseye |
[admin] |
19:36 |
<wm-bot2> |
Draining 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:36 |
<wm-bot2> |
Safe rebooting 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:33 |
<wm-bot2> |
Draining 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |
19:33 |
<wm-bot2> |
Safe rebooting 'cloudvirt1050.eqiad.wmnet'. - cookbook ran by andrew@bullseye |
[admin] |