2022-07-18
20:02 <wm-bot2> Draining 'cloudvirt1022.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
20:02 <wm-bot2> Safe rebooting 'cloudvirt1022.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
20:02 <wm-bot2> Draining 'cloudvirt1021.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
20:02 <wm-bot2> Safe rebooting 'cloudvirt1021.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
20:01 <wm-bot2> Safe reboot of 'cloudvirt1017.eqiad.wmnet' finished successfully. - cookbook ran by andrew@buster [admin]
20:01 <wm-bot2> Unset cloudvirt 'cloudvirt1017.eqiad.wmnet' maintenance. - cookbook ran by andrew@buster [admin]
19:57 <wm-bot2> Drained 'cloudvirt1017.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
19:45 <ryankemper@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2066.codfw.wmnet with OS bullseye [production]
19:43 <dancy> Upgrading scap to 4.10.0-1+0~20220718175214.344~1.gbpe518a1 in beta cluster [releng]
19:42 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
19:41 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
19:41 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
19:40 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
19:37 <wm-bot2> Set cloudvirt 'cloudvirt1017.eqiad.wmnet' maintenance (downtime id: b0904345-9aa3-4202-b992-78644141517c, use this to unset). - cookbook ran by andrew@buster [admin]
19:36 <wm-bot2> Draining 'cloudvirt1017.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
19:36 <wm-bot2> Safe rebooting 'cloudvirt1017.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
19:31 <wm-bot2> Draining 'cloudvirt1017.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
19:31 <wm-bot2> Safe rebooting 'cloudvirt1017.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
19:30 <wm-bot2> Draining 'cloudvirt1017.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
19:30 <wm-bot2> Safe rebooting 'cloudvirt1017.eqiad.wmnet'. - cookbook ran by andrew@buster [admin]
19:04 <ryankemper@cumin1001> START - Cookbook sre.hosts.reimage for host elastic2066.codfw.wmnet with OS bullseye [production]
19:02 <ryankemper@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2066.codfw.wmnet with OS bullseye [production]
18:41 <marostegui@cumin1001> dbctl commit (dc=all): 'db1101:3318 (re)pooling @ 100%: After maintenance', diff saved to https://phabricator.wikimedia.org/P31385 and previous config saved to /var/cache/conftool/dbconfig/20220718-184146-root.json [production]
18:36 <ryankemper@cumin1001> START - Cookbook sre.hosts.reimage for host elastic2066.codfw.wmnet with OS bullseye [production]
18:35 <ryankemper@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2066.codfw.wmnet with OS bullseye [production]
18:26 <marostegui@cumin1001> dbctl commit (dc=all): 'db1101:3318 (re)pooling @ 75%: After maintenance', diff saved to https://phabricator.wikimedia.org/P31384 and previous config saved to /var/cache/conftool/dbconfig/20220718-182642-root.json [production]
18:17 <ryankemper@cumin1001> START - Cookbook sre.hosts.reimage for host elastic2066.codfw.wmnet with OS bullseye [production]
18:16 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2065.codfw.wmnet with OS bullseye [production]
18:11 <marostegui@cumin1001> dbctl commit (dc=all): 'db1101:3318 (re)pooling @ 50%: After maintenance', diff saved to https://phabricator.wikimedia.org/P31382 and previous config saved to /var/cache/conftool/dbconfig/20220718-181138-root.json [production]
18:02 <ryankemper@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2065.codfw.wmnet with reason: host reimage [production]
17:57 <ryankemper@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2065.codfw.wmnet with reason: host reimage [production]
17:56 <marostegui@cumin1001> dbctl commit (dc=all): 'db1101:3318 (re)pooling @ 25%: After maintenance', diff saved to https://phabricator.wikimedia.org/P31381 and previous config saved to /var/cache/conftool/dbconfig/20220718-175634-root.json [production]
17:43 <ryankemper@cumin1001> START - Cookbook sre.hosts.reimage for host elastic2065.codfw.wmnet with OS bullseye [production]
17:41 <marostegui@cumin1001> dbctl commit (dc=all): 'db1101:3318 (re)pooling @ 10%: After maintenance', diff saved to https://phabricator.wikimedia.org/P31380 and previous config saved to /var/cache/conftool/dbconfig/20220718-174130-root.json [production]
17:26 <marostegui@cumin1001> dbctl commit (dc=all): 'db1101:3318 (re)pooling @ 5%: After maintenance', diff saved to https://phabricator.wikimedia.org/P31379 and previous config saved to /var/cache/conftool/dbconfig/20220718-172626-root.json [production]
17:11 <marostegui@cumin1001> dbctl commit (dc=all): 'db1101:3318 (re)pooling @ 2%: After maintenance', diff saved to https://phabricator.wikimedia.org/P31378 and previous config saved to /var/cache/conftool/dbconfig/20220718-171122-root.json [production]
16:56 <marostegui@cumin1001> dbctl commit (dc=all): 'db1101:3318 (re)pooling @ 1%: After maintenance', diff saved to https://phabricator.wikimedia.org/P31377 and previous config saved to /var/cache/conftool/dbconfig/20220718-165617-root.json [production]
16:54 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1101:3318 (T313070)', diff saved to https://phabricator.wikimedia.org/P31376 and previous config saved to /var/cache/conftool/dbconfig/20220718-165455-marostegui.json [production]
16:53 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1101:3318 (T313070)', diff saved to https://phabricator.wikimedia.org/P31375 and previous config saved to /var/cache/conftool/dbconfig/20220718-165349-marostegui.json [production]
16:53 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1101.eqiad.wmnet with reason: Maintenance [production]
16:53 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1101.eqiad.wmnet with reason: Maintenance [production]
16:53 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1126 (T313070)', diff saved to https://phabricator.wikimedia.org/P31374 and previous config saved to /var/cache/conftool/dbconfig/20220718-165329-marostegui.json [production]
16:38 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1126', diff saved to https://phabricator.wikimedia.org/P31373 and previous config saved to /var/cache/conftool/dbconfig/20220718-163824-marostegui.json [production]
16:23 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1126', diff saved to https://phabricator.wikimedia.org/P31372 and previous config saved to /var/cache/conftool/dbconfig/20220718-162319-marostegui.json [production]
16:12 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
16:10 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]
16:09 <mwdebug-deploy@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply [production]
16:08 <marostegui@cumin1001> dbctl commit (dc=all): 'Repooling after maintenance db1126 (T313070)', diff saved to https://phabricator.wikimedia.org/P31371 and previous config saved to /var/cache/conftool/dbconfig/20220718-160813-marostegui.json [production]
16:07 <mwdebug-deploy@deploy1002> helmfile [eqiad] START helmfile.d/services/mwdebug: apply [production]
16:07 <marostegui@cumin1001> dbctl commit (dc=all): 'Depooling db1126 (T313070)', diff saved to https://phabricator.wikimedia.org/P31370 and previous config saved to /var/cache/conftool/dbconfig/20220718-160708-marostegui.json [production]