9901-9950 of 10000 results (61ms)
2021-03-04 ยง
16:53 <tarrow@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
16:52 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1068.eqiad.wmnet with reason: REIMAGE [production]
16:47 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
16:46 <bd808> Restarting docker process. Not sure if crash or another problem. [toolhub]
16:39 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1031.eqiad.wmnet [production]
16:34 <arturo> draining cloudvirt1032 for T275753 [admin]
16:33 <arturo> rebooting cloudvirt1031 for T275753 [admin]
16:33 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1031.eqiad.wmnet [production]
16:27 <elukey> drain + reimage analytics106[8,9] to Debian Buster (one is a journalnode) [analytics]
16:23 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
16:20 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]
16:13 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1026.eqiad.wmnet [production]
16:12 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2145', diff saved to https://phabricator.wikimedia.org/P14635 and previous config saved to /var/cache/conftool/dbconfig/20210304-161226-marostegui.json [production]
16:11 <arturo> draining cloudvirt1031 for T275753 [admin]
16:09 <arturo> rebooting cloudvirt1026 for T275753 [admin]
16:08 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1026.eqiad.wmnet [production]
16:02 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1025.eqiad.wmnet [production]
15:57 <arturo> draining cloudvirt1026 for T275753 [admin]
15:55 <arturo> rebooting cloudvirt1025 for T275753 [admin]
15:55 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1025.eqiad.wmnet [production]
15:52 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]
15:47 <hashar> Refreshing jobs based on releng/tox-buster to use latest image. That brings in tox installed with python3 instead of python2 # T276384 [releng]
15:42 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1024.eqiad.wmnet [production]
15:41 <arturo> draining cloudvirt1025 for T275753 [admin]
15:28 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
15:28 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1067.eqiad.wmnet with reason: REIMAGE [production]
15:26 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1066.eqiad.wmnet with reason: REIMAGE [production]
15:26 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1067.eqiad.wmnet with reason: REIMAGE [production]
15:24 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1066.eqiad.wmnet with reason: REIMAGE [production]
15:21 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
15:12 <elukey> drain + reimage analytics106[6,7] to Debian Buster [production]
15:12 <elukey> drain + reimage analytics106[6,7] to Debian Buster [analytics]
15:12 <arturo> rebooting cloudvirt1024 for T275753 [admin]
15:11 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1024.eqiad.wmnet [production]
15:00 <Majavah> remove graphoid role from deploymenr-sca[01-02] ref T276102 and it being decomissioned in T242855 [releng]
14:40 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1065.eqiad.wmnet with reason: REIMAGE [production]
14:38 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1065.eqiad.wmnet with reason: REIMAGE [production]
14:38 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:35 <jayme@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:34 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
14:30 <jayme@cumin1001> START - Cookbook sre.dns.netbox [production]
14:23 <jayme@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts neon.eqiad.wmnet [production]
14:21 <elukey> drain + reimage analytics1065 to Debian Buster [analytics]
14:18 <jayme@cumin1001> START - Cookbook sre.hosts.decommission for hosts neon.eqiad.wmnet [production]
14:15 <jayme@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts neon.eqiad.wmnet [production]
14:15 <jayme@cumin1001> START - Cookbook sre.hosts.decommission for hosts neon.eqiad.wmnet [production]
14:04 <liw@deploy1002> rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.33 [production]
13:55 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1064.eqiad.wmnet with reason: REIMAGE [production]
13:53 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1063.eqiad.wmnet with reason: REIMAGE [production]
13:52 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1064.eqiad.wmnet with reason: REIMAGE [production]