3601-3650 of 10000 results (38ms)
2021-03-04 ยง
18:59 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]
18:26 <jynus@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on backup2003.codfw.wmnet with reason: REIMAGE [production]
18:25 <jynus@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on backup2003.codfw.wmnet with reason: REIMAGE [production]
17:39 <mutante> [deneb:~] $ sudo systemctl start cowbuilder_update_jessie-amd64 [production]
17:25 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:20 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]
17:11 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on deploy1001.eqiad.wmnet with reason: decom [production]
17:11 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on deploy1001.eqiad.wmnet with reason: decom [production]
17:05 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1032.eqiad.wmnet [production]
16:59 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1032.eqiad.wmnet [production]
16:56 <tarrow@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]
16:56 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1069.eqiad.wmnet with reason: REIMAGE [production]
16:54 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1068.eqiad.wmnet with reason: REIMAGE [production]
16:54 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:54 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1069.eqiad.wmnet with reason: REIMAGE [production]
16:53 <tarrow@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
16:52 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1068.eqiad.wmnet with reason: REIMAGE [production]
16:47 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
16:39 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1031.eqiad.wmnet [production]
16:33 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1031.eqiad.wmnet [production]
16:23 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
16:20 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]
16:13 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1026.eqiad.wmnet [production]
16:12 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2145', diff saved to https://phabricator.wikimedia.org/P14635 and previous config saved to /var/cache/conftool/dbconfig/20210304-161226-marostegui.json [production]
16:08 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1026.eqiad.wmnet [production]
16:02 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1025.eqiad.wmnet [production]
15:55 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1025.eqiad.wmnet [production]
15:52 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]
15:42 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1024.eqiad.wmnet [production]
15:28 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
15:28 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1067.eqiad.wmnet with reason: REIMAGE [production]
15:26 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1066.eqiad.wmnet with reason: REIMAGE [production]
15:26 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1067.eqiad.wmnet with reason: REIMAGE [production]
15:24 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1066.eqiad.wmnet with reason: REIMAGE [production]
15:21 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
15:12 <elukey> drain + reimage analytics106[6,7] to Debian Buster [production]
15:11 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1024.eqiad.wmnet [production]
14:40 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1065.eqiad.wmnet with reason: REIMAGE [production]
14:38 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1065.eqiad.wmnet with reason: REIMAGE [production]
14:38 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:35 <jayme@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:34 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
14:30 <jayme@cumin1001> START - Cookbook sre.dns.netbox [production]
14:23 <jayme@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts neon.eqiad.wmnet [production]
14:18 <jayme@cumin1001> START - Cookbook sre.hosts.decommission for hosts neon.eqiad.wmnet [production]
14:15 <jayme@cumin1001> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=99) for hosts neon.eqiad.wmnet [production]
14:15 <jayme@cumin1001> START - Cookbook sre.hosts.decommission for hosts neon.eqiad.wmnet [production]
14:04 <liw@deploy1002> rebuilt and synchronized wikiversions files: all wikis to 1.36.0-wmf.33 [production]
13:55 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1064.eqiad.wmnet with reason: REIMAGE [production]
13:53 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1063.eqiad.wmnet with reason: REIMAGE [production]