5001-5050 of 10000 results (32ms)
2021-03-04 ยง
19:11 <jforrester@deploy1002> Synchronized php-1.36.0-wmf.33/extensions/FlaggedRevs/frontend/specialpages/reports/ProblemChanges.php: T276386 Fix fatal calls to getConfig (duration: 01m 12s) [production]
19:06 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:59 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]
18:36 <andrewbogott> rebooting cloudmetrics1002; the console is hanging [admin]
18:26 <jynus@cumin2001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on backup2003.codfw.wmnet with reason: REIMAGE [production]
18:25 <jynus@cumin2001> START - Cookbook sre.hosts.downtime for 2:00:00 on backup2003.codfw.wmnet with reason: REIMAGE [production]
17:39 <mutante> [deneb:~] $ sudo systemctl start cowbuilder_update_jessie-amd64 [production]
17:25 <cmjohnson@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:20 <cmjohnson@cumin1001> START - Cookbook sre.dns.netbox [production]
17:11 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on deploy1001.eqiad.wmnet with reason: decom [production]
17:11 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 4 days, 0:00:00 on deploy1001.eqiad.wmnet with reason: decom [production]
17:05 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1032.eqiad.wmnet [production]
16:59 <arturo> rebooting cloudvirt1032 for T275753 [admin]
16:59 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1032.eqiad.wmnet [production]
16:56 <tarrow@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]
16:56 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1069.eqiad.wmnet with reason: REIMAGE [production]
16:54 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1068.eqiad.wmnet with reason: REIMAGE [production]
16:54 <pt1979@cumin2001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:54 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1069.eqiad.wmnet with reason: REIMAGE [production]
16:53 <tarrow@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
16:52 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1068.eqiad.wmnet with reason: REIMAGE [production]
16:47 <pt1979@cumin2001> START - Cookbook sre.dns.netbox [production]
16:46 <bd808> Restarting docker process. Not sure if crash or another problem. [toolhub]
16:39 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1031.eqiad.wmnet [production]
16:34 <arturo> draining cloudvirt1032 for T275753 [admin]
16:33 <arturo> rebooting cloudvirt1031 for T275753 [admin]
16:33 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1031.eqiad.wmnet [production]
16:27 <elukey> drain + reimage analytics106[8,9] to Debian Buster (one is a journalnode) [analytics]
16:23 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
16:20 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]
16:13 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1026.eqiad.wmnet [production]
16:12 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2145', diff saved to https://phabricator.wikimedia.org/P14635 and previous config saved to /var/cache/conftool/dbconfig/20210304-161226-marostegui.json [production]
16:11 <arturo> draining cloudvirt1031 for T275753 [admin]
16:09 <arturo> rebooting cloudvirt1026 for T275753 [admin]
16:08 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1026.eqiad.wmnet [production]
16:02 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1025.eqiad.wmnet [production]
15:57 <arturo> draining cloudvirt1026 for T275753 [admin]
15:55 <arturo> rebooting cloudvirt1025 for T275753 [admin]
15:55 <aborrero@cumin1001> START - Cookbook sre.hosts.reboot-single for host cloudvirt1025.eqiad.wmnet [production]
15:52 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'test' . [production]
15:47 <hashar> Refreshing jobs based on releng/tox-buster to use latest image. That brings in tox installed with python3 instead of python2 # T276384 [releng]
15:42 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1024.eqiad.wmnet [production]
15:41 <arturo> draining cloudvirt1025 for T275753 [admin]
15:28 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
15:28 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1067.eqiad.wmnet with reason: REIMAGE [production]
15:26 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on analytics1066.eqiad.wmnet with reason: REIMAGE [production]
15:26 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1067.eqiad.wmnet with reason: REIMAGE [production]
15:24 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on analytics1066.eqiad.wmnet with reason: REIMAGE [production]
15:21 <jakob@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'termbox' for release 'staging' . [production]
15:12 <elukey> drain + reimage analytics106[6,7] to Debian Buster [production]