6551-6600 of 10000 results (30ms)
2022-01-14 ยง
15:00 <bblack> silenced site=drmrs in alertmanager, I think [production]
14:59 <hashar> Starting VM integration-agent-docker-1022 which was in shutdown state since December and is Bullseye based # T290783 [releng]
13:49 <hashar> Restarting all CI Docker agents via Horizon to apply new flavor settings T265615 T299211 [releng]
13:31 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc2011.codfw.wmnet with OS bullseye [production]
13:20 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host restbase2009.codfw.wmnet with OS buster [production]
12:59 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host pc2011.codfw.wmnet with OS bullseye [production]
12:53 <hnowlan@cumin1001> START - Cookbook sre.hosts.reimage for host restbase2009.codfw.wmnet with OS buster [production]
12:51 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host restbase2009.codfw.wmnet with OS buster [production]
12:49 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1024.eqiad.wmnet with OS buster [production]
12:22 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti1024.eqiad.wmnet with OS buster [production]
12:20 <hnowlan@cumin1001> START - Cookbook sre.hosts.reimage for host restbase2009.codfw.wmnet with OS buster [production]
12:18 <hnowlan@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host restbase2009.codfw.wmnet with OS buster [production]
11:56 <wm-bot> removing grid node toolsbeta-sgewebgen-10-5 (depool/drain, remove VM and reconfigure grid) - cookbook ran by arturo@nostromo [toolsbeta]
11:51 <hnowlan@cumin1001> START - Cookbook sre.hosts.reimage for host restbase2009.codfw.wmnet with OS buster [production]
11:49 <wm-bot> removing grid node toolsbeta-sgeexec-10-5 (depool/drain, remove VM and reconfigure grid) - cookbook ran by arturo@nostromo [toolsbeta]
11:49 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on restbase2009.codfw.wmnet with reason: not in restbase cluster, used for testing [production]
11:48 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on restbase2009.codfw.wmnet with reason: not in restbase cluster, used for testing [production]
11:45 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ganeti1023.eqiad.wmnet with OS buster [production]
11:18 <jmm@cumin2002> START - Cookbook sre.hosts.reimage for host ganeti1023.eqiad.wmnet with OS buster [production]
11:01 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM archiva1002.wikimedia.org [production]
11:00 <moritzm> systemctl reset-failed ifup@ens5.service on archiva1002 T273026 [production]
10:56 <moritzm> rebooting archiva1002 (running archiva.wikimedia.org) [production]
10:56 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM archiva1002.wikimedia.org [production]
10:55 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2051.codfw.wmnet with OS stretch [production]
10:50 <moritzm> systemctl reset-failed ifup@ens5.service on an-test-ui1001 T273026 [production]
10:50 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM an-test-ui1001.eqiad.wmnet [production]
10:42 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM an-test-ui1001.eqiad.wmnet [production]
10:21 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM an-test-presto1001.eqiad.wmnet [production]
10:17 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM an-test-presto1001.eqiad.wmnet [production]
10:07 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM matomo1002.eqiad.wmnet [production]
10:05 <moritzm> rebooting matomo1002 (running piwik.wikimedia.org) [production]
10:04 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM matomo1002.eqiad.wmnet [production]
09:59 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM an-test-druid1001.eqiad.wmnet [production]
09:57 <wm-bot> removing grid node toolsbeta-sgeexec-10-6.toolsbeta.eqiad1.wikimedia.cloud (depool/drain, remove VM and reconfigure grid) - cookbook ran by arturo@nostromo [toolsbeta]
09:55 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM an-test-druid1001.eqiad.wmnet [production]
09:53 <wm-bot> removing grid node toolsbeta-sgeexec-10-6.toolsbeta.eqiad1.wikimedia.org (depool/drain, remove VM and reconfigure grid) - cookbook ran by arturo@nostromo [toolsbeta]
09:44 <wm-bot> removing grid node toolsbeta-sgeweblight-10-2 (depool/drain, remove VM and reconfigure grid) - cookbook ran by arturo@nostromo [toolsbeta]
09:38 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM apt1001.wikimedia.org [production]
09:35 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM apt1001.wikimedia.org [production]
09:32 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM install1003.wikimedia.org [production]
09:28 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM install1003.wikimedia.org [production]
09:22 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM an-test-client1001.eqiad.wmnet [production]
09:19 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM an-test-client1001.eqiad.wmnet [production]
09:11 <marostegui> Move pc1014 from pc1 to pc2 T299046 [production]
09:05 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc2013.codfw.wmnet with OS bullseye [production]
09:03 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM an-tool1009.eqiad.wmnet [production]
09:01 <moritzm> rebooting an-tool1009 (running hue.wikimedia.org) [production]
09:01 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM an-tool1009.eqiad.wmnet [production]
09:00 <moritzm> systemctl reset-failed ifup@ens5.service on an-tool1005 T273026 [production]
09:00 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM an-tool1008.eqiad.wmnet [production]