1601-1650 of 10000 results (70ms)
2019-10-09 ยง
11:25 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
11:25 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:25 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
11:25 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:25 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
11:05 <Amir1> EU SWAT is done [production]
11:04 <ladsgroup@deploy1001> Synchronized wmf-config/InitialiseSettings.php: [[gerrit:541777|Put write both limit down to Q70m for item terms (T234948)]] (duration: 01m 10s) [production]
11:04 <@> helmfile [EQIAD] Ran 'sync' command on namespace 'restrouter' for release 'production' . [production]
10:58 <akosiaris@> helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . [production]
10:44 <arturo> cloudvirt1013 rebooted well [admin]
10:33 <arturo> several sgewebgrid-lighttpd nodes (9) not available because cloudvirt1013 is rebooting [tools]
10:32 <arturo> cloudvirt1013 is rebooting [admin]
10:32 <arturo> cloudvirt1012 rebooted just fine (very slow, 35 VMs) [admin]
10:21 <arturo> several worker nodes (7) not available because cloudvirt1012 is rebooting [tools]
10:20 <arturo> cloudvirt1012 is rebooting [admin]
10:19 <arturo> cloudvirt1009 rebooted just fine (very slow though) [admin]
10:18 <@> helmfile [EQIAD] Ran 'sync' command on namespace 'restrouter' for release 'production' . [production]
10:16 <@> helmfile [EQIAD] Ran 'apply' command on namespace 'restrouter' for release 'production' . [production]
10:08 <arturo> several worker nodes (6) not available because cloudvirt1009 is rebooting [tools]
10:06 <arturo> cloudvirt1009 is rebooting [admin]
10:06 <arturo> cloudvirt1008 rebooted just fine (very slow though) [admin]
09:59 <arturo> several worker nodes (5) not available because cloudvirt1008 is rebooting [tools]
09:58 <arturo> cloudvirt1008 is rebooting [admin]
09:53 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:53 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:53 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:53 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:52 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:52 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:52 <arturo> icinga downtime toolschecker, paws, etc for 2h, because cloudvirt reboots [admin]
09:48 <aborrero@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:48 <aborrero@cumin1001> START - Cookbook sre.hosts.downtime [production]
09:44 <moritzm> draining ganeti1007 for upcoming reboot (combined kernel/qemu security updates) [production]
09:39 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:39 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
09:00 <jmm@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
08:59 <jmm@cumin1001> START - Cookbook sre.hosts.decommission [production]
08:50 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1113:3316 for schema change, temporarily pool db1085 as vslow,dump', diff saved to https://phabricator.wikimedia.org/P9276 and previous config saved to /var/cache/conftool/dbconfig/20191009-085016-marostegui.json [production]
08:47 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1085 after schema change', diff saved to https://phabricator.wikimedia.org/P9275 and previous config saved to /var/cache/conftool/dbconfig/20191009-084732-marostegui.json [production]
08:39 <vgutierrez> Switch cp1082 from nginx to ats-tls - T231433 [production]
08:24 <moritzm> draining ganeti1006 for upcoming reboot (combined kernel/qemu security updates) [production]
08:18 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
08:18 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
08:18 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
08:18 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
08:14 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
08:14 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
08:01 <vgutierrez> Switch cp2011 from nginx to ats-tls - T231433 [production]
07:48 <moritzm> reduced RAM assignment for boron to 8G [production]
07:38 <vgutierrez> Switch cp3038 from nginx to ats-tls - T231433 [production]