2051-2100 of 10000 results (47ms)
2019-10-09 ยง
14:05 <jbond@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
14:03 <jbond@cumin1001> START - Cookbook sre.hosts.downtime [production]
14:02 <moritzm> rebalancing Ganeti eqiad/row A after rolling reboots of Ganeti nodes [production]
13:48 <jbond42> reimage puppetmaster2001 [production]
13:37 <vgutierrez> repooling cp1085 - T231525 [production]
13:37 <marostegui@cumin1001> dbctl commit (dc=all): 'depool db1075', diff saved to https://phabricator.wikimedia.org/P9280 and previous config saved to /var/cache/conftool/dbconfig/20191009-133709-marostegui.json [production]
13:13 <mobrovac@deploy1001> Finished deploy [restbase/deploy@aaadd73]: Parsoid: Retry fetching stashes with undefined as the revid - T234928 (duration: 14m 26s) [production]
12:59 <mobrovac@deploy1001> Started deploy [restbase/deploy@aaadd73]: Parsoid: Retry fetching stashes with undefined as the revid - T234928 [production]
12:56 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1099:3318 for schema change T233625', diff saved to https://phabricator.wikimedia.org/P9279 and previous config saved to /var/cache/conftool/dbconfig/20191009-125641-marostegui.json [production]
12:42 <marostegui> Stop MySQL and power off db1074 for BBU replacement T231638 [production]
12:42 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1074 for BBU replacement T231638', diff saved to https://phabricator.wikimedia.org/P9278 and previous config saved to /var/cache/conftool/dbconfig/20191009-124218-marostegui.json [production]
12:41 <mobrovac@deploy1001> Finished deploy [restbase/deploy@068d2ed]: Feed: Use Wikifeeds; Parsoid: Use the ETag revid for stashing and use the same ETag for stashing and response, take #2 (duration: 08m 18s) [production]
12:40 <marostegui@cumin1001> dbctl commit (dc=all): 'Repool db1105:3312 after schema change', diff saved to https://phabricator.wikimedia.org/P9277 and previous config saved to /var/cache/conftool/dbconfig/20191009-124035-marostegui.json [production]
12:38 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
12:38 <vgutierrez@cumin1001> START - Cookbook sre.hosts.downtime [production]
12:37 <arturo> drain tools-worker-1038 to rebalance load in the k8s cluster [tools]
12:36 <moritzm> disabled puppet on DNS recursors for staged rollout of ferm NTP change [production]
12:35 <jbond42> reimage puppetmaster2002 [production]
12:35 <arturo> uncordon tools-worker-1029 (was disabled for unknown reasons) [tools]
12:33 <arturo> drain tools-worker-1010 to rebalance load [tools]
12:32 <mobrovac@deploy1001> Started deploy [restbase/deploy@068d2ed]: Feed: Use Wikifeeds; Parsoid: Use the ETag revid for stashing and use the same ETag for stashing and response, take #2 [production]
12:30 <mobrovac@deploy1001> Finished deploy [restbase/deploy@068d2ed]: Feed: Use Wikifeeds; Parsoid: Use the ETag revid for stashing and use the same ETag for stashing and response - T170455 T234928 (duration: 09m 40s) [production]
12:28 <vgutierrez> depooling cp1085 for a power drain - T231525 [production]
12:20 <mobrovac@deploy1001> Started deploy [restbase/deploy@068d2ed]: Feed: Use Wikifeeds; Parsoid: Use the ETag revid for stashing and use the same ETag for stashing and response - T170455 T234928 [production]
12:13 <moritzm> draining ganeti1001 for upcoming reboot (combined kernel/qemu security updates) [production]
12:10 <moritzm> failover Ganeti master in eqiad to ganeti1003 [production]
12:03 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
12:03 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
12:02 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
12:02 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
12:02 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
12:02 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
11:59 <arturo> re-create toolsbeta-test-proxy-01 as Debian Buster (T235059) [toolsbeta]
11:32 <moritzm> draining ganeti1008 for upcoming reboot (combined kernel/qemu security updates) [production]
11:25 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:25 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
11:25 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:25 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
11:25 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
11:25 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
11:05 <Amir1> EU SWAT is done [production]
11:04 <ladsgroup@deploy1001> Synchronized wmf-config/InitialiseSettings.php: [[gerrit:541777|Put write both limit down to Q70m for item terms (T234948)]] (duration: 01m 10s) [production]
11:04 <@> helmfile [EQIAD] Ran 'sync' command on namespace 'restrouter' for release 'production' . [production]
10:58 <akosiaris@> helmfile [EQIAD] Ran 'apply' command on namespace 'kube-system' for release 'calico-policy-controller' . [production]
10:44 <arturo> cloudvirt1013 rebooted well [admin]
10:33 <arturo> several sgewebgrid-lighttpd nodes (9) not available because cloudvirt1013 is rebooting [tools]
10:32 <arturo> cloudvirt1013 is rebooting [admin]
10:32 <arturo> cloudvirt1012 rebooted just fine (very slow, 35 VMs) [admin]
10:21 <arturo> several worker nodes (7) not available because cloudvirt1012 is rebooting [tools]
10:20 <arturo> cloudvirt1012 is rebooting [admin]