8651-8700 of 10000 results (105ms)
2019-09-30 ยง
15:49 <moritzm> installing console-setup bugfixes from Buster 10.1 point release [production]
15:46 <cdanis@cumin1001> END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) [production]
15:46 <cdanis@cumin1001> START - Cookbook sre.ganeti.makevm [production]
15:42 <moritzm> failover Ganeti master in codfw to ganeti2001 [production]
15:23 <James_F> Zuul: Added TheSandDoctor to the CI whitelist. [releng]
15:09 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
15:09 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
14:59 <tgr> restarted docker on discuss-space for T234218 [discourse]
14:44 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
14:44 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
14:29 <moritzm> draining ganeti2007 for upcoming reboot (combined kernel/qemu security updates) [production]
14:17 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
14:17 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
14:08 <moritzm> draining ganeti2006 for upcoming reboot (combined kernel/qemu security updates) [production]
14:00 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
14:00 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
13:54 <moritzm> draining ganeti2005 for upcoming reboot (combined kernel/qemu security updates) [production]
13:49 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:49 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
13:10 <hashar> integration-castor03 : sudo mkdir -p /usr/local/lib/python2.7/dist-packages [releng]
12:53 <jbond@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
12:51 <jbond@cumin1001> START - Cookbook sre.hosts.downtime [production]
12:49 <hashar> Fixed cumin on integration project # T234203 [releng]
12:33 <kart_> Update cxserver to 2019-09-26-034732-production (T233834, T232674, T233085) [production]
12:29 <@> helmfile [EQIAD] Ran 'apply' command on namespace 'cxserver' for release 'production' . [production]
12:29 <jbond42> offline puppetmaster2002 to reimage https://gerrit.wikimedia.org/r/c/operations/puppet/+/539322 [production]
12:27 <@> helmfile [CODFW] Ran 'apply' command on namespace 'cxserver' for release 'production' . [production]
12:24 <@> helmfile [STAGING] Ran 'apply' command on namespace 'cxserver' for release 'staging' . [production]
12:00 <Urbanecm> EU SWAT done #2 [production]
12:00 <urbanecm@deploy1001> Synchronized wmf-config/throttle.php: SWAT: 3f4f242: New throttle rule for Czech wiki course (T234113) (duration: 00m 56s) [production]
11:57 <Urbanecm> Reopen EU SWAT to deploy throttle rule for October 02 (T234113) [production]
11:54 <raynor> EU SWAT finished [production]
11:54 <pmiazga@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:538296|Enable alternate mobile link for it, nl, ko wikis. (T206497)]] (duration: 00m 57s) [production]
11:27 <kartik@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit|539517|Enable CX out of beta in Tagalog and Central Bikol WPs (T233006, T233007)]] (duration: 00m 59s) [production]
11:20 <hashar> Restarting Docker on integration-agent-puppet-docker-1001 # T234197 [production]
11:08 <hashar> Restarting Docker on CI agents to clear out some docker/iptables oddity # T234197 [releng]
11:08 <hashar> Restarting Docker on CI agents to clear out some docker/iptables oddity # T234197 [production]
10:59 <hashar> Restarted Docker on integration-agent-docker-1001 T234197 [releng]
10:48 <hashar> CI outage is tracked in https://phabricator.wikimedia.org/T234197 [production]
10:42 <moritzm> draining ganeti2004 for upcoming reboot (combined kernel/qemu security updates) [production]
10:40 <hashar> CI down due to some DNS related failure on the hosts :-\ [production]
10:30 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
10:30 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
10:21 <arturo> we installed ferm in every VM by mistake. Deleting it and forcing a puppet agent run to try to go back to a clean state. [admin]
09:38 <arturo> downtime toolschecker for 24h [admin]
09:33 <arturo> force update ferm cloud-wide (in all VMs) for T153468 [admin]
09:30 <moritzm> uploading ferm 2.4.1+wmf2+deb9u1 for stretch-wikimedia, fixes AAAA lookups (T153468) [production]
09:11 <moritzm> draining ganeti2002 for upcoming reboot (combined kernel/qemu security updates) [production]
09:10 <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db2091:3314 for a schema change - T233625', diff saved to https://phabricator.wikimedia.org/P9217 and previous config saved to /var/cache/conftool/dbconfig/20190930-091043-marostegui.json [production]
08:00 <moritzm> installing e2fsprogs security updates on Stretch/Buster [production]