2019-09-30
16:34 |
<ppchelko@deploy1001> |
Finished deploy [cpjobqueue/deploy@79db711]: Take job domain into account for deduplication T234226 (duration: 01m 17s) |
[production] |
16:32 |
<krinkle@deploy1001> |
Synchronized wmf-config/abusefilter.php: 0aa4b4b5ab9a2e4 (duration: 00m 57s) |
[production] |
16:32 |
<ppchelko@deploy1001> |
Started deploy [cpjobqueue/deploy@79db711]: Take job domain into account for deduplication T234226 |
[production] |
16:25 |
<cdanis@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
16:25 |
<cdanis@cumin1001> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) |
[production] |
16:25 |
<cdanis@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
15:49 |
<moritzm> |
installing console-setup bugfixes from Buster 10.1 point release |
[production] |
15:46 |
<cdanis@cumin1001> |
END (FAIL) - Cookbook sre.ganeti.makevm (exit_code=99) |
[production] |
15:46 |
<cdanis@cumin1001> |
START - Cookbook sre.ganeti.makevm |
[production] |
15:42 |
<moritzm> |
failover Ganeti master in codfw to ganeti2001 |
[production] |
15:09 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
15:09 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:44 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:44 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:29 |
<moritzm> |
draining ganeti2007 for upcoming reboot (combined kernel/qemu security updates) |
[production] |
14:17 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:17 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
14:08 |
<moritzm> |
draining ganeti2006 for upcoming reboot (combined kernel/qemu security updates) |
[production] |
14:00 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
14:00 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:54 |
<moritzm> |
draining ganeti2005 for upcoming reboot (combined kernel/qemu security updates) |
[production] |
13:49 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:49 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:53 |
<jbond@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
12:51 |
<jbond@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
12:33 |
<kart_> |
Update cxserver to 2019-09-26-034732-production (T233834, T232674, T233085) |
[production] |
12:29 |
<@> |
helmfile [EQIAD] Ran 'apply' command on namespace 'cxserver' for release 'production' . |
[production] |
12:29 |
<jbond42> |
offline puppetmaster2002 to reimage https://gerrit.wikimedia.org/r/c/operations/puppet/+/539322 |
[production] |
12:27 |
<@> |
helmfile [CODFW] Ran 'apply' command on namespace 'cxserver' for release 'production' . |
[production] |
12:24 |
<@> |
helmfile [STAGING] Ran 'apply' command on namespace 'cxserver' for release 'staging' . |
[production] |
12:00 |
<Urbanecm> |
EU SWAT done #2 |
[production] |
12:00 |
<urbanecm@deploy1001> |
Synchronized wmf-config/throttle.php: SWAT: 3f4f242: New throttle rule for Czech wiki course (T234113) (duration: 00m 56s) |
[production] |
11:57 |
<Urbanecm> |
Reopen EU SWAT to deploy throttle rule for October 02 (T234113) |
[production] |
11:54 |
<raynor> |
EU SWAT finished |
[production] |
11:54 |
<pmiazga@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:538296|Enable alternate mobile link for it, nl, ko wikis. (T206497)]] (duration: 00m 57s) |
[production] |
11:27 |
<kartik@deploy1001> |
Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:539517|Enable CX out of beta in Tagalog and Central Bikol WPs (T233006, T233007)]] (duration: 00m 59s) |
[production] |
11:20 |
<hashar> |
Restarting Docker on integration-agent-puppet-docker-1001 # T234197 |
[production] |
11:08 |
<hashar> |
Restarting Docker on CI agents to clear out some docker/iptables oddity # T234197 |
[production] |
10:48 |
<hashar> |
CI outage is tracked in https://phabricator.wikimedia.org/T234197 |
[production] |
10:42 |
<moritzm> |
draining ganeti2004 for upcoming reboot (combined kernel/qemu security updates) |
[production] |
10:40 |
<hashar> |
CI down due to some DNS related failure on the hosts :-\ |
[production] |
10:30 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:30 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:30 |
<moritzm> |
uploading ferm 2.4.1+wmf2+deb9u1 for stretch-wikimedia, fixes AAAA lookups (T153468) |
[production] |
09:11 |
<moritzm> |
draining ganeti2002 for upcoming reboot (combined kernel/qemu security updates) |
[production] |
09:10 |
<marostegui@cumin1001> |
dbctl commit (dc=all): 'Depool db2091:3314 for a schema change - T233625', diff saved to https://phabricator.wikimedia.org/P9217 and previous config saved to /var/cache/conftool/dbconfig/20190930-091043-marostegui.json |
[production] |
08:00 |
<moritzm> |
installing e2fsprogs security updates on Stretch/Buster |
[production] |
07:56 |
<marostegui> |
Stop dbstore1003:3311 for troubleshooting |
[production] |
06:47 |
<moritzm> |
installing exim security updates on buster |
[production] |