7401-7450 of 10000 results (81ms)
2019-06-06 ยง
13:44 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
13:44 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
13:36 <gehel@cumin1001> START - Cookbook sre.wdqs.restart-wdqs [production]
13:35 <zfilipin@deploy1001> rebuilt and synchronized wikiversions files: all wikis to 1.34.0-wmf.8 [production]
13:35 <gehel@cumin1001> END (FAIL) - Cookbook sre.wdqs.restart-wdqs (exit_code=99) [production]
13:35 <gehel@cumin1001> START - Cookbook sre.wdqs.restart-wdqs [production]
13:34 <gehel@cumin1001> END (PASS) - Cookbook sre.wdqs.restart-wdqs (exit_code=0) [production]
13:33 <gehel@cumin1001> START - Cookbook sre.wdqs.restart-wdqs [production]
13:32 <gehel@cumin1001> END (PASS) - Cookbook sre.wdqs.restart-wdqs (exit_code=0) [production]
13:31 <gehel@cumin1001> START - Cookbook sre.wdqs.restart-wdqs [production]
12:44 <jbond42> reimage neodymium [production]
12:23 <_joe_> running puppet, restarting php-fpm on the canaries to pick up the new opcache size [production]
12:11 <ema> cp1075: repool with varnish 5.1.3-1wm10 T224694 [production]
12:10 <elukey> restart mcrouter on mw2235 [production]
12:05 <Lucas_WMDE> EU SWAT done [production]
12:04 <lucaswerkmeister-wmde@deploy1001> Synchronized wmf-config/: SWAT: [[gerrit:514700|Revert "Specify $wgWBRepoSettings['conceptBaseUri']" (duration: 00m 56s) [production]
12:00 <ema> cp1075: upgrade varnish to 5.1.3-1wm10 T224694 [production]
11:55 <lucaswerkmeister-wmde@deploy1001> scap failed: average error rate on 8/11 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org/goto/db09a36be5ed3e81155041f7d46ad040 for details) [production]
11:48 <Urbanecm> running mwscript namespaceDupes.php --wiki=thwikisource --fix (T216322) [production]
11:47 <Urbanecm> running mwscript namespaceDupes.php --wiki=thwikibooks --fix for T216322 [production]
11:46 <urbanecm@deploy1001> Synchronized wmf-config/InitialiseSettings.php: [[:gerrit:514678|Add new namespaces for several Thai projects]] (T216322) (duration: 00m 54s) [production]
11:38 <lucaswerkmeister-wmde@deploy1001> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:514534|Remove unused config variable wgWikibaseEnableSenses]] (duration: 00m 55s) [production]
11:23 <gehel@cumin2001> END (PASS) - Cookbook sre.postgresql.postgres-init (exit_code=0) [production]
11:22 <lucaswerkmeister-wmde@deploy1001> Synchronized php-1.34.0-wmf.8/extensions/CirrusSearch/: SWAT: [[gerrit:514566|Fix event validation error for cirrussearch-request event]] (duration: 01m 06s) [production]
10:55 <elukey> restart mcrouter on mw2163 (codfw mcrouter proxy) [production]
10:43 <mobrovac@deploy1001> scap-helm mathoid finished [production]
10:43 <mobrovac@deploy1001> scap-helm mathoid cluster codfw completed [production]
10:43 <mobrovac@deploy1001> scap-helm mathoid cluster eqiad completed [production]
10:43 <mobrovac@deploy1001> scap-helm mathoid upgrade production stable/mathoid -f mathoid-values.yaml [namespace: mathoid, clusters: eqiad,codfw] [production]
10:30 <ema> varnish 5.1.3-1wm10 uploaded to stretch-wikimedia T224694 [production]
10:19 <elukey> rolling restart of mcrouter on mw1* hosts to pick up config change (batch of 5 hosts, depool/run-puppet/pool) [production]
10:12 <elukey> disable puppet on mw1* and mw[2163,2235,2255,2271] as prep step for mcrouter config deploy [production]
10:10 <fsero> rollbacked last deployment of mathoid to revision 16 [production]
09:59 <mobrovac@deploy1001> scap-helm mathoid finished [production]
09:59 <mobrovac@deploy1001> scap-helm mathoid cluster codfw completed [production]
09:59 <mobrovac@deploy1001> scap-helm mathoid cluster eqiad completed [production]
09:59 <mobrovac@deploy1001> scap-helm mathoid upgrade production stable/mathoid -f mathoid-values.yaml [namespace: mathoid, clusters: eqiad,codfw] [production]
09:31 <moritzm> rebooting mwdebug2002 for some tests [production]
09:31 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) [production]
09:30 <jmm@cumin2001> START - Cookbook sre.hosts.downtime [production]
09:28 <moritzm> updating qemu on ganeti2004 for some tests [production]
09:24 <gehel@cumin2001> START - Cookbook sre.postgresql.postgres-init [production]
08:38 <marostegui> Stop MySQL on db1117:3322 - this will trigger haproxy alerts - T222682 [production]
07:35 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Repool db1121 after upgrade T224852 (duration: 00m 53s) [production]
07:20 <marostegui> Stop MySQL on db1121 for upgrade, this will generate lag on labs hosts for s6 - T224852 [production]
07:16 <marostegui@deploy1001> Synchronized wmf-config/db-codfw.php: Promote db2046 to s6 master as db2039 will be decommissioned T221533 (duration: 00m 55s) [production]
06:31 <marostegui> Start topology changes on s6 codfw to promote db2046 as master - T221533 [production]
06:23 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Depool db1121 for upgrade T224852 (duration: 00m 55s) [production]
06:15 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: Fully repool db1091 after getting its BBU replaced (duration: 00m 54s) [production]
06:01 <marostegui@deploy1001> Synchronized wmf-config/db-eqiad.php: More traffic to db1091 after getting its BBU replaced (duration: 01m 01s) [production]