6401-6450 of 10000 results (53ms)
2017-04-19 §
14:12 <switchdc> (volans@sarin) START TASK - switchdc.stages.t02_start_mediawiki_readonly(eqiad, codfw) Set MediaWiki in read-only mode (db_from config already merged and git pulled) [production]
14:09 <switchdc> (volans@sarin) END TASK - switchdc.stages.t01_stop_maintenance(eqiad, codfw) Successfully completed [production]
14:07 <switchdc> (volans@sarin) START TASK - switchdc.stages.t01_stop_maintenance(eqiad, codfw) Stop MediaWiki maintenance in the old master DC [production]
14:06 <godog> stop swiftrepl on ms-fe1005 for codfw switchover [production]
14:06 <switchdc> (volans@sarin) END TASK - switchdc.stages.t00_reduce_ttl(eqiad, codfw) Successfully completed [production]
14:06 <switchdc> (volans@sarin) START TASK - switchdc.stages.t00_reduce_ttl(eqiad, codfw) Reduce the TTL of all the MediaWiki discovery records [production]
14:06 <switchdc> (volans@sarin) END TASK - switchdc.stages.t00_disable_puppet(eqiad, codfw) Successfully completed [production]
14:05 <switchdc> (volans@sarin) START TASK - switchdc.stages.t00_disable_puppet(eqiad, codfw) Disabling puppet on selected hosts [production]
14:00 <bblack@neodymium> conftool action : set/pooled=yes; selector: name=cp2014.codfw.wmnet,service=varnish-be [production]
13:42 <bblack@neodymium> conftool action : set/pooled=no; selector: name=cp2014.codfw.wmnet,service=varnish-be [production]
13:28 <urandom> cqlsh -f /etc/cassandra/adduser.cql, recreating user/perms (as-needed) [production]
12:38 <urandom> T163292: Starting removal of Cassandra instance restbase1018-c.eqiad.wmnet [production]
11:36 <oblivian:> Setting swift-rw in eqiad DOWN [production]
11:36 <oblivian:> Setting swift-rw in codfw UP [production]
11:36 <ema> repool varnish-be on cp3044 [production]
11:23 <godog> add naos to git-deploy term on common-infrastructure4 - T162900 [production]
11:03 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t04_cache_wipe(eqiad, codfw) Successfully completed [production]
10:57 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t04_cache_wipe(eqiad, codfw) wipe and warmup caches [production]
10:56 <_joe_> running the warmup stage in codfw for final testing [production]
10:41 <ema> depool varnish-be on cp3044 because of mailbox lag issues [production]
09:34 <moritzm> installing dbus security updates [production]
09:11 <elukey> cleaning up ocg1003's /srv/deployment/ocg/postmortem dir (root partition filled up) [production]
07:26 <hoo> Updated the sites and site_identifiers tables on all Wikidata clients for T149522. [production]
06:57 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t06_redis(codfw, eqiad) Successfully completed [production]
06:56 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t06_redis(codfw, eqiad) Switch the Redis replication [production]
06:52 <_joe_> artificially stopping slave replication on rdb2001 for a final test of the switchover redis stage [production]
03:53 <urandom> T163292: Starting removal of Cassandra instance restbase1018-b.eqiad.wmnet [production]
03:49 <mobrovac@tin> Started restart [restbase/deploy@1bfada4]: (no justification provided) [production]
03:40 <mobrovac@tin> Started restart [restbase/deploy@1bfada4]: Kick RB to pick up restbase1018 instances are gone [production]
03:32 <mobrovac@tin> Finished deploy [changeprop/deploy@a19ebf8]: Temp: Decrease the transclusion update from 400 to 200 for T163292 (duration: 00m 53s) [production]
03:31 <mobrovac@tin> Started deploy [changeprop/deploy@a19ebf8]: Temp: Decrease the transclusion update from 400 to 200 for T163292 [production]
01:58 <mutante> naos: rsyncd is of course legitimately running on a deployment server sepearate from this (unlike in other cases where we used it for syncing during migration), so this was just the one config fragment for /home and not removing the service or anything [production]
01:56 <mutante> naos: manually deleting rsyncd config remnants (puppet wouldn't know to clean up after itself) [production]
01:47 <mutante> rsyncing /home from mira to naos (T162900) [production]
01:21 <urandom> T163292: Starting removal of Cassandra instance restbase1018-a.eqiad.wmnet [production]
2017-04-18 §
23:04 <dzahn@puppetmaster1001> conftool action : set/pooled=no; selector: name=restbase1018.eqiad.wmnet [production]
23:02 <mutante> ms1001 - deleting old GlobalCert SSL cert for dumps.wm that was about to expire and is replaced by Letsencrypt, [production]
22:30 <mutante> ocg1003 gzipping ocg.log for disk space [production]
21:12 <bblack@neodymium> conftool action : set/pooled=yes; selector: name=cp2002.codfw.wmnet,service=varnish-be [production]
20:36 <bblack@neodymium> conftool action : set/pooled=no; selector: name=cp2002.codfw.wmnet,service=varnish-be [production]
17:26 <mobrovac@tin> Finished deploy [restbase/deploy@1bfada4]: Blacklist all user pages on commons (duration: 07m 12s) [production]
17:26 <ssastry@tin> Finished deploy [parsoid/deploy@b067328]: Deploying Parsoid to bump heap limits to 900m (from 600m) (duration: 06m 25s) [production]
17:19 <ssastry@tin> Started deploy [parsoid/deploy@b067328]: Deploying Parsoid to bump heap limits to 900m (from 600m) [production]
17:19 <mobrovac@tin> Started deploy [restbase/deploy@1bfada4]: Blacklist all user pages on commons [production]
17:12 <XenoRyet> updated tools from a8b8d7242799b61dd2a48ef4e804164cd1818bc9 to a1e9342e093a85032255fc1d9904db7df13680b7 [production]
17:09 <elukey> restart nutcracker in codfw (profile::mediawiki::nutcracker) to make sure that all the daemons are running with the latest config [production]
16:26 <bblack> completed Traffic-layer portions of codfw switchover ( https://wikitech.wikimedia.org/wiki/Switch_Datacenter#Switchover_2 ) [production]
16:21 <bblack> starting Traffic-layer portions of codfw switchover ( https://wikitech.wikimedia.org/wiki/Switch_Datacenter#Switchover_2 ) [production]
16:15 <jynus> reimporting some rows to dbstore1002 on jawiki and ruwiki T160509 [production]
16:12 <godog> reboot tin to fix cpu mhz issue and check bios settings - T163158 [production]