2301-2350 of 10000 results (38ms)
2017-04-12 ยง
11:02 <akosiaris> upgrade puppet across the trusty fleet to 3.8. T162462 [production]
10:34 <hashar> Upgrading Jenkins "Email Extension" plugin 2.57.1..2.57.2 and restarting Jenkins [production]
10:07 <hashar> Upgrading Jenkins "Git client" plugin 2.3.0..2.4.1 and restarting Jenkins [production]
09:58 <switchdc> (volans@neodymium) END TASK - switchdc.stages.t07_coredb_masters_readwrite(codfw, eqiad) Successfully completed [production]
09:58 <switchdc> (volans@neodymium) START TASK - switchdc.stages.t07_coredb_masters_readwrite(codfw, eqiad) set core DB masters in read-write mode [production]
09:56 <switchdc> (volans@neodymium) END TASK - switchdc.stages.t03_coredb_masters_readonly(codfw, eqiad) Failed to execute [production]
09:56 <switchdc> (volans@neodymium) START TASK - switchdc.stages.t03_coredb_masters_readonly(codfw, eqiad) set core DB masters in read-only mode [production]
09:53 <_joe_> removing the old directory of data from ocg1003 [production]
09:52 <volans> testing t03 and t07 DB-RO/RW stages of switchdc (codfw->eqiad), we are already in that situation, t03 will fail the verfication, is expected [production]
09:52 <godog> swift codfw-prod: ms-be2001 - ms-be2012 initial decom - T162785 [production]
09:47 <_joe_> remounting the new partition under /srv/deployment/ocg/output, cleaning out the old dir. Will cause a service interruption for requests to ocg1003 for a few minutes. T162780 [production]
09:42 <gehel> starting load on elastic2020 - T149006 [production]
09:41 <addshore@tin> Synchronized wmf-config/InitialiseSettings.php: [[gerrit:347819|wmgUseGettingStarted false for dewiki]] (duration: 00m 45s) [production]
09:26 <addshore@tin> Synchronized wmf-config/InitialiseSettings.php: [[gerrit:347817|WMDE Spring campaign - Add logging from WikimediaEvent]] (duration: 00m 46s) [production]
09:22 <hashar> Restarting Jenkins for Matrix related plugins updates (3) [production]
09:12 <_joe_> copying data from / to the neww partition on ocg1003 T162462 [production]
09:10 <hashar> Restarting Jenkins for plugins update (2) [production]
09:06 <_joe_> creating a LVM volume on ocg1003 [production]
09:05 <hashar> Restarting Jenkins for plugins update [production]
08:59 <addshore@tin> Synchronized php-1.29.0-wmf.19/extensions/WikimediaEvents/extension.json: [[gerrit:347815|patch1]] & [[gerrit:347774|patch2]] WMDE Spring campaign PT2/2 (duration: 00m 45s) [production]
08:58 <addshore@tin> Synchronized php-1.29.0-wmf.19/extensions/WikimediaEvents/WikimediaEventsHooks.php: [[gerrit:347815|patch1]] & [[gerrit:347774|patch2]] WMDE Spring campaign PT1/2 (duration: 00m 47s) [production]
08:52 <ema> upgrade cache_upload to linux 4.9 T162029 [production]
08:44 <gehel> reimaging elastic2020 for testing - T149006 [production]
08:24 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t09_start_maintenance(codfw, eqiad) Successfully completed [production]
08:22 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t09_start_maintenance(codfw, eqiad) Start MediaWiki maintenance in the new master DC [production]
08:14 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t08_stop_mediawiki_readonly(codfw, eqiad) Failed to execute [production]
08:14 <root@tin> Synchronized wmf-config/db-eqiad.php: Set MediaWiki in read-write mode in datacenter eqiad (duration: 00m 35s) [production]
08:13 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t08_stop_mediawiki_readonly(codfw, eqiad) Set MediaWiki in read-write mode (db_to config already merged and git pulled) [production]
08:09 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t06_redis(codfw, eqiad) Successfully completed [production]
08:09 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t06_redis(codfw, eqiad) Switch the Redis replication [production]
08:02 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t05_switch_datacenter(codfw, eqiad) Successfully completed [production]
08:02 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t05_switch_datacenter(codfw, eqiad) Switch MediaWiki configuration to the new datacenter [production]
08:00 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t09_restore_ttl(codfw, eqiad) Successfully completed [production]
07:59 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t09_restore_ttl(codfw, eqiad) Restore the TTL of all the MediaWiki discovery records [production]
07:58 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t05_switch_traffic(codfw, eqiad) Successfully completed [production]
07:55 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t05_switch_traffic(codfw, eqiad) Switch traffic flow to the appservers in the new datacenter [production]
07:55 <_joe_> resuming non-dry run tests of switchdc, all logs from switchdc by me are just tests [production]
06:57 <_joe_> the last messages are just a test and nothing was really done, as codfw is already in read-only mode right now [production]
06:57 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t02_start_mediawiki_readonly(codfw, eqiad) Failed to execute [production]
06:57 <root@tin> Synchronized wmf-config/db-codfw.php: Set MediaWiki in read-only mode in datacenter codfw (duration: 00m 23s) [production]
06:57 <switchdc> (oblivian@sarin) MediaWiki read-only period starts at: 2017-04-12 06:56:53.822926 [production]
06:56 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t02_start_mediawiki_readonly(codfw, eqiad) Set MediaWiki in read-only mode (db_from config already merged and git pulled) [production]
06:53 <switchdc> (oblivian@sarin) END TASK - switchdc.stages.t01_stop_maintenance(codfw, eqiad) Failed to execute [production]
06:53 <switchdc> (oblivian@sarin) START TASK - switchdc.stages.t01_stop_maintenance(codfw, eqiad) Stop MediaWiki maintenance in the old master DC [production]
06:50 <_joe_> testing switchover codfw => eqiad, no destructive actions will be taken [production]
06:42 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1093 - T17441 (duration: 00m 46s) [production]
06:37 <elukey> reimage mw2246.codfw.wmnet mw2152.codfw.wmnet to remove the /tmp partition (codfw videoscalers, switchover prep) [production]
06:32 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Repool db1072 - T132416 (duration: 00m 46s) [production]
06:28 <_joe_> killing long-running puppet-agent on db2058 too [production]
06:20 <_joe_> killing badly-started puppet agents on mc1010, tempdb2001,db1090, db2058, hydrogen, possibly others later [production]