production SAL

3651-3700 of 10000 results (40ms)

2016-06-29 §
12:32	<moritzm>	powercycling elastic1010, stuck on reboot	[production]
12:11	<moritzm>	powercycling mw1260, stuck on reboot	[production]
11:38	<jynus>	halfway moving otrs backups from dbstore1001 to es2001	[production]
11:27	<gehel>	powercycling elastic1009 - stuck in reboot	[production]
11:11	<moritzm>	powercycling mw1223, stuck on reboot	[production]
11:09	<gehel>	deleting broken dewiki_titlesuggest index from codfw (T138811)	[production]
10:31	<elukey>	rebooting analytics100[12] (Hadoop Yarn/HDFS master and standby) - One at the time forcing failover manually with daemon restarts	[production]
09:54	<moritzm>	powercycling mw1163, stuck on reboot	[production]
09:23	<gehel>	banning elastic1001 to 1016 from cluster to prepare their decommissioning (T138329)	[production]
09:20	<ema>	upgrading diamond to 3.5-6 (T138758)	[production]
09:01	<elukey>	rebooting analytics1028->1057 for kernel upgrades (Hadoop worker nodes)	[production]
08:55	<moritzm>	powercycling mw1111, stuck on reboot	[production]
08:44	<elukey>	puppet stopped on analytics1027 to prevent Camus job to run (prep step for Hadoop kernel upgrades)	[production]
08:40	<moritzm>	powercycling mw1108, stuck on reboot	[production]
08:12	<moritzm>	powercycling mw1099, stuck on reboot	[production]
08:12	<moritzm>	powercycling mw1097, stuck on reboot	[production]
08:05	<moritzm>	powercycling mw1092, stuck on reboot	[production]
07:47	<moritzm>	rolling reboot of appservers in eqiad for kernel security update	[production]
07:16	<moritzm>	powercycling snapshot1002, reboot stuck	[production]
07:11	<moritzm>	powercycling snapshot1001, reboot stuck	[production]
06:58	<moritzm>	rebooting most snapshot hosts for kernel security update	[production]
03:28	<krinkle@tin>	Synchronized wmf-config/InitialiseSettings.php: test2wiki (duration: 00m 33s)	[production]
02:56	<l10nupdate@tin>	ResourceLoader cache refresh completed at Wed Jun 29 02:56:32 UTC 2016 (duration 6m 30s)	[production]
02:50	<mwdeploy@tin>	scap sync-l10n completed (1.28.0-wmf.8) (duration: 04m 48s)	[production]
02:30	<chasemp>	labstore1004 is replicating NFS/DRBD shares to labstore1005 and they are large and it's taking a long time	[production]
02:29	<mwdeploy@tin>	scap sync-l10n completed (1.28.0-wmf.7) (duration: 09m 21s)	[production]
02:18	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: sync wikiversions.json - group0 to 1.28.0-wmf.8 refs T137492	[production]
02:16	<twentyafterfour>	promoting group0 to 1.28.0-wmf.8	[production]
00:02	<twentyafterfour@tin>	Finished scap: sync new branch, testwiki to php-1.28.0-wmf.8 refs T137492 (duration: 51m 59s)	[production]
2016-06-28 §
23:10	<twentyafterfour@tin>	Started scap: sync new branch, testwiki to php-1.28.0-wmf.8 refs T137492	[production]
23:10	<Krenair>	wikitech-static working now, poke me on IRC or file a #wikitech.wikimedia.org ticket if you find any issues	[production]
23:10	<twentyafterfour>	syncing new branch 1.28.0-wmf.8 refs T137492	[production]
23:04	<ebernhardson@tin>	Synchronized php-1.28.0-wmf.7/extensions/EventBus/EventBus.php: SWAT: EventBus: Match the expected format of response log key (duration: 00m 31s)	[production]
23:01	<Krenair>	Updating MW version on wikitech-static to 1.27 (LTS) - https://lists.wikimedia.org/pipermail/mediawiki-announce/2016-June/000191.html	[production]
21:59	<halfak>	deploying ores beec291	[production]
21:33	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7	[production]
21:31	<twentyafterfour@tin>	Synchronized php-1.28.0-wmf.7/extensions/AbuseFilter/: deploy https://gerrit.wikimedia.org/r/#/c/296464/ refs T138550 T136973 (duration: 00m 36s)	[production]
21:24	<twentyafterfour>	deploying wmf.7 yet again, once CI finishes testing https://gerrit.wikimedia.org/r/#/c/296464/ refs T138550 T136973	[production]
20:24	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: once again rolling back to wmf.6 refs T136973 T138550	[production]
20:11	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7	[production]
20:09	<twentyafterfour@tin>	Synchronized php-1.28.0-wmf.7/extensions/AbuseFilter/: deploying https://gerrit.wikimedia.org/r/#/c/296440/ refs T138550, T136973 (duration: 02m 06s)	[production]
20:09	<twentyafterfour>	deploying https://gerrit.wikimedia.org/r/#/c/296440/ to hopefully unblock wmf.7 deployments. refs T138550, T136973	[production]
20:08	<gehel>	disabling puppet on wdqs100[12] to cleanup after failed scap3 deplyoment	[production]
19:33	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: Rolling back to wmf.6: save time regression is still present in wmf.7	[production]
19:32	<twentyafterfour>	Rolling back to wmf.6: T138550 is still a problem	[production]
19:23	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7	[production]
19:23	<twentyafterfour>	Deploying 1.28.0-wmf.7 to all wikis	[production]
18:23	<mutante>	zosma - fresh install, sign puppet certs, initial puppet run	[production]
16:16	<gehel>	starting rolling restart of elasticsearch codfw cluster (T138811)	[production]
15:25	<thcipriani@tin>	Synchronized portals: SWAT: [[gerrit:296399\|Bumping portals to master (T136874)]] (duration: 00m 29s)	[production]