production SAL

2016-06-29
10:31	<elukey>	rebooting analytics100[12] (Hadoop Yarn/HDFS master and standby) - one at a time, forcing failover manually with daemon restarts	[production]
09:54	<moritzm>	powercycling mw1163, stuck on reboot	[production]
09:23	<gehel>	banning elastic1001 to 1016 from cluster to prepare their decommissioning (T138329)	[production]
09:20	<ema>	upgrading diamond to 3.5-6 (T138758)	[production]
09:01	<elukey>	rebooting analytics1028->1057 for kernel upgrades (Hadoop worker nodes)	[production]
08:55	<moritzm>	powercycling mw1111, stuck on reboot	[production]
08:44	<elukey>	puppet stopped on analytics1027 to prevent the Camus job from running (prep step for Hadoop kernel upgrades)	[production]
08:40	<moritzm>	powercycling mw1108, stuck on reboot	[production]
08:12	<moritzm>	powercycling mw1099, stuck on reboot	[production]
08:12	<moritzm>	powercycling mw1097, stuck on reboot	[production]
08:05	<moritzm>	powercycling mw1092, stuck on reboot	[production]
07:47	<moritzm>	rolling reboot of appservers in eqiad for kernel security update	[production]
07:16	<moritzm>	powercycling snapshot1002, reboot stuck	[production]
07:11	<moritzm>	powercycling snapshot1001, reboot stuck	[production]
06:58	<moritzm>	rebooting most snapshot hosts for kernel security update	[production]
03:28	<krinkle@tin>	Synchronized wmf-config/InitialiseSettings.php: test2wiki (duration: 00m 33s)	[production]
02:56	<l10nupdate@tin>	ResourceLoader cache refresh completed at Wed Jun 29 02:56:32 UTC 2016 (duration 6m 30s)	[production]
02:50	<mwdeploy@tin>	scap sync-l10n completed (1.28.0-wmf.8) (duration: 04m 48s)	[production]
02:30	<chasemp>	labstore1004 is replicating NFS/DRBD shares to labstore1005 and they are large and it's taking a long time	[production]
02:29	<mwdeploy@tin>	scap sync-l10n completed (1.28.0-wmf.7) (duration: 09m 21s)	[production]
02:18	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: sync wikiversions.json - group0 to 1.28.0-wmf.8 refs T137492	[production]
02:16	<twentyafterfour>	promoting group0 to 1.28.0-wmf.8	[production]
00:02	<twentyafterfour@tin>	Finished scap: sync new branch, testwiki to php-1.28.0-wmf.8 refs T137492 (duration: 51m 59s)	[production]
2016-06-28
23:10	<twentyafterfour@tin>	Started scap: sync new branch, testwiki to php-1.28.0-wmf.8 refs T137492	[production]
23:10	<Krenair>	wikitech-static working now, poke me on IRC or file a #wikitech.wikimedia.org ticket if you find any issues	[production]
23:10	<twentyafterfour>	syncing new branch 1.28.0-wmf.8 refs T137492	[production]
23:04	<ebernhardson@tin>	Synchronized php-1.28.0-wmf.7/extensions/EventBus/EventBus.php: SWAT: EventBus: Match the expected format of response log key (duration: 00m 31s)	[production]
23:01	<Krenair>	Updating MW version on wikitech-static to 1.27 (LTS) - https://lists.wikimedia.org/pipermail/mediawiki-announce/2016-June/000191.html	[production]
21:59	<halfak>	deploying ores beec291	[production]
21:33	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7	[production]
21:31	<twentyafterfour@tin>	Synchronized php-1.28.0-wmf.7/extensions/AbuseFilter/: deploy https://gerrit.wikimedia.org/r/#/c/296464/ refs T138550 T136973 (duration: 00m 36s)	[production]
21:24	<twentyafterfour>	deploying wmf.7 yet again, once CI finishes testing https://gerrit.wikimedia.org/r/#/c/296464/ refs T138550 T136973	[production]
20:24	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: once again rolling back to wmf.6 refs T136973 T138550	[production]
20:11	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7	[production]
20:09	<twentyafterfour@tin>	Synchronized php-1.28.0-wmf.7/extensions/AbuseFilter/: deploying https://gerrit.wikimedia.org/r/#/c/296440/ refs T138550, T136973 (duration: 02m 06s)	[production]
20:09	<twentyafterfour>	deploying https://gerrit.wikimedia.org/r/#/c/296440/ to hopefully unblock wmf.7 deployments. refs T138550, T136973	[production]
20:08	<gehel>	disabling puppet on wdqs100[12] to clean up after failed scap3 deployment	[production]
19:33	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: Rolling back to wmf.6: save time regression is still present in wmf.7	[production]
19:32	<twentyafterfour>	Rolling back to wmf.6: T138550 is still a problem	[production]
19:23	<twentyafterfour@tin>	rebuilt wikiversions.php and synchronized wikiversions files: all wikis to 1.28.0-wmf.7	[production]
19:23	<twentyafterfour>	Deploying 1.28.0-wmf.7 to all wikis	[production]
18:23	<mutante>	zosma - fresh install, sign puppet certs, initial puppet run	[production]
16:16	<gehel>	starting rolling restart of elasticsearch codfw cluster (T138811)	[production]
15:25	<thcipriani@tin>	Synchronized portals: SWAT: [[gerrit:296399|Bumping portals to master (T136874)]] (duration: 00m 29s)	[production]
15:24	<thcipriani@tin>	Synchronized portals/prod/wikipedia.org/assets: SWAT: [[gerrit:296399|Bumping portals to master (T136874)]] (duration: 00m 24s)	[production]
15:16	<thcipriani@tin>	Synchronized dblists/visualeditor-default.dblist: SWAT: Enable VisualEditor by default for all users of the [[gerrit:292750|French (T136993)]], [[gerrit:292751|English (T136992)]], and [[gerrit:292752|German (T136991)]] Wikivoyage (duration: 00m 24s)	[production]
15:09	<thcipriani@tin>	Synchronized dblists/visualeditor-default.dblist: SWAT: [[gerrit:292749|Enable VisualEditor by default for all users of the Italian Wikivoyage (T136994)]] (duration: 00m 25s)	[production]
14:52	<gehel>	powercycling elastic1004 (server not coming up during restart - T138811)	[production]
13:46	<godog>	bounce carbon on graphite machines after applying https://gerrit.wikimedia.org/r/266567	[production]
13:40	<elukey@palladium>	conftool action : set/pooled=yes; selector: aqs1001.eqiad.wmnet	[production]