production SAL

9751-9800 of 10000 results (42ms)

2014-02-11 §
07:47	<gwicke>	deployed parsoid hotfix to avoid recursive log spew filling up the logs	[production]
03:39	<RoanKattouw>	Freed up disk space on wtp* by blanking /var/log/parsoid/parsoid.log	[production]
03:30	<RoanKattouw>	Managed to restart Parsoid cleanly in the end. Turns out dsh -g parsoid service restart parsoid doesn't work but dsh -cM -g parsoid /etc/init.d/parsoid restart does work	[production]
03:23	<RoanKattouw>	Doing rolling restart of the Parsoid cluster	[production]
03:21	<RoanKattouw>	Copying wtp1005's Parsoid log for future reference	[production]
02:43	<Tim>	restarting apache on servers with workers in a futex wait: mw1056,mw1082,mw1189,mw1196,mw1199,mw1201,mw1202,mw1203,mw1204,mw1208	[production]
02:40	<LocalisationUpdate>	ResourceLoader cache refresh completed at 2014-02-11 02:40:11+00:00	[production]
02:19	<LocalisationUpdate>	completed (1.23wmf13) at 2014-02-11 02:19:13+00:00	[production]
02:10	<LocalisationUpdate>	completed (1.23wmf12) at 2014-02-11 02:10:38+00:00	[production]
02:02	<awight>	update crm from 7e5786aaf71363c04e5c766e6b403fa4767fe51a to 06f7e4d6d6c2653f1d4aef1e8b8e6293a82b39ef	[production]
01:25	<mwalker>	updated fundraising civicrm from d96164f76877d75ab97c02e6b47449f7a45b31b3 to 7e5786aaf71363c04e5c766e6b403fa4767fe51a for unsbuscribe and quick search changes	[production]
00:56	<^demon>	gerrit upgraded from 2.8.1 stable to 2.8.1-1-g83098d0 (custom build) to work around mysql issue pending upstream release.	[production]
00:34	<maxsem>	finished scap: MobileApp deployment (duration: 28m 19s)	[production]
00:06	<maxsem>	started scap: MobileApp deployment	[production]
00:04	<maxsem>	scap aborted: MobileApp deployment (duration: 06m 33s)	[production]
2014-02-10 §
23:58	<maxsem>	started scap: MobileApp deployment	[production]
23:03	<mutante>	all parsoid machines reployed per gwicke's	[production]
23:00	<bd808>	mw1185 segfaulting starting at 22:39Z. ~240 occurrences in last 20 minutes	[production]
22:59	<aaron>	synchronized php-1.23wmf13/includes/db/LoadBalancer.php '8f6471e04ce0f33c64c090cbe5561deed82f60ee'	[production]
22:59	<springle>	restarting db1050 for investigation	[production]
22:45	<mutante>	restarting parsoid on wtp1008	[production]
22:44	<springle>	synchronized wmf-config/db-eqiad.php 'sync proper non-hot depool db1050'	[production]
22:39	<springle>	synchronized wmf-config/db-eqiad.php 'move s1 vslow dump'	[production]
22:36	<maxsem>	synchronized wmf-config 'https://gerrit.wikimedia.org/r/112597'	[production]
22:34	<mark>	Power cycled ms-be1001	[production]
22:32	<springle>	pt-kill jobs on s1 slaves killing anything sleeping longer than 10s	[production]
22:28	<springle>	killed thousands of broken connections on s1 slaves in Sleep state	[production]
22:25	<maxsem>	scap aborted: Extension:MobileApp deployment (duration: 14m 41s)	[production]
22:23	<matanya>	big dberror spike. "Error connecting" to various ips from various ips	[production]
22:17	<springle>	synchronized wmf-config/db-eqiad.php 'db1050 crashed, depool'	[production]
22:12	<mutante>	fixing broken parsoid deploy on wtp*, one by one	[production]
22:10	<maxsem>	started scap: Extension:MobileApp deployment	[production]
22:02	<mutante>	wtp1016 - delete deployment/parsoid, salt-call fetch/checkout.., restart parsoid	[production]
21:57	<gwicke>	unsuccessful Parsoid deploy as trebuchet failed to update the submodule with the parsoid source, need trebuchet bug fix	[production]
21:21	<catrope>	synchronized wmf-config/InitialiseSettings.php 'touch'	[production]
21:11	<catrope>	synchronized visualeditor-default.dblist 'fix missing entries'	[production]
20:14	<mutante>	harmon - revoke puppet cert,disable puppet,disable icinga notifications, shutting down	[production]
20:09	<mutante>	harmon - removing from puppet stored configs, complete decom, unused Tampa spare	[production]
17:45	<mwalker>	updated civicrm from 97a5146124168096148b6167e2968052b3dda468 to d96164f76877d75ab97c02e6b47449f7a45b31b3 for thank you translations	[production]
16:28	<manybubbles>	correction: done with link count update for cirrus	[production]
16:28	<manybubbles>	done with links count update for cirurs	[production]
15:58	<hashar>	Jenkins: migrating labs jenkins-deploy user homedir from /home/jenkins-deploy (GlusterFS) to local directories under /mnt/home/jenkins-deploy to avoid GlusterFS and race conditions between instances. {{bug\|61144}}	[production]
15:54	<manybubbles>	reindex went well. performing a links recount so we can push more code changes next week safely.	[production]
15:28	<manybubbles>	reindexing phase 0 wikis after Cirrus deploy last Thursday	[production]
12:20	<hashar>	Jenkins: deleted /srv/slave-scrips from old jenkins servers, everything should now use /srv/deployment/integration/slave-scripts	[production]
03:41	<springle>	synchronized wmf-config/db-pmtpa.php 's2 switch master to db1023 (eqiad)'	[production]
03:40	<springle>	synchronized wmf-config/db-eqiad.php 's2 switch master to db1023 (eqiad)'	[production]
03:16	<springle>	synchronized wmf-config/db-pmtpa.php 's2 switch master to db1023 (pmtpa)'	[production]
03:16	<springle>	synchronized wmf-config/db-eqiad.php 's2 switch master to db1023 (eqiad)'	[production]
03:03	<springle>	synchronized wmf-config/db-pmtpa.php 'prepare for s2 master rotation db1036 to db1024 (pmtpa)'	[production]