production SAL

451-500 of 10000 results (9ms)

2012-05-07 §
22:35	<awjrichards>	synchronizing Wikimedia installation... : Sync'ing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments/2012-05-07	[production]
22:34	<awjrichards>	synchronizing Wikimedia installation... : Sync'ing MobileFrontend changes per http://www.mediawiki.org/wiki/Extension:MobileFrontend/Deployments/2012-05-07	[production]
22:24	<RoanKattouw>	chmod 775 /usr/local/apache/common-local/php-1.20wmf2/extensions/PageTriage with dsh as root	[production]
22:19	<raindrift>	synchronized php-1.20wmf1/resources/startup.js 'touch'	[production]
22:18	<binasher>	rebooting nfs2 to new kernel	[production]
22:16	<raindrift>	synchronized wmf-config/InitialiseSettings.php 'enabling PageTriage on enwp'	[production]
22:14	<raindrift>	synchronized php-1.20wmf2/extensions/PageTriage 'Syncing PageTriage to enwp, a la carte'	[production]
22:14	<raindrift>	synchronized php-1.20wmf1/extensions/PageTriage 'Syncing PageTriage to enwp, a la carte'	[production]
21:59	<mutante>	was still upgrading/rebooting amssq* and knsq* hosts on the side (slow,b/c upload squids). expect temp. nagios squid reports tomorrow as well. out for now.	[production]
21:44	<binasher>	moved default resolution for upload from eqiad to pmtpa	[production]
21:29	<cmjohnson1>	shutting down storage3 for troubleshooting	[production]
20:37	<binasher>	attempting a live online schema change for zuwikitionary.recentchanges on the prod master	[production]
20:22	<LeslieCarr>	(above) restarted nagios-wm on spence	[production]
20:20	<LeslieCarr>	restarted irc bot	[production]
20:15	<binasher>	rebooting db45	[production]
20:11	<binasher>	rebooting db1019	[production]
18:46	<reedy>	synchronized php-1.20wmf1/extensions/Collection/Collection.session.php 'head'	[production]
18:45	<reedy>	synchronized php-1.20wmf2/extensions/Collection/Collection.session.php 'head'	[production]
18:25	<reedy>	synchronized php-1.20wmf2/extensions/GlobalBlocking/GlobalBlocking.class.php	[production]
18:24	<reedy>	synchronized php-1.20wmf1/extensions/GlobalBlocking/GlobalBlocking.class.php	[production]
18:07	<reedy>	rebuilt wikiversions.cdb and synchronized wikiversions files: enwiki to 1.20wmf2	[production]
16:16	<cmjohnson1>	shutting down storage3 to reseat RAID card	[production]
15:58	<cmjohnson1>	Going to power cycling storage3 several times to troubleshoot hardware issue	[production]
15:15	<RobH>	updating firmware on storgae3	[production]
14:20	<Jeff_Green>	stopped cron jobs on storage3 because of RAID failure	[production]
12:49	<mutante>	pushing out virtual host for wikimania2013 wiki. sync / apache-graceful/all	[production]
11:18	<mutante>	continuing with upgrades/reboots in amssq* on the side during the day	[production]
11:09	<mutante>	squids - sq* done. all latest kernel and 0 pending upgrades.	[production]
09:27	<mutante>	rebooting bits varnish sq68-70 one by one..	[production]
08:01	<mutante>	upgrading/rebooting the last couple sq* servers	[production]
07:20	<binasher>	power cycled db45 (crashed dewiki slave)	[production]
07:05	<asher>	synchronized wmf-config/db.php 'db45 is down'	[production]
02:25	<Tim>	on locke: introduced 1/100 sampling for banner impressions, changed filename to bannerImpressions-sampled100.log	[production]
02:12	<Tim>	on locke: moved fundraising logs back where they were	[production]
02:00	<LocalisationUpdate>	failed: git pull of extensions failed	[production]
01:38	<Tim>	on locke: compressing bannerImpressions.log	[production]
01:35	<Tim>	on locke: moved bannerImpressions.log to archive and restarted udp2log	[production]
01:26	<Tim>	on locke: moved fundraising logs from /a/squid/fundraising/logs to /a/squid so that they will be processed by logrotate	[production]
2012-05-06 §
07:03	<apergos>	manually rotates udplogs on locke, copying destined_for_storage3 off to hume:/archive/emergencyfromlocke/ (jeff, this note's for you in particular)	[production]
06:36	<apergos>	bringing up storage3 with neither /a nor /archive mounted, saw "The disk drive for /archive is not ready yet or not present" etc on boot, waited a long time, finally skipped them	[production]
06:12	<apergos>	and powercycling the box instead. grrrr	[production]
06:05	<apergos>	rebooting storage3: we have messages like May 6 05:45:12 storage3 kernel: [465081.410025] Filesystem "dm-0": xfs_log_force: error 5 returned. in the log, and the raid is unaccessible, megacli doesn't run either	[production]
02:00	<LocalisationUpdate>	failed: git pull of extensions failed	[production]
2012-05-05 §
09:37	<mutante>	squids - upgrading in the sq5x range (upload)	[production]
08:53	<apergos>	disabling modcompress temporarily for lightty on dataset2 (live hack), let's see what that does as far as it dying. could be issue similar to http://redmine.lighttpd.net/issues/2391	[production]
06:45	<mutante>	squids - upgrading sq44,48 (upload)	[production]
05:23	<mutante>	squids - finishing a couple reboots in the sq7x range	[production]
03:04	<binasher>	rebooting db1006 as well	[production]
03:04	<binasher>	rebooting db1038, kernel uptime scheduler chaos	[production]
02:00	<LocalisationUpdate>	failed: git pull of extensions failed	[production]