production SAL

2012-01-06
21:35	<RobH>	db1029 coming down for ssd testing	[production]
21:26	<RobH>	cp1014 and cp1019 hdd controller cables replaced (removed for testing controllers), both can be used normally	[production]
21:19	<binasher>	restoring db22 from a live hotbackup of db1038	[production]
21:18	<RobH>	es1002 back ready for service use per #2220: replace original RAID card in es1002	[production]
21:05	<binasher>	putting db51 into production as an s4 slave	[production]
21:05	<asher>	synchronized wmf-config/db.php 'adding db51 as an s4 slave'	[production]
20:57	<binasher>	started slaving db51 off of db31	[production]
20:21	<RobH>	rt2226 - redeploy db22 for asher	[production]
20:19	<RobH>	db22 reinstalled and booting into OS. No puppet runs yet, now it's Asher's problem ;]	[production]
20:04	<RobH>	db22 reinstalling	[production]
19:24	<binasher>	started innodb hot backup of db1038 to db51	[production]
18:43	<maplebed>	s4 database rotation complete. outage duration 36 minutes.	[production]
18:37	<maplebed>	pushed out new db.php setting s4 to read-write	[production]
18:37	<ben>	synchronized wmf-config/db.php	[production]
18:35	<maplebed>	db31 made read-write as the new master for s4	[production]
18:31	<maplebed>	old master for s4 log file db22-bin.000106 log pos 631618956	[production]
18:30	<maplebed>	new master for s4: db31, log file db31-bin.000213 log pos is 205612709	[production]
18:24	<asher>	synchronized wmf-config/db.php 'setting s4 to read only, preparing to make db31 master'	[production]
18:22	<Reedy>	Commons having db issues, db22 (s4 master) has a disk issue	[production]
16:02	<apergos>	restarted lighttpd on dataset2	[production]
16:01	<Reedy>	HTTP server (lighttpd?) seems to be down on dataset2	[production]
15:46	<RoanKattouw>	Removing gs_* files in /tmp on srv220 that are >30 min old	[production]
15:44	<reedy>	synchronized wmf-config/InitialiseSettings.php 'Bug 33556 - ArticleFeedback settings on Chinese wikipedia'	[production]
15:43	<RoanKattouw>	Removed /tmp/mw-cache-1.17 and /tmp/mw-cache-1.17-test on srv220	[production]
15:41	<Reedy>	srv220 / is at 100% usage	[production]
15:41	<reedy>	synchronized wmf-config/InitialiseSettings.php 'Bug 33556 - ArticleFeedback settings on Chinese wikipedia'	[production]
14:34	<mutante>	saw the log about cp1043/44 being deliberately left broken, but requirement in varnish.pp also broke others, fixed on sq67,68,69 (gerrit change 1802)	[production]
02:01	<LocalisationUpdate>	completed (1.18) at Fri Jan 6 02:05:01 UTC 2012	[production]
01:25	<binasher>	puppet is being deliberately left broken on cp1043 and 1044 until tomorrow	[production]
01:23	<binasher>	backend varnish instance on cp1042 running 3.0.2 is in production for 1/3 of mobile requests	[production]
2012-01-05
22:15	<preilly>	small fix for iPhone vary support	[production]
22:15	<preilly>	synchronized php-1.18/extensions/MobileFrontend/MobileFrontend.php	[production]
21:39	<Ryan_Lane>	rebooting virt1	[production]
21:01	<reedy>	synchronized wmf-config/CommonSettings.php 'wmgShortUrlPrefix'	[production]
21:01	<reedy>	synchronized wmf-config/InitialiseSettings.php 'wmgShortUrlPrefix'	[production]
20:08	<Reedy>	Created ShortUrl tables on testwiki	[production]
20:07	<reedy>	synchronizing Wikimedia installation... : Update extensionmessages	[production]
20:05	<reedy>	synchronized wmf-config/CommonSettings.php 'wmgUseShortUrl'	[production]
20:04	<reedy>	synchronized wmf-config/InitialiseSettings.php 'wmgUseShortUrl'	[production]
20:02	<reedy>	synchronized php-1.18/extensions/ShortUrl 'Pushing ShortUrl files out'	[production]
19:08	<notpeter>	restarting dhcpd on brewster	[production]
18:45	<preilly>	pushing fix for js error on production	[production]
18:45	<preilly>	synchronized php-1.18/extensions/MobileFrontend/ApplicationTemplate.php	[production]
18:45	<preilly>	synchronized php-1.18/extensions/MobileFrontend/javascripts/application.js	[production]
18:00	<mutante>	tarin - added "#includedir /etc/sudoers.d" to sudo config, needs to read /etc/sudoers.d/nrpe for Nagios RAID check	[production]
17:49	<logmsgbot_>	hashar: gallium: cleaned /tmp . Our test suites leak a large amount of files :D	[production]
17:49	<^demon>	removed chuck norris plugin from jenkins, restarted	[production]
16:48	<mutante>	payments4 - 25 running nginx procs cause a warning - but normal and just raise limit?	[production]
16:15	<mutante>	people claim it was completely resolved with the "2.6.38-10 backport from PPA" (add-apt-repository ppa:kernel-ppa/ppa ...). wanna try that? (or just reboot ms1002 pls)	[production]
15:45	<mutante>	ms1002 - kswapd 100% CPU - but no swap used and free memory left - this looks like https://bugs.launchpad.net/ubuntu/+bug/721896 again	[production]
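
The 2012-01-06 15:46 entry mentions removing gs_* files in /tmp on srv220 that are more than 30 minutes old. The log does not record the command that was used, so the following is only a minimal sketch of such a cleanup. The function name `remove_stale_files` and its parameters are hypothetical, and using file modification time as the age criterion is an assumption.

```python
import fnmatch
import os
import time


def remove_stale_files(directory, pattern="gs_*", max_age_seconds=30 * 60, now=None):
    """Delete regular files in `directory` whose name matches `pattern`
    and whose modification time is older than `max_age_seconds`.

    Hypothetical helper for illustration; returns the sorted names removed.
    """
    now = time.time() if now is None else now
    removed = []
    for entry in os.scandir(directory):
        # Only consider regular files; skip directories and symlinks.
        if not entry.is_file(follow_symlinks=False):
            continue
        if not fnmatch.fnmatch(entry.name, pattern):
            continue
        age = now - entry.stat(follow_symlinks=False).st_mtime
        if age > max_age_seconds:
            os.remove(entry.path)
            removed.append(entry.name)
    return sorted(removed)
```

Calling `remove_stale_files("/tmp")` would apply the defaults from the log entry (gs_* files, 30 minutes). It does not descend into subdirectories, which is a deliberate choice to keep the sketch conservative.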