6101-6150 of 10000 results (22ms)
2010-07-12 §
11:45 <JeLuF> fixed broken ganglia-metrics installation on srv146 (chown gmetric /var/log/gmetricd/gmetricd.log) [production]
11:41 <JeLuF> added DPKG status monitoring for all app servers to nagios. Reports all packages that are not in state 'rc' or 'ii'. [production]
10:43 <JeLuF> lots of false alerts from nagios due to missing SSL setup for NRPE. Working on it. [production]
09:53 <JeLuF> changed puppet config to install nrpe on all app servers [production]
09:28 <JeLuF> replacing opsview-nrpe agents by nagios-nrpe agents (image_scalers, some other apaches). Most apaches already use nagios-nrpe [production]
07:40 <Tim> set up NRPE disk space monitoring on ms4, discovered that /mnt2 is full [production]
04:54 <Tim> updated NFS host/service groups to monitor the actual NFS servers, not a random collection of miscellaneous ex-NFS servers [production]
04:46 <Tim> installed NRPE on nfs1 and nfs2 [production]
04:08 <Tim> adding rendering, m, bits.esams, recursor0, recursor1, recursor0.esams to nagios [production]
04:02 <Tim> added forward DNS entry for recursor0.esams, modified reverse DNS entry resolver0.esams -> recursor0.esams [production]
03:55 <Tim> fixed reverse DNS entries for recursor0 and recursor1, were set incorrectly to non-existent hostnames "resolver0" and "recursor1" [production]
03:36 <Tim> renamed db6.mgmt to locke.mgmt [production]
2010-07-10 §
14:14 <rainman-sr> search7 disk was full, deleting some old unneccessary indexes [production]
12:50 <Fred> applied security updates on all machine running Karmic or Lucid (per USN-959-1) [production]
2010-07-09 §
18:07 <domas> forgot to log, rebooted locke, put startup stuff to rc.local, maybe Tim changed it afterwards, hehe. beer is good too. [production]
15:31 <Rob> wikimania2011wiki is now using vector [production]
15:31 <robh> synchronized php-1.5/wmf-config/InitialiseSettings.php [production]
12:48 <robh> ran sync-common-all [production]
01:06 <tstarling> synchronized php-1.5/includes/filerepo/RepoGroup.php [production]
01:04 <tstarling> synchronized php-1.5/includes/filerepo/RepoGroup.php [production]
01:04 <root> synchronized php-1.5/includes/filerepo/RepoGroup.php [production]
01:03 <tstarling> synchronized php-1.5/includes/filerepo/RepoGroup.php [production]
00:59 <tstarling> synchronized php-1.5/includes/filerepo/RepoGroup.php [production]
2010-07-08 §
22:27 <apergos> powercycled db9 fromm drac after shutdown failed [production]
22:20 <Fred> re-imaging srv225 back to normal until wikimedia-task*can be ported to lucid. [production]
22:15 <apergos> rebooting db9, mysqld was defunct but the port was in use so couldn't restart it the nice way [production]
17:06 <mark> Set temporary 91.198.174.0/24 null0 route on br1-knams, to investigate prefix announcement problems [production]
16:10 <Rob> updated puppet to add zak to the mortals admin group and allowed access to shell on fenari as non-root [production]
04:10 <Tim> starting upload of BnF images, using importImages.php in screen on fenari [production]
2010-07-07 §
21:38 <Fred> RIP sfoservices. (box not booting at all anymore) [production]
17:10 <Fred> re-imaging srv225 to the apache cluster. [production]
16:26 <mark> Fixed puppet on srv193 [production]
15:56 <mark> Fixed horrible gmond mess on searchidx1 [production]
15:42 <mark> Fixed puppet on sr255 [production]
15:30 <mark> Mounted /mnt/upload6 on srv255 [production]
14:23 <mark> Fixed /home backup on nfs1/nfs2 to tridge [production]
07:57 <Rob> srv193 is refusing to take my updates, removed it from pybal so it doesnt serve out of data information [production]
07:53 <robh> ran sync-common-all [production]
07:50 <Rob> updated dns for wikimania wiki [production]
07:38 <Rob> adding wikimaniawiki apache support, sycning lots of apaches and docroots. [production]
01:49 <Tim> downloading the DjVu files via rsync/ssh for http://www.wikimedia.fr/wikim%C3%A9dia-france-signe-un-partenariat-avec-la-bnf [production]
2010-07-06 §
13:43 <mark> Fixed puppet on nfs1 and nfs2 [production]
11:11 <mark> Removed config cache on srv110 [production]
10:32 <mark> Fixed puppet on srv110 [production]
10:26 <mark> Stopped apache on srv110 [production]
00:28 <Tim> restarted mailman on lily [production]
00:23 <Tim> killed all mailman processes on lily in an attempt to save it from swap death (swapping severely since 00:07) [production]
00:12 <Tim> fixed stale /home on searchidx1 and restarted indexer [production]
00:02 <Tim> codereview-proxy is up now. Pinging CR update API for all recent revisions [production]
2010-07-05 §
23:55 <tstarling> synchronized php-1.5/wmf-config/CommonSettings.php 'new URL for codereview proxy' [production]