2010-11-19
|
23:04 |
<Ryan_Lane> |
changed https check on payments to 1 retry in nagios, via puppet |
[production] |
20:59 |
<atglenn> |
restarted torrus. guess why :-P |
[production] |
17:52 |
<mark> |
Re-added notification of Service[nagios] when changing nagios types in puppet |
[production] |
13:59 |
<richcole> |
set raid 10 up on DB41 |
[production] |
08:36 |
<catrope> |
synchronized php-1.5/wmf-config/CommonSettings.php 'bug 25850 - Hide Take me Back link on all wikis' |
[production] |
07:32 |
<JeLuF> |
started puppet on sq42 sq41 sq47 sq45 sq46 sq44 sq50 sq52 sq53 sq51 sq48 sq55 sq54 sq43 sq56 sq58 sq64 sq62 sq61 sq65 sq66 sq60 sq63 sq72 sq75 sq71 sq73 sq74 sq76 sq78 sq77 sq81 sq80 sq79 sq83 sq84 sq82 sq85 sq86 |
[production] |
05:21 |
<apergos> |
on sq85 I was seeing complaints from cron about restart of puppet: unknown option -w. removed that from /etc/default/puppet and restarted, but that fails: Could not parse for environment production: Could not find file /agent.pp |
[production] |
01:00 |
<atglenn> |
added monitoring mechanism in root's crontab on sq85 (don't need it everywhere) that will sms me when ms4 is acting up. I'd do it in puppet if someone told me how they would want it to be added there. |
[production] |
00:22 |
<tomasz> |
turning db9 watchdog back on. setting at 5 minutes |
[production] |
2010-11-18
|
21:57 |
<richcole> |
DB42 shutdown for service |
[production] |
21:40 |
<JeLuF> |
CORRECTION: started puppet manually on sq59 sq61 sq73 sq60 sq62 sq65 sq77 sq63 sq64 sq72 sq75 sq74 sq76 sq71 sq78 sq66, startup script is broken. |
[production] |
21:40 |
<JeLuF> |
started squid manually on sq59 sq61 sq73 sq60 sq62 sq65 sq77 sq63 sq64 sq72 sq75 sq74 sq76 sq71 sq78 sq66, startup script is broken. |
[production] |
21:26 |
<mark> |
Fixed puppet on formey |
[production] |
21:24 |
<mark> |
Fixed puppet on linne |
[production] |
19:57 |
<JeLuF> |
blocked UDP from srv124 on nfs1 aka syslog |
[production] |
17:00 |
<JeLuF> |
restarted puppet on srv215, srv235, srv244, srv257, srv262, srv288 |
[production] |
16:33 |
<JeLuF> |
fixed puppet on srv185 and srv200 |
[production] |
15:00 |
<aaron> |
synchronized php-1.5/wmf-config/flaggedrevs.php 'Set FR_INCLUDES_CURRENT on mediawikiwiki' |
[production] |
2010-11-17
|
20:19 |
<JeLuF> |
syslog is being spammed with one week old messages from srv124 |
[production] |
20:19 |
<RobH> |
owa1/2/3 online with base OS install and puppet updates |
[production] |
17:59 |
<RobH> |
updated dns for new database servers |
[production] |
17:15 |
<richcole> |
owa1 going down for repair |
[production] |
15:52 |
<Ryan_Lane> |
moved the nagios purge stuff out of puppet, and into nagios's init script. Pulled the nagios init script into puppet |
[production] |
10:03 |
<tomasz_> |
adding single field index on converted amount under public_reporting within civicrm db on db9 |
[production] |
10:03 |
<tomasz_> |
adding single field indexes to utm_source, utm_medium, and utm_campaign under contribution_tracking table within drupal db on db9 |
[production] |
03:44 |
<atglenn> |
restarted apache on ekrem, many processes hung in "graceful close" state for a long period of time |
[production] |
03:06 |
<tfinc> |
synchronized php-1.5/extensions/CentralNotice/SpecialBannerController.php |
[production] |
03:04 |
<Tim> |
in puppet, disabled nagios::purge since it breaks puppet entirely on fenari. Removed Aaron's obsolete ssh public key by adding an ensure=>absent to puppet. |
[production] |
01:34 |
<Tim> |
on ekrem: ran logrotate -f, since log rotation previously failed due to disk full |
[production] |
01:28 |
<Tim> |
on ekrem: root partition full, deleted old apache access logs |
[production] |