2013-11-28
|
10:06 |
<apergos> |
stack traces filling up parsoid nohup.out logs (several gigs in only a few minutes once parsoid gets into that state), sample on wtp1010 in /var/lib/parsoid/nohup.out.errors |
[production] |
08:34 |
<apergos> |
and wtp1023 |
[production] |
08:29 |
<apergos> |
/var/lib/parsoid/nohup.out on wtp1005, wtp1011 and wtp1012 was 6gb or more, causing / on these boxes to fill; moved it, restarted parsoid, removed it |
[production] |
07:16 |
<apergos> |
powercycled sq80 |
[production] |
05:41 |
<ori> |
synchronized wmf-config/CommonSettings.php 'Icdaa4c1b5: Configure parser cache databases in db-$realm file (3/3)' |
[production] |
05:41 |
<ori> |
synchronized wmf-config/db-pmtpa.php 'Icdaa4c1b5: Configure parser cache databases in db-$realm file (2/3)' |
[production] |
05:40 |
<ori> |
synchronized wmf-config/db-eqiad.php 'Icdaa4c1b5: Configure parser cache databases in db-$realm file (1/3)' |
[production] |
05:37 |
<ori> |
updated /a/common to {{Gerrit|Icdaa4c1b5}}: Configure parser cache databases in db-$realm file |
[production] |
03:37 |
<springle> |
synchronized wmf-config/db-eqiad.php 'repool slaves after package upgrade, (lvm snapshot boxes only, LB=0)' |
[production] |
03:16 |
<springle> |
synchronized wmf-config/db-eqiad.php 'depool slaves for package upgrade' |
[production] |
02:43 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Thu Nov 28 02:42:58 UTC 2013 |
[production] |
02:29 |
<springle> |
synchronized wmf-config/db-eqiad.php 'slaves to full steam after package upgrade' |
[production] |
02:15 |
<LocalisationUpdate> |
completed (1.23wmf5) at Thu Nov 28 02:15:36 UTC 2013 |
[production] |
02:09 |
<LocalisationUpdate> |
completed (1.23wmf4) at Thu Nov 28 02:09:38 UTC 2013 |
[production] |
01:17 |
<springle> |
synchronized wmf-config/db-eqiad.php 'warm up slaves after package upgrade' |
[production] |
01:02 |
<ori-l> |
started rsync of graphite data (~400gb) from professor.pmtpa to tungsten.eqiad |
[production] |
00:40 |
<springle> |
synchronized wmf-config/db-eqiad.php 'depool slaves for package upgrade' |
[production] |
2013-11-27
|
19:50 |
<demon> |
synchronized wmf-config/InitialiseSettings.php 'Fixes for Flow config, no-op in prod' |
[production] |
19:49 |
<demon> |
synchronized wmf-config/CommonSettings.php 'Fixes for Flow config, no-op in prod' |
[production] |
18:12 |
<paravoid> |
kill -9 gdb on cp3012, attached to varnish frontend |
[production] |
11:28 |
<ori-l> |
faidon switched gdash.wm.o from professor.pmtpa -> tungsten.eqiad behind misc-varnish & rebooted ssl1 in tampa |
[production] |
11:11 |
<apergos> |
ssl1 rebooted itself about 15 mins ago, no idea why |
[production] |
10:20 |
<ariel> |
synchronized wmf-config/db-eqiad.php 'db1019 (s3) back to full weight in the pool' |
[production] |
10:19 |
<ariel> |
updated /a/common to {{Gerrit|If5ebd6194}}: db1019 (s3) back to full weight in pool |
[production] |
10:08 |
<apergos> |
shot some old puppet processes hogging memory on db9 (from March and earlier) |
[production] |
09:49 |
<apergos> |
there was no mount /srv/pagecounts on labstore4, so rsync to /exp/pagecounts wrote to and filled /; did the mkdir and now things seem ok |
[production] |
08:00 |
<ariel> |
synchronized wmf-config/db-eqiad.php 'warm up db1019 (s3) after lvm resize' |
[production] |
07:59 |
<ariel> |
updated /a/common to {{Gerrit|I50354e622}}: warm up db1019 (s3) after lvm resize |
[production] |
07:38 |
<apergos> |
rebooting db1019 after kernel upgrade, fix for broken xfs_growfs |
[production] |
07:02 |
<ariel> |
synchronized wmf-config/db-eqiad.php 'depool db1019 (s3) temporarily for lvm resize' |
[production] |
07:01 |
<ariel> |
updated /a/common to {{Gerrit|I4372bb602}}: depool db1019 (s3) temporarily for lvm resize |
[production] |
02:40 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Wed Nov 27 02:40:52 UTC 2013 |
[production] |
02:15 |
<LocalisationUpdate> |
completed (1.23wmf5) at Wed Nov 27 02:15:19 UTC 2013 |
[production] |
02:08 |
<LocalisationUpdate> |
completed (1.23wmf4) at Wed Nov 27 02:08:28 UTC 2013 |
[production] |
00:32 |
<springle> |
stopping replication on sanitarium db1054:3308 and labsdb1002:3308 while restoring dewiki to labs |
[production] |
2013-11-26
|
22:58 |
<mutante> |
deploying gerrit 92925 & 91124 (apache-config), makes /entity/ URLs on wikidata 303 and removes non-existent noncom wiki |
[production] |
21:15 |
<mutante> |
praseodymium, cerium, xenon: disabled icinga notifications and scheduled 1yr downtime for host and all services per gwicke, they are test hosts and not prod. |
[production] |
21:04 |
<bblack> |
killed attached gdb on cp3012's varnishd frontend, which restarted it... |
[production] |
20:21 |
<csteipp> |
synchronized php-1.23wmf4/extensions/TimedMediaHandler 'bug56699' |
[production] |
20:16 |
<csteipp> |
synchronized php-1.23wmf5/extensions/TimedMediaHandler 'bug56699' |
[production] |
20:05 |
<reedy> |
rebuilt wikiversions.cdb and synchronized wikiversions files: Not wmf5 day today |
[production] |
19:33 |
<reedy> |
rebuilt wikiversions.cdb and synchronized wikiversions files: Revert to 1.23wmf4 on wikisources |
[production] |
19:23 |
<reedy> |
rebuilt wikiversions.cdb and synchronized wikiversions files: All non wikipedias to 1.23wmf5 |
[production] |
19:21 |
<MaxSem> |
Reindexing GeoData |
[production] |
19:16 |
<reedy> |
updated /a/common to {{Gerrit|Idb0e7956e}}: slaves to full steam |
[production] |
08:00 |
<ori-l> |
re-pooled ssl1003 |
[production] |
07:57 |
<ori-l> |
depooling ssl1003 for quick test of puppet config |
[production] |
06:22 |
<apergos> |
powercycled labstore1001, unreachable, nothing on mgmt console |
[production] |
06:00 |
<springle> |
synchronized wmf-config/db-eqiad.php 'slaves to full steam after package upgrade' |
[production] |
05:59 |
<paravoid> |
applying workaround for Ganglia XSS https://github.com/ganglia/ganglia-web/issues/218 |
[production] |