2014-02-24
|
19:30 |
<Coren> |
remmoting virt1001 (sick stuck on bad mounts) |
[production] |
17:06 |
<bd808> |
Logs on logstash1003 show elasticsearch split brain starting at 2014-02-23T00:00:12. logstash1001 and logstash1003 both thought they were master. logstash1001 not responding to logstash1003's requests to become authoritative. |
[production] |
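Note on the split brain above: two nodes each believing they are master is the classic symptom of a master quorum that is set too low. The sketch below shows the usual majority rule, floor(n/2) + 1, for the number of master-eligible nodes. The node count of 3 is an assumption inferred from the host names logstash1001 and logstash1003; the log does not state the actual cluster configuration. In Elasticsearch releases of this era the value was typically applied through the `discovery.zen.minimum_master_nodes` setting.

```python
def minimum_master_nodes(master_eligible: int) -> int:
    """Return the majority quorum for a given number of master-eligible nodes."""
    if master_eligible < 1:
        raise ValueError("need at least one master-eligible node")
    # A strict majority: more than half of the master-eligible nodes.
    return master_eligible // 2 + 1


# Assumed cluster size (hypothetical, not confirmed by the log).
ASSUMED_NODES = 3
quorum = minimum_master_nodes(ASSUMED_NODES)
print(f"{ASSUMED_NODES} master-eligible nodes -> quorum of {quorum}")
```

With a quorum of 2 in a 3-node cluster, a node that is isolated from both peers cannot elect itself master, so two masters cannot coexist.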
16:38 |
<bd808> |
Logstash elasticsearch split-brain resulted in loss of all logs for 2014-02-24 from 00:00Z to ~16:30Z |
[production] |
16:16 |
<bd808> |
Restarted elasticsearch on logstash1001 |
[production] |
03:28 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at 2014-02-24 03:27:55+00:00 |
[production] |
02:44 |
<LocalisationUpdate> |
completed (1.23wmf15) at 2014-02-24 02:44:44+00:00 |
[production] |
02:31 |
<LocalisationUpdate> |
completed (1.23wmf14) at 2014-02-24 02:31:50+00:00 |
[production] |
01:16 |
<tstarling> |
updated /a/common/php-1.23wmf15 to {{Gerrit|I268599be9}}: [1.23wmf15] Make SiteStats (re)initializing more sane |
[production] |
01:16 |
<tstarling> |
synchronized php-1.23wmf14/includes/SiteStats.php |
[production] |
2014-02-23
|
20:29 |
<Tim> |
updated ss_active_users on plwiki master to not be -1 |
[production] |
20:14 |
<springle> |
killed SiteStatsInit from both wikiuser and wikiadmin on all s2 slaves |
[production] |
20:01 |
<Tim> |
killed SiteStatsInit queries on db1060 |
[production] |
19:57 |
<tstarling> |
synchronized php-1.23wmf15/includes/SiteStats.php |
[production] |
19:56 |
<tstarling> |
synchronized php-1.23wmf14/includes/SiteStats.php |
[production] |
19:48 |
<RobH> |
operations folks are looking into site issues at present |
[production] |
19:38 |
<greg-g> |
< paravoid> something that has to do with SiteStatsInit, probably |
[production] |
19:33 |
<greg-g> |
< paravoid> it's all plwiki |
[production] |
19:33 |
<greg-g> |
< paravoid> tons of SELECT /* SiteStatsInit::edits */ COUNT(*) FROM `revision` LIMIT 1 |
[production] |
19:32 |
<greg-g> |
< paravoid> it's s2 |
[production] |
02:08 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at 2014-02-23 02:08:36+00:00 |
[production] |
02:02 |
<LocalisationUpdate> |
completed (1.23wmf15) at 2014-02-23 02:02:44+00:00 |
[production] |
02:02 |
<LocalisationUpdate> |
completed (1.23wmf14) at 2014-02-23 02:01:56+00:00 |
[production] |
2014-02-21
|
22:12 |
<mwalker> |
updated civicrm from eb3536eb32cbc7400e4e5884d56fbf104e38fc2b to 41dce289bc15ea1ca638c37b29ff2e3e709a2251 for thank you templates |
[production] |
21:40 |
<bd808> |
mw1047 and mw1079 errors cleared after apache-graceful |
[production] |
21:29 |
<mutante> |
graceful'ing apache on mw1047 and mw1079 by request |
[production] |
21:26 |
<bd808> |
mw1047 and mw1079 throwing PHP exception that looks like APC corruption |
[production] |
20:35 |
<bd808> |
Finished scap: no-diff scap; recording asciicast (duration: 03m 13s) |
[production] |
20:31 |
<bd808> |
Started scap: no-diff scap; recording asciicast |
[production] |
18:57 |
<catrope> |
synchronized php-1.23wmf15/extensions/VisualEditor/modules/ve-mw/init/targets/ve.init.mw.ViewPageTarget.js 'touch' |
[production] |
18:57 |
<catrope> |
synchronized php-1.23wmf15/extensions/VisualEditor/modules/ve-mw/init/ve.init.mw.Target.js 'touch' |
[production] |
18:57 |
<catrope> |
synchronized php-1.23wmf14/extensions/VisualEditor/modules/ve-mw/init/targets/ve.init.mw.ViewPageTarget.js 'touch' |
[production] |
18:56 |
<catrope> |
synchronized php-1.23wmf14/extensions/VisualEditor/modules/ve-mw/init/ve.init.mw.Target.js 'touch' |
[production] |
18:55 |
<bd808> |
The 4 hosts that failed scap-rebuild-cdbs were snapshot[1234]; can we pull them from mediawiki-installation dsh group? |
[production] |
18:54 |
<bd808> |
Finished scap: no-diff scap to test script changes; expect l10n updates (duration: 13m 38s) |
[production] |
18:54 |
<bd808> |
scap-rebuild-cdbs failed on 4 hosts |
[production] |
18:50 |
<bd808> |
The 4 hosts that failed scap-1 were snapshot[1234]; all have old/bad python installs |
[production] |
18:49 |
<bd808> |
scap-1 failed on 4 hosts |
[production] |
18:41 |
<bd808> |
Started scap: no-diff scap to test script changes; expect l10n updates |
[production] |
18:36 |
<bd808> |
Forced update of /srv/scap to 6203585 across cluster |
[production] |
18:25 |
<ottomata> |
initiating kafka preferred replica election to rebalance partition leaders |
[production] |
18:19 |
<bblack> |
cp1054 healthy now, rebuilding persistent cache from scratch there... |
[production] |
15:30 |
<Jeff_Green> |
dist-upgrade and reboot boron |
[production] |
13:29 |
<akosiaris> |
just resized 208.80.155.64/26 to 208.80.155.64/28. This is the Sandbox1-b-eqiad subnet. dickson.freenode.net needs to have its netmask changed. I will talk with coren, mutante |
[production] |
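Note on the subnet resize above: a minimal sketch, using only Python's standard `ipaddress` module, of what changes for a host when the prefix goes from /26 to /28. The two prefixes come from the log entry; the `summarize` helper is a hypothetical name used for illustration.

```python
import ipaddress

old = ipaddress.ip_network("208.80.155.64/26")
new = ipaddress.ip_network("208.80.155.64/28")


def summarize(net):
    """Collect the values a host on this subnet would need to configure."""
    return {
        "netmask": str(net.netmask),
        "addresses": net.num_addresses,
        "broadcast": str(net.broadcast_address),
    }


print("old:", summarize(old))
print("new:", summarize(new))
# True when every address of the new range lies inside the old range.
print("new range inside old range:", new.subnet_of(old))
```

A host that keeps the old /26 netmask would still treat addresses above the new broadcast address as on-link, which is why the netmask on the affected host has to be updated.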
10:05 |
<ori> |
updated /a/common to {{Gerrit|I10170d77c}}: Set $wmfExtendedVersionNumber = $wmfVersionNumber |
[production] |
09:53 |
<ori> |
synchronized php-1.23wmf14/extensions/MultimediaViewer/resources/mmv/mmv.performance.js 'I41b6e975353: Backport fix for stats.bandwidth == Infinity' |
[production] |