2014-09-15
§
|
17:14 |
<bd808> |
Restarted elasticsearch on logstash1003; 2014-09-14T09:33:57Z java.lang.OutOfMemoryError |
[production] |
17:09 |
<_joe_> |
killing salt-call on all mediawiki hosts |
[production] |
17:06 |
<bd808> |
Restarted elasticsearch on logstash1001; 2014-09-15T06:12:09Z java.lang.OutOfMemoryError |
[production] |
17:04 |
<bblack> |
using salt to kill salt-minion everywhere... |
[production] |
17:02 |
<bd808> |
Restarted logstash on logstash1001. I hoped this would fix the dashboards, but it looks like the backing elasticsearch cluster is too sad for them to work at the moment. |
[production] |
16:55 |
<bd808> |
Restarted hung elasticsearch service on logstash1002 |
[production] |
16:15 |
<manybubbles> |
jawiki now has cirrus as primary. we're back to where we were before the great cascading failure of two months ago |
[production] |
16:13 |
<manybubbles> |
Synchronized wmf-config/InitialiseSettings.php: (no message) (duration: 00m 06s) |
[production] |
15:29 |
<marktraceur> |
Synchronized php-1.24wmf21/extensions/MultimediaViewer/: [SWAT] Several backports for metrics and bugfixes in Media Viewer (duration: 00m 07s) |
[production] |
15:27 |
<marktraceur> |
Synchronized php-1.24wmf20/extensions/MultimediaViewer/: [SWAT] Several backports for metrics and bugfixes in Media Viewer (duration: 00m 07s) |
[production] |
15:18 |
<marktraceur> |
Synchronized php-1.24wmf21/extensions/GeoCrumbs/GeoCrumbs.class.php: [SWAT] Handle return value NULL of GeoCrumbs::getParserCache (duration: 00m 07s) |
[production] |
15:17 |
<marktraceur> |
Synchronized php-1.24wmf20/extensions/GeoCrumbs/GeoCrumbs.class.php: [SWAT] Handle return value NULL of GeoCrumbs::getParserCache (duration: 00m 07s) |
[production] |
15:06 |
<marktraceur> |
Synchronized wmf-config/: [SWAT] Remove 'renameuser' right from bureaucrats on CentralAuth wikis (duration: 00m 09s) |
[production] |
14:54 |
<aude> |
Synchronized wmf-config/Wikibase.php: Bump wikibase memcached key for test.wikidata, test, test2 (duration: 00m 16s) |
[production] |
14:54 |
<hashar> |
Updated Jenkins Job Builder fork: e5c0c61..2d74b16 |
[production] |
14:50 |
<aude> |
Finished scap: Put test.wikidata back on mw1.24-wmf19 extension branch (duration: 37m 27s) |
[production] |
14:43 |
<manybubbles> |
restarting the enwiki cirrus reindex process - it crashed over the weekend. why you crash and leave error message "1". "1" is not a useful error message. |
[production] |
14:13 |
<aude> |
Started scap: Put test.wikidata back on mw1.24-wmf19 extension branch |
[production] |
13:03 |
<_joe_> |
fenari is swapping hard, restarting apache who was eating up all the RAM |
[production] |
09:20 |
<hashar> |
Synchronized wmf-config/InitialiseSettings.php: *.scienceimage.csiro.au to the wgCopyUploadsDomains {{gerrit|159999}} {{bug|70771}} (duration: 00m 06s) |
[production] |
09:16 |
<hashar> |
Jenkins: apt-get upgrade on prod slaves (updates php5 / libc / jdk 7) |
[production] |
03:09 |
<springle> |
Synchronized wmf-config/db-eqiad.php: depool db1036 (duration: 00m 09s) |
[production] |
02:03 |
<LocalisationUpdate> |
failed: mwversionsinuse returned empty list |
[production] |
01:47 |
<hoo> |
Synchronized wmf-config/liquidthreads.php: Remove global $path (duration: 00m 07s) |
[production] |
01:47 |
<hoo> |
Synchronized wmf-config/flaggedrevs.php: Remove global $path (duration: 00m 10s) |
[production] |
2014-09-14
§
|
20:37 |
<ori_> |
enabling puppet on mw1053 |
[production] |
20:11 |
<springle> |
Synchronized wmf-config/db-eqiad.php: depool db1062, locked up (duration: 00m 09s) |
[production] |
13:24 |
<_joe_> |
stopped puppet aand the JR on mw1053 |
[production] |
12:42 |
<hoo> |
Ran sync-common on mw1053 to stop "Unrecognized job type 'ChangeNotification'." exceptions |
[production] |
11:14 |
<springle> |
Synchronized wmf-config/db-eqiad.php: repool es1005 (duration: 00m 07s) |
[production] |
10:37 |
<springle> |
restart es1005 |
[production] |
09:56 |
<springle> |
Synchronized wmf-config/db-eqiad.php: repool es1007, depool es1005 (duration: 00m 10s) |
[production] |
02:01 |
<LocalisationUpdate> |
failed: mwversionsinuse returned empty list |
[production] |
00:45 |
<ori_> |
fenari appears to still have twemproxy (in addition to nutcracker); decom'ing. |
[production] |
00:29 |
<ori_> |
restarting apache2 on fenari |
[production] |
2014-09-13
§
|
04:42 |
<legoktm> |
global rename for Trevor Parscal (WMF) unstuck itself, yay |
[production] |
04:22 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Sat Sep 13 04:22:04 UTC 2014 (duration 22m 3s) |
[production] |
03:51 |
<legoktm> |
global rename for Trevor Parscal --> Trevor Parscal (WMF) looks stuck on metawiki and mswiki, in queued state for both but showJobs.php says the jobs are active and claimed |
[production] |
03:11 |
<LocalisationUpdate> |
completed (1.24wmf21) at 2014-09-13 03:11:40+00:00 |
[production] |
02:38 |
<LocalisationUpdate> |
completed (1.24wmf20) at 2014-09-13 02:38:26+00:00 |
[production] |
01:45 |
<ori> |
Synchronized php-1.24wmf21/extensions/Flow: Update flow for I4da934dfe (duration: 00m 06s) |
[production] |
01:45 |
<ori> |
Synchronized php-1.24wmf20/extensions/Flow: Update flow for I4da934dfe (duration: 00m 06s) |
[production] |
01:41 |
<ori> |
Synchronized php-1.24wmf20/extensions/Flow: Update flow for I4da934dfe (duration: 00m 08s) |
[production] |
2014-09-12
§
|
21:26 |
<csteipp> |
deployed fixes for bugs 70620, 69008 |
[production] |
20:37 |
<mattflaschen> |
Synchronized php-1.24wmf21/extensions/GettingStarted/: Deploy to fix GettingStarted bucketting for users with null registration date (duration: 00m 05s) |
[production] |
20:37 |
<mattflaschen> |
Synchronized php-1.24wmf20/extensions/GettingStarted/: Deploy to fix GettingStarted bucketting for users with null registration date (duration: 00m 07s) |
[production] |
19:34 |
<legoktm> |
running migratePass0.php across all CentralAuth wikis |
[production] |
17:43 |
<ori> |
updated /a/common to {{Gerrit|I4e4187285}}: Rename some constants to clarify their meaning and purpose |
[production] |
14:52 |
<manybubbles> |
rebuilding enwiki's Cirrus index for more performance testing. Please be faster now. k? |
[production] |
08:37 |
<_joe_> |
rolling restart of pybal finished. Adding note on Fenari |
[production] |