2014-06-18
§
|
17:10 |
<RobH> |
magnesium back to proper function |
[production] |
17:09 |
<RobH> |
apache2ctl restart on magnesium, racktables wasn't working |
[production] |
16:55 |
<bd808> |
Setup hourly cron as user bd808 on deployment-salt to test automatic update of puppet repo using ~bd808/git-sync-upstream script |
[releng] |
16:24 |
<bblack> |
rebooting lvs4001 for kenerl + num_queues |
[production] |
16:19 |
<bblack> |
rebooting lvs4002 for kenerl + num_queues |
[production] |
15:20 |
<bblack> |
rebooting lvs4003 for kernel / num_queues updates |
[production] |
15:17 |
<bblack> |
rebooting lvs4004 for kernel / num_queues updates |
[production] |
15:10 |
<anomie> |
Synchronized php-1.24wmf9/extensions/Scribunto/engines/LuaCommon/SiteLibrary.php: SWAT: Fix Scribunto-related exceptions on testwiki [[gerrit:140370]] (duration: 00m 14s) |
[production] |
13:40 |
<_joe_> |
restarted profiler-to-carbon, stuck (again) waiting for mwprof |
[production] |
13:25 |
<springle> |
script rt-7708.pl hitting m2-master eventlogging from terbium for RT #7708. fine to kill if necessary |
[production] |
10:01 |
<hashar> |
Updated our Jenkins job builder fork: 8cbc93a..416ee7d |
[production] |
08:26 |
<_joe_> |
disk is gone, powering down ms-be1007, opening ticket for disk replacement |
[production] |
08:24 |
<_joe_> |
stopped swift on ms-be1007, unmounting volume to check for repair |
[production] |
06:01 |
<springle> |
restarted gmetad on nickel while unbreaking the mysql graphs I broke on ganglia |
[production] |
04:30 |
<ori> |
enabled puppet on polonium (was disabled but nothing in SAL) |
[production] |
02:59 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Wed Jun 18 02:58:22 UTC 2014 (duration 58m 21s) |
[production] |
02:26 |
<LocalisationUpdate> |
completed (1.24wmf9) at 2014-06-18 02:25:03+00:00 |
[production] |
02:23 |
<MaxSem> |
searchidx1001 outta sync - running sync-common |
[production] |
02:14 |
<LocalisationUpdate> |
completed (1.24wmf8) at 2014-06-18 02:13:34+00:00 |
[production] |
02:05 |
<Krinkle> |
Nevermind, graphite.wikimedia.org going down is due to overload which recovers eventually (it just has). Has become SNAFU/FIXME. |
[production] |
02:02 |
<Krinkle> |
graphite.wikimedia.org is down with HTTP 502 Bad Gateway errors |
[production] |
01:49 |
<ori> |
puppet freshness on tungsten and stat1001 can be fixed with https://gerrit.wikimedia.org/r/#/c/140269/ |
[production] |
2014-06-17
§
|
20:36 |
<bd808> |
Upgraded elasticsearch to version 1.2.1 on deployment-logstash1 |
[releng] |
20:19 |
<maxsem> |
Synchronized php-1.24wmf9/extensions/MobileFrontend/: https://gerrit.wikimedia.org/r/#/c/140178/ (duration: 00m 04s) |
[production] |
20:17 |
<maxsem> |
Synchronized php-1.24wmf8/extensions/MobileFrontend/: https://gerrit.wikimedia.org/r/#/c/140178/ (duration: 00m 05s) |
[production] |
20:01 |
<hoo> |
Synchronized php-1.24wmf9/extensions/Wikidata/: Update Wikidata to fix editing site links (duration: 00m 24s) |
[production] |
18:23 |
<reedy> |
Synchronized docroot and w: (no message) (duration: 00m 16s) |
[production] |
18:22 |
<reedy> |
rebuilt wikiversions.cdb and synchronized wikiversions files: Non Wikipedias to 1.24wmf9 |
[production] |
18:05 |
<demon> |
Synchronized wmf-config/PoolCounterSettings-eqiad.php: Limit regex searches before they start landing on wikis (duration: 00m 04s) |
[production] |
16:32 |
<bblack> |
enabled amssq31-46 esams text frontend varnishes in pybal (were misconfigured; wrong domainname) |
[production] |
15:18 |
<manybubbles> |
Synchronized php-1.24wmf8/extensions/CirrusSearch/: SWAT - Fix Cirrus Special:Random (duration: 00m 04s) |
[production] |
15:13 |
<manybubbles> |
Synchronized php-1.24wmf9/extensions/CirrusSearch/: SWAT - Fix Cirrus Special:Random (duration: 00m 04s) |
[production] |
15:02 |
<manybubbles> |
Synchronized wmf-config/InitialiseSettings.php: SWAT - lower event logging rate for mediaviewer (duration: 00m 05s) |
[production] |
13:51 |
<_joe_> |
production puppet masters upgraded to puppet 3 |
[production] |
07:12 |
<springle> |
starting updateCollation on s3 frwikinews from tin |
[production] |
07:07 |
<springle> |
Synchronized wmf-config/InitialiseSettings.php: $wgCategoryCollation to uca-fr on frwikinews (duration: 00m 07s) |
[production] |
03:20 |
<LocalisationUpdate> |
ResourceLoader cache refresh completed at Tue Jun 17 03:19:12 UTC 2014 (duration 19m 11s) |
[production] |
02:35 |
<LocalisationUpdate> |
completed (1.24wmf9) at 2014-06-17 02:34:09+00:00 |
[production] |
02:23 |
<LocalisationUpdate> |
completed (1.24wmf8) at 2014-06-17 02:22:46+00:00 |
[production] |
2014-06-16
§
|
23:12 |
<maxsem> |
Synchronized php-1.24wmf8/extensions/MobileFrontend/: https://gerrit.wikimedia.org/r/#/c/139562/ (duration: 00m 05s) |
[production] |
23:11 |
<maxsem> |
Synchronized php-1.24wmf9/extensions/MobileFrontend/: https://gerrit.wikimedia.org/r/#/c/139562/ (duration: 00m 06s) |
[production] |
23:05 |
<maxsem> |
Synchronized wmf-config/CommonSettings.php: https://gerrit.wikimedia.org/r/#/c/139888/ (duration: 00m 08s) |
[production] |
21:30 |
<ori> |
upgraded eventlogging to 3012aad |
[production] |
21:16 |
<bd808> |
Jenkins beta-scap-eqiad job broken because of missing puppet config on deployment-jobrunner01; needs role::beta::scap_target |
[releng] |
20:45 |
<ori> |
updated eventlogging to b4b42effc6 |
[production] |
20:36 |
<bd808> |
Enabled puppet on deployment-jobrunner01 and forced a run |
[releng] |
20:34 |
<bd808> |
Puppet disabled on deployment-jobrunner01 since 2014-06-03; No SAL logs explaining why |
[releng] |
20:19 |
<bd808> |
Updated scap to 5adce72; trebuchet reported i-00000237 (deployment-videoscaler01) as not updating, but manual check shows it did sync properly |
[releng] |
20:00 |
<bd808> |
Deleted /var/lib/puppet/state/agent_catalog_run.lock on deployment-bastion after verifying that no puppet processes were running |
[releng] |
19:55 |
<bd808> |
Truncated /var/log/diamond/diamond.log and restarted diamond on deployment-bastion |
[releng] |