2014-04-01
§
|
21:38 |
<hashar> |
Removed the Zuul triggers that updated beta cluster in PMTPA {{gerrit|123100}}. |
[releng] |
19:49 |
<bd808> |
Converted deployment-graphite.eqiad.wmflabs to use local puppet & salt masters |
[releng] |
19:20 |
<bd808> |
Deleting and re-creating deployment-graphite because I forgot to add the web security group |
[releng] |
15:57 |
<andrewbogott> |
shutting down all pmtpa instances |
[releng] |
14:32 |
<manybubbles> |
completed upgrade to Elasticsearch 1.1.0 and fixed deployment-elastic04. |
[releng] |
13:32 |
<hashar> |
Thumbs access more or less fixed |
[releng] |
13:31 |
<hashar> |
deployment-upload is rejecting connection on port 80. Applying role::beta::uploadservice from {{gerrit|122786}} |
[releng] |
13:30 |
<manybubbles> |
upgrading labs Elasticsearch to 1.1.0 |
[releng] |
08:31 |
<hashar> |
MediaWiki config paths tweaks for Math {{bug|63331}} and Captchas {{bug|63342}} |
[releng] |
00:32 |
<bd808> |
Converted deployment-graphite to use local puppet & salt masters |
[releng] |
2014-03-31
§
|
21:02 |
<hashar> |
Making Parsoid daemon to write its logs to /data/project/parsoid/parsoid.log {{gerrit|122561}} |
[releng] |
20:17 |
<hashar> |
restarted parsoid daemon |
[releng] |
20:00 |
<hashar> |
stopped parsoid . It is killing the application servers |
[releng] |
19:53 |
<hashar> |
restarting both apaches |
[releng] |
19:21 |
<hashar> |
restarting job service on jobrunner01 to apply {{gerrit|122436}} |
[releng] |
19:20 |
<hashar> |
Unbreak puppetmaster on deployment-salt.eqiad.wmflabs |
[releng] |
19:01 |
<hashar> |
puppet master is broken :( |
[releng] |
17:39 |
<hashar> |
lowering # of jobs spawned by the jobrunner {{gerrit|122436}} |
[releng] |
16:00 |
<bd808> |
Restarted logstash service on deployment-logstash1; no new log events seen since 2014-03-28T10:57 |
[releng] |
15:58 |
<bd808> |
Updated kibana on deployment-logstash1 to e317bc6 |
[releng] |
15:56 |
<hashar_> |
Cluster slow because some CirrusSearch job is spamming simplewiki . Gotta find a way to throttle the number of jobs being run on jobrunner01 or add more apache boxes . It is transient anyway, might look at limiting the runs tonight |
[releng] |