2014-04-01
§
|
21:38 |
<hashar> |
Removed the Zuul triggers that updated beta cluster in PMTPA {{gerrit|123100}}. |
[releng] |
19:49 |
<bd808> |
Converted deployment-graphite.eqiad.wmflabs to use local puppet & salt masters |
[releng] |
19:20 |
<bd808> |
Deleting and re-creating deployment-graphite because I forgot to add the web security group |
[releng] |
15:57 |
<andrewbogott> |
shutting down all pmtpa instances |
[releng] |
14:32 |
<manybubbles> |
completed upgrade to Elasticsearch 1.1.0 and fixed deployment-elastic04. |
[releng] |
13:32 |
<hashar> |
Thumbs access more or less fixed |
[releng] |
13:31 |
<hashar> |
deployment-upload is rejecting connections on port 80. Applying role::beta::uploadservice from {{gerrit|122786}} |
[releng] |
13:30 |
<manybubbles> |
upgrading labs Elasticsearch to 1.1.0 |
[releng] |
08:31 |
<hashar> |
MediaWiki config path tweaks for Math {{bug|63331}} and Captchas {{bug|63342}} |
[releng] |
00:32 |
<bd808> |
Converted deployment-graphite to use local puppet & salt masters |
[releng] |
2014-03-31
§
|
21:02 |
<hashar> |
Making the Parsoid daemon write its logs to /data/project/parsoid/parsoid.log {{gerrit|122561}} |
[releng] |
20:17 |
<hashar> |
restarted parsoid daemon |
[releng] |
20:00 |
<hashar> |
stopped parsoid. It is killing the application servers |
[releng] |
19:53 |
<hashar> |
restarting both apaches |
[releng] |
19:21 |
<hashar> |
restarting job service on jobrunner01 to apply {{gerrit|122436}} |
[releng] |
19:20 |
<hashar> |
Unbreak puppetmaster on deployment-salt.eqiad.wmflabs |
[releng] |
19:01 |
<hashar> |
puppet master is broken :( |
[releng] |
17:39 |
<hashar> |
lowering # of jobs spawned by the jobrunner {{gerrit|122436}} |
[releng] |
16:00 |
<bd808> |
Restarted logstash service on deployment-logstash1; no new log events seen since 2014-03-28T10:57 |
[releng] |
15:58 |
<bd808> |
Updated kibana on deployment-logstash1 to e317bc6 |
[releng] |
15:56 |
<hashar_> |
Cluster slow because some CirrusSearch job is spamming simplewiki. Gotta find a way to throttle the number of jobs being run on jobrunner01 or add more apache boxes. It is transient anyway, might look at limiting the runs tonight |
[releng] |
15:10 |
<hashar_> |
Rebased puppet repository. Only one hack left: https://gerrit.wikimedia.org/r/#/c/119534/ |
[releng] |
14:20 |
<hashar> |
deleting deployment-parsoidcache01 cache the hard way: stopping varnish, deleting files in /srv/vdb/, starting varnish |
[releng] |
14:05 |
<hashar> |
shutting down database and apache boxes for now. |
[releng] |
14:03 |
<hashar> |
shutting down varnish instances in pmtpa |
[releng] |
13:56 |
<hashar> |
Deleted deployment-cache-upload01, replaced by deployment-cache-upload02 |
[releng] |
13:52 |
<hashar> |
upload varnish cache working :-] |
[releng] |
13:47 |
<hashar> |
applying role::cache::upload to role-cache-upload02 |
[releng] |
13:37 |
<hashar> |
migrating deployment-cache-upload02.eqiad.wmflabs to self puppet/salt master |
[releng] |
13:22 |
<hashar> |
Creating deployment-cache-upload02 to replace deployment-cache-upload01 which was missing the security group "web" |
[releng] |