2016-05-02 §
17:33 <elukey> reverted Varnish config to return 503s for datasets and stats [analytics]
12:14 <elukey> deployed Varnish change to force HTTP 503 for datasets.wikimedia.org, stats.wikimedia.org, metrics.wikimedia.org as prep-step for OS reimage. [analytics]
12:05 <elukey> enabled maintenance banner to dashiki based dashboards via https://meta.wikimedia.org/wiki/Dashiki:OutOfService [analytics]
11:21 <elukey> deployed latest version of Event Logging. Service also restarted. [analytics]
2016-04-30 §
13:42 <elukey> disabled puppet on analytics1047 and scheduled downtime for the host, IO errors in the dmesg for /dev/sdd. Stopped also Hadoop daemons to remove it from the cluster temporarily (not sure how to do it properly, will write docs). [analytics]
2016-04-28 §
10:44 <joal> deployed aqs on all three nodes (Thanks elukey !!!!) [analytics]
09:03 <joal> Deploying aqs on aqs1001 [analytics]
08:14 <elukey> restarting kafka on kafka{1012,1014,1022,1020,2001,2002} for Java upgrades. EL will be restarted as well (sigh) [analytics]
2016-04-27 §
15:47 <elukey> restarted event logging on eventlogging1001 [analytics]
14:01 <elukey> restarted Event Logging on eventlogging1001 [analytics]
13:53 <elukey> restarted kafka on kafka1018.eqiad.wmnet for Java upgrades [analytics]
2016-04-25 §
19:55 <nuria_> deployed new vitalsigns code to https://vital-signs.wmflabs.org [analytics]
17:43 <nuria_> deployed new vitalsigns code to https://vital-signs.wmflabs.org [analytics]
2016-04-22 §
09:23 <moritzm> installing irqbalance bugfix updates (preventing massive logspam on some systems) [analytics]
2016-04-20 §
16:06 <elukey> camus re-enabled on analytics1027 [analytics]
13:54 <elukey> puppet stopped on analytics1027 together with Camus (via crontab -e) [analytics]
10:41 <elukey> started rsync of /srv from stat1001 to stat1004 (/srv/stat1001) [analytics]
2016-04-19 §
08:33 <joal> deployed new refinery on hadoop [analytics]
08:21 <joal> deploying refinery from tin [analytics]
2016-04-18 §
10:11 <elukey> executed sudo eventloggingctl restart on eventlogging1001 [analytics]
2016-04-11 §
11:52 <joal> Restart refine job after deploy [analytics]
10:30 <joal> Deploying refinery on HDFS [analytics]
10:21 <joal> deploying refinery from tin [analytics]
09:13 <joal> Releasing refinery-source v0.0.30 to archiva [analytics]
2016-04-08 §
10:09 <joal> deploying aqs from tin on aqs1003 [analytics]
10:08 <joal> deploying aqs from tin on aqs1002 [analytics]
10:03 <joal> deploying aqs from tin on aqs1001 [analytics]
2016-04-07 §
22:58 <nuria_> deployed browser-reports master branch to labs [analytics]
19:34 <ottomata> restarting eventlogging so it runs out of the scap deploy in eventlogging/analytics [analytics]
10:21 <elukey> nodejs-legacy upgraded too on all aqs nodes [analytics]
09:43 <elukey> aqs1002.eqiad.wmnet re-pooled, aqs1003.eqiad.wmnet de-pooled/re-pooled too (nodejs upgrade) [analytics]
09:30 <elukey> aqs1002.eqiad.wmnet de-pooled via confctl. Nodejs upgrade will follow. [analytics]
09:18 <elukey> re-added aqs1001.eqiad.wmnet to LVS pool via confctl [analytics]
08:59 <elukey> removed aqs1001.eqiad.wmnet from LVS pool via confd for nodejs upgrade [analytics]
2016-04-06 §
14:04 <elukey> ran nodetool repair system_auth on aqs1002.eqiad/aqs1003.eqiad [analytics]
13:59 <elukey> ran nodetool repair system_auth on aqs1001.eqiad [analytics]
11:45 <elukey> started nodetool repair on aqs1002 after running "ALTER KEYSPACE system_auth WITH replication = { 'class': 'SimpleStrategy', 'replication_factor': 3 };" [analytics]
2016-04-04 §
15:45 <elukey> aqs1001 re-added to the aqs pool (nodejs NOT upgraded) [analytics]
14:46 <elukey> de-pooled aqs1001.eqiad from the confd pool for nodejs upgrade [analytics]
10:42 <elukey> re-pooled aqs1001.eqiad (no node upgrade, need more info about restbase) [analytics]
09:53 <elukey> de-pooled aqs1001.eqiad.wmnet as pre-step for nodejs upgrade [analytics]
2016-04-01 §
13:23 <joal> Deploying aqs in aqs1001 from tin [analytics]
2016-03-31 §
20:01 <ottomata> stopping eventlogging, uninstalling globally installed eventlogging python code, running puppet, restarting eventlogging from /srv/deployment/eventlogging/eventlogging [analytics]
19:45 <ottomata> merging puppet change to run eventlogging code out of deploy repo [analytics]
2016-03-30 §
18:06 <ottomata> repooling aqs1001 [analytics]
18:00 <ottomata> depooling aqs1001 [analytics]
2016-03-29 §
13:27 <joal> Update CirrusSearchRequestSet schema in hive [analytics]
2016-03-24 §
18:29 <elukey> camus and puppet re-enabled on analytics1027 [analytics]
18:27 <ottomata> resuming suspended webrequest load and refine jobs [analytics]
17:57 <elukey> enabled Hadoop Master Node automatic failover on analytics1001/1002 (this time without fireworks). [analytics]