3251-3300 of 3598 results (25ms)
2016-07-14 §
20:02 <ottomata> restarting hadoop-yarn-resourcemanager on analytics1002 and then analytics1001 to apply yarn log aggregation change [analytics]
2016-07-13 §
13:45 <ottomata> restarting hadoop nodemanagers to apply log aggregation retention check interval change [analytics]
13:14 <elukey> varnishkafka upgraded from 1.0.10-1 to 1.0.11-1 manually on cp3008.esams (misc) and via apt for the whole cache maps cluster [analytics]
09:05 <joal> Deploying refinery to HDFS [analytics]
08:59 <joal> deploying refinery from tin [analytics]
2016-07-12 §
17:41 <joal> Insert test data in aqs100[456] to prevent false alarms [analytics]
13:05 <ottomata> restarting nodemanagers on analytics 1039 1046 and 1054 [analytics]
2016-07-11 §
20:31 <ottomata> rolling restart of hadoop-yarn-nodemanager to apply log aggregation retention seconds [analytics]
11:33 <joal> Deploying aqs on aqs100[456] (new cluster, no traffic) [analytics]
11:22 <joal> Succesfull deployment in beta - Deploying aqs on aqs1001 as canary [analytics]
11:18 <joal> deploying aqs on deployment-prep [analytics]
2016-07-04 §
20:38 <joal> Insert monitoring test data into cassandra on hosts aqs100[456] to prevent icinga alarms [analytics]
20:38 <joal> Insert manitoring testto make tests pass [analytics]
2016-06-22 §
15:01 <elukey> rebooting bohrium.eqiad.wmnet (running piwik) for kernel upgrades [analytics]
2016-06-15 §
08:34 <joal> Restart misc load job with 10% data loss error threshold [analytics]
2016-06-09 §
14:37 <elukey> Tested retention.bytes=2G for kafka webrequest_misc [analytics]
14:36 <elukey> Tested retention.bytes=2G for kafka webrequest_misc - setting removed [analytics]
2016-06-08 §
18:11 <elukey> removed retention.bytes override configuration for kafka webrequest_text (didn't work) [analytics]
16:03 <elukey> temporary set a 10TB upperbound to the Kafka webrequest_text topic to free space [analytics]
08:45 <elukey> removed temporary retention override for kafka webrequest_text topic (T136690) [analytics]
08:17 <elukey> lowering down webrequest_text kafka topic retention time from 7 days to 4 days to free disk space [analytics]
2016-06-07 §
17:51 <ottomata> restarting broker on kafka1020 [analytics]
10:10 <elukey> hue restarted on analytics1027 for security upgrades [analytics]
2016-06-06 §
19:15 <ottomata> restarting kafka broker on kafka1020 to test python consumption client [analytics]
2016-06-04 §
09:47 <elukey> removed temporary Analytics Kafka upload retention override (T136690) [analytics]
09:38 <elukey> Lowering down temporarily the Analytics kafka upload retention time to 24h to free space (T136690) [analytics]
2016-06-03 §
08:38 <elukey> event logging restarted on eventlog1001 [analytics]
08:34 <elukey> rebooting kafka1012 for kernel upgrades. [analytics]
2016-06-02 §
19:53 <ottomata> stopping kafka broker and restarting kafka1014 [analytics]
2016-06-01 §
18:16 <ottomata> stopping kafka broker on kafka1018 and rebooting node [analytics]
11:55 <elukey> restarted EL on eventlog1001 [analytics]
11:51 <elukey> rebooting kafka1022 for kernel upgrades [analytics]
08:26 <elukey> deleted very old kafka.log files in /var/log/kafka to free root space [analytics]
07:54 <elukey> EL restarted on eventlog1001 [analytics]
07:47 <elukey> stopping kafka on kafka1020.eqiad and rebooting the host for Linux 4.4 upgrades [analytics]
2016-05-27 §
11:28 <elukey> restarted jmxtrans on kafka10* hosts [analytics]
11:26 <elukey> restarted jmxtrans on kafka1013 [analytics]
11:21 <elukey> executed kafka preferred-replica-election on kafka1013 [analytics]
2016-05-25 §
14:24 <joal> deploying aqs from tin [analytics]
14:16 <joal> Deploying aqs into aqs_deploy [analytics]
2016-05-24 §
19:25 <nuria_> deploying latest master to dashiki 08cc9a2545bcc0a183a3c00c18e81f21326a41b [analytics]
12:56 <elukey> EL restarted after kafka1013 node stop (kernel upgrades) [analytics]
12:49 <elukey> stopping kafka on kafka1013 and rebooting the host for kernel upgrade [analytics]
2016-05-23 §
17:28 <elukey> re-run from Hue webrequest-load-wf-(text|upload)-2016-5-23-13. The failures were likely caused by my global Yarn restart on the cluster. [analytics]
17:20 <elukey> oozie bundles re-enabled [analytics]
14:57 <elukey> suspended all the oozie bundles as prep step for https://gerrit.wikimedia.org/r/#/c/290252 (yes I know super paranoid mode on) [analytics]
06:42 <elukey> Removed Kafka temp. override for webrequest_upload retention.ms after freeing some disk space. [analytics]
06:37 <elukey> Set kafka retention.ms=172800000 for the topic webrequest_upload to free some disk space on kafka1022 [analytics]
2016-05-20 §
12:50 <elukey> aqs100[123] restarted for openjdk upgrades [analytics]
08:53 <elukey> cassandra upgraded to 2.1.13 on aqs1003 [analytics]