2016-08-04 §
19:50 <ottomata> now running kafka-python 1.2.5 for eventlogging-service-eventbus in codfw, removed downtime for kafka200[12] [analytics]
17:36 <elukey> added the analytics-deploy key to the Keyholder for the Analytics Refinery scap3 migration (also updated https://wikitech.wikimedia.org/wiki/Keyholder) [analytics]
17:28 <elukey> deploying the refinery with scap3 for the first time on all nodes [analytics]
2016-07-29 §
01:55 <milimetric> limn1 disk full, no idea how to clean it because /public refuses to list its files or listen to me when I try to delete it [analytics]
2016-07-28 §
17:37 <ottomata> powercycling analytics1032 [analytics]
2016-07-26 §
10:13 <joal> Re-deploying refinery after bug fix [analytics]
09:26 <joal> Deploying refinery [analytics]
08:41 <joal> Deploying refinery-source using Jenkins [analytics]
2016-07-25 §
18:31 <ottomata> upgrading kafka to 0.9 in main-codfw, first kafka2001 then 2002 [analytics]
2016-07-20 §
19:40 <joal> Relaunch 2016-07-19 cassandra per-article-daily oozie job [analytics]
15:45 <elukey> executed https://phabricator.wikimedia.org/P3520 on aqs100[456] for both a/b cassandra instances [analytics]
15:32 <elukey> raising compaction throughput to 256 on aqs100[456] [analytics]
2016-07-18 §
17:16 <joal> Change compression from lz4 to deflate on aqs100[456] [analytics]
17:16 <joal> Change compression from lz4 to deflate [analytics]
08:59 <joal> deploy restabase on aqs100[23] [analytics]
08:36 <elukey> re-executed cassandra-daily-wf-local_group_default_T_pageviews_per_article_flat-2016-7-16 (failed oozie job) [analytics]
2016-07-15 §
15:29 <ottomata> restarting hadoop-mapreduce-historyserver to apply yarn log aggregation retention settings [analytics]
2016-07-14 §
20:02 <ottomata> restarting hadoop-yarn-resourcemanager on analytics1002 and then analytics1001 to apply yarn log aggregation change [analytics]
2016-07-13 §
13:45 <ottomata> restarting hadoop nodemanagers to apply log aggregation retention check interval change [analytics]
13:14 <elukey> varnishkafka upgraded from 1.0.10-1 to 1.0.11-1 manually on cp3008.esams (misc) and via apt for the whole cache maps cluster [analytics]
09:05 <joal> Deploying refinery to HDFS [analytics]
08:59 <joal> deploying refinery from tin [analytics]
2016-07-12 §
17:41 <joal> Insert test data in aqs100[456] to prevent false alarms [analytics]
13:05 <ottomata> restarting nodemanagers on analytics 1039 1046 and 1054 [analytics]
2016-07-11 §
20:31 <ottomata> rolling restart of hadoop-yarn-nodemanager to apply log aggregation retention seconds [analytics]
11:33 <joal> Deploying aqs on aqs100[456] (new cluster, no traffic) [analytics]
11:22 <joal> Successful deployment in beta - Deploying aqs on aqs1001 as canary [analytics]
11:18 <joal> deploying aqs on deployment-prep [analytics]
2016-07-04 §
20:38 <joal> Insert monitoring test data into cassandra on hosts aqs100[456] to prevent icinga alarms [analytics]
20:38 <joal> Insert monitoring test data to make tests pass [analytics]
2016-06-22 §
15:01 <elukey> rebooting bohrium.eqiad.wmnet (running piwik) for kernel upgrades [analytics]
2016-06-15 §
08:34 <joal> Restart misc load job with 10% data loss error threshold [analytics]
2016-06-09 §
14:37 <elukey> Tested retention.bytes=2G for kafka webrequest_misc [analytics]
14:36 <elukey> Tested retention.bytes=2G for kafka webrequest_misc - setting removed [analytics]
2016-06-08 §
18:11 <elukey> removed retention.bytes override configuration for kafka webrequest_text (didn't work) [analytics]
16:03 <elukey> temporarily set a 10TB upper bound on the Kafka webrequest_text topic to free space [analytics]
08:45 <elukey> removed temporary retention override for kafka webrequest_text topic (T136690) [analytics]
08:17 <elukey> lowering webrequest_text kafka topic retention time from 7 days to 4 days to free disk space [analytics]
2016-06-07 §
17:51 <ottomata> restarting broker on kafka1020 [analytics]
10:10 <elukey> hue restarted on analytics1027 for security upgrades [analytics]
2016-06-06 §
19:15 <ottomata> restarting kafka broker on kafka1020 to test python consumption client [analytics]
2016-06-04 §
09:47 <elukey> removed temporary Analytics Kafka upload retention override (T136690) [analytics]
09:38 <elukey> Temporarily lowering the Analytics kafka upload retention time to 24h to free space (T136690) [analytics]
2016-06-03 §
08:38 <elukey> event logging restarted on eventlog1001 [analytics]
08:34 <elukey> rebooting kafka1012 for kernel upgrades. [analytics]
2016-06-02 §
19:53 <ottomata> stopping kafka broker and restarting kafka1014 [analytics]
2016-06-01 §
18:16 <ottomata> stopping kafka broker on kafka1018 and rebooting node [analytics]
11:55 <elukey> restarted EL on eventlog1001 [analytics]
11:51 <elukey> rebooting kafka1022 for kernel upgrades [analytics]
08:26 <elukey> deleted very old kafka.log files in /var/log/kafka to free root space [analytics]