5451-5500 of 5815 results (29ms)
2016-08-04
§
|
19:50 |
<ottomata> |
now running kafka-python 1.2.5 for eventlogging-service-eventbus in codfw, removed downtime for kafka200[12] |
[analytics] |
17:36 |
<elukey> |
added the analytics-deploy key to the Keyholder for the Analytics Refinery scap3 migration (also updated https://wikitech.wikimedia.org/wiki/Keyholder) |
[analytics] |
17:28 |
<elukey> |
deploying the refinery with scap3 for the first time on all nodes |
[analytics] |
2016-07-29
§
|
01:55 |
<milimetric> |
limn1 disk full, no idea how to clean it because /public refuses to list its files or listen to me when I try to delete it |
[analytics] |
2016-07-28
§
|
17:37 |
<ottomata> |
powercycling analytics1032 |
[analytics] |
2016-07-26
§
|
10:13 |
<joal> |
Re-deploying refinery after bug fix |
[analytics] |
09:26 |
<joal> |
Deploying refinery |
[analytics] |
08:41 |
<joal> |
Deploying refinery-source using Jenkins |
[analytics] |
2016-07-25
§
|
18:31 |
<ottomata> |
upgrading kafka to 0.9 in main-codfw, first kafka2001 then 2002 |
[analytics] |
2016-07-20
§
|
19:40 |
<joal> |
Relaunch 2016-07-19 cassandra per-article-daily oozie job |
[analytics] |
15:45 |
<elukey> |
executed https://phabricator.wikimedia.org/P3520 on aqs100[456] for both a/b cassandra instances |
[analytics] |
15:32 |
<elukey> |
raising compaction throughput to 256 on aqs100[456] |
[analytics] |
2016-07-18
§
|
17:16 |
<joal> |
Change compression from lz4 to deflate on aqs100[456] |
[analytics] |
17:16 |
<joal> |
Change compression from lz4 to deflate |
[analytics] |
08:59 |
<joal> |
deploy restabase on aqs100[23] |
[analytics] |
08:36 |
<elukey> |
re-executed cassandra-daily-wf-local_group_default_T_pageviews_per_article_flat-2016-7-16 (failed oozie job) |
[analytics] |
2016-07-15
§
|
15:29 |
<ottomata> |
restarting hadoop-mapreduce-historyserver to apply yarn log aggreation retention settings |
[analytics] |
2016-07-14
§
|
20:02 |
<ottomata> |
restarting hadoop-yarn-resourcemanager on analytics1002 and then analytics1001 to apply yarn log aggregation change |
[analytics] |
2016-07-13
§
|
13:45 |
<ottomata> |
restarting hadoop nodemanagers to apply log aggregation retention check interval change |
[analytics] |
13:14 |
<elukey> |
varnishkafka upgraded from 1.0.10-1 to 1.0.11-1 manually on cp3008.esams (misc) and via apt for the whole cache maps cluster |
[analytics] |
09:05 |
<joal> |
Deploying refinery to HDFS |
[analytics] |
08:59 |
<joal> |
deploying refinery from tin |
[analytics] |
2016-07-12
§
|
17:41 |
<joal> |
Insert test data in aqs100[456] to prevent false alarms |
[analytics] |
13:05 |
<ottomata> |
restarting nodemanagers on analytics 1039 1046 and 1054 |
[analytics] |
2016-07-11
§
|
20:31 |
<ottomata> |
rolling restart of hadoop-yarn-nodemanager to apply log aggregation retention seconds |
[analytics] |
11:33 |
<joal> |
Deploying aqs on aqs100[456] (new cluster, no traffic) |
[analytics] |
11:22 |
<joal> |
Succesfull deployment in beta - Deploying aqs on aqs1001 as canary |
[analytics] |
11:18 |
<joal> |
deploying aqs on deployment-prep |
[analytics] |
2016-07-04
§
|
20:38 |
<joal> |
Insert monitoring test data into cassandra on hosts aqs100[456] to prevent icinga alarms |
[analytics] |
20:38 |
<joal> |
Insert manitoring testto make tests pass |
[analytics] |
2016-06-22
§
|
15:01 |
<elukey> |
rebooting bohrium.eqiad.wmnet (running piwik) for kernel upgrades |
[analytics] |
2016-06-15
§
|
08:34 |
<joal> |
Restart misc load job with 10% data loss error threshold |
[analytics] |
2016-06-09
§
|
14:37 |
<elukey> |
Tested retention.bytes=2G for kafka webrequest_misc |
[analytics] |
14:36 |
<elukey> |
Tested retention.bytes=2G for kafka webrequest_misc - setting removed |
[analytics] |
2016-06-08
§
|
18:11 |
<elukey> |
removed retention.bytes override configuration for kafka webrequest_text (didn't work) |
[analytics] |
16:03 |
<elukey> |
temporary set a 10TB upperbound to the Kafka webrequest_text topic to free space |
[analytics] |
08:45 |
<elukey> |
removed temporary retention override for kafka webrequest_text topic (T136690) |
[analytics] |
08:17 |
<elukey> |
lowering down webrequest_text kafka topic retention time from 7 days to 4 days to free disk space |
[analytics] |
2016-06-07
§
|
17:51 |
<ottomata> |
restarting broker on kafka1020 |
[analytics] |
10:10 |
<elukey> |
hue restarted on analytics1027 for security upgrades |
[analytics] |
2016-06-06
§
|
19:15 |
<ottomata> |
restarting kafka broker on kafka1020 to test python consumption client |
[analytics] |
2016-06-04
§
|
09:47 |
<elukey> |
removed temporary Analytics Kafka upload retention override (T136690) |
[analytics] |
09:38 |
<elukey> |
Lowering down temporarily the Analytics kafka upload retention time to 24h to free space (T136690) |
[analytics] |
2016-06-03
§
|
08:38 |
<elukey> |
event logging restarted on eventlog1001 |
[analytics] |
08:34 |
<elukey> |
rebooting kafka1012 for kernel upgrades. |
[analytics] |
2016-06-02
§
|
19:53 |
<ottomata> |
stopping kafka broker and restarting kafka1014 |
[analytics] |
2016-06-01
§
|
18:16 |
<ottomata> |
stopping kafka broker on kafka1018 and rebooting node |
[analytics] |
11:55 |
<elukey> |
restarted EL on eventlog1001 |
[analytics] |
11:51 |
<elukey> |
rebooting kafka1022 for kernel upgrades |
[analytics] |
08:26 |
<elukey> |
deleted very old kafka.log files in /var/log/kafka to free root space |
[analytics] |