2021-04-29
§
|
15:55 |
<razzi> |
restart hadoop-yarn-nodemanager and hadoop-hdfs-datanode on an-worker1100 for hadoop to recognize new disk /dev/sdl |
[analytics] |
15:38 |
<ottomata> |
enabling event_sanitized_main jobs - T273789 |
[analytics] |
14:57 |
<elukey> |
run mysql_upgrade on an-coord1001 to complete the buster upgrade - T278424 |
[analytics] |
14:44 |
<hnowlan> |
restored all eventlogging jobs to eventlog1003 |
[analytics] |
14:21 |
<hnowlan> |
bump eventlog1003 CPUs to 6 |
[analytics] |
13:53 |
<joal> |
Rerun failed pageview-hourly-wf-2021-4-29-11 and pageview-hourly-wf-2021-4-29-12 |
[analytics] |
13:09 |
<joal> |
Rerun failed pageview-hourly-wf-2021-4-29-11 |
[analytics] |
12:35 |
<hnowlan> |
restarting 2 processors on eventlog1002 |
[analytics] |
12:02 |
<hnowlan> |
stopping processors on eventlog1002 to migrate to eventlog1003 |
[analytics] |
11:50 |
<elukey> |
manual stop of one of the eventlog processors on eventlog1002 to see if 1003 takes it over |
[analytics] |
02:59 |
<milimetric> |
deployed hotfix for referrer job |
[analytics] |
2021-04-28
§
|
17:46 |
<hnowlan> |
eventlog1003 joined to groups successfully |
[analytics] |
17:36 |
<razzi> |
sudo mkdir /srv/log/eventlogging and sudo chown eventlogging:eventlogging /srv/log/eventlogging to workaround missing directory puppet error (to be puppetized later) |
[analytics] |
17:31 |
<razzi> |
remove deployment cache on eventlogging1003: sudo rm -fr /srv/deployment/eventlogging/analytics-cache/ |
[analytics] |
17:26 |
<razzi> |
manually change /srv/deployment/eventlogging/analytics/.git/DEPLOY_HEAD to deployment1002 on deployment1002 to fix puppet scap error |
[analytics] |
16:53 |
<hnowlan> |
stopping deployment-eventlog05 in deployment-prep |
[analytics] |
14:42 |
<milimetric> |
deployed refinery with 0.1.9 jars and synced to hdfs |
[analytics] |
14:30 |
<elukey> |
chown -R analytics-deploy:analytics-deploy /srv/deployment/analytics on an-coord1001 |
[analytics] |
12:50 |
<ottomata> |
applied data_purge jobs in analytics test cluster; old data will now be dropped there - T273789 |
[analytics] |
2021-04-21
§
|
21:30 |
<ottomata> |
temporariliy disabling sanitize_eventlogging_analytics_delayed jobs until T280813 is completed (probably tomorrow) |
[analytics] |
20:04 |
<ottomata> |
renaming event_santized hive table directories to lower case and repairing table partition paths - T280813 |
[analytics] |
09:28 |
<elukey> |
roll restart druid-overlord on druid* after an-coord1001 maintenance |
[analytics] |
09:08 |
<elukey> |
upgrade hue on an-tool1009 to 4.9.0-2 |
[analytics] |
08:31 |
<elukey> |
re-enable timers on an-launcher1002 and airflow on an-airflow1001 after maintenance on an-coord1001 |
[analytics] |
07:08 |
<elukey> |
reimage an-coord1001 after partition reshape (/var/lib/mysql folded in /srv) |
[analytics] |
06:51 |
<elukey> |
stop airflow on an-airflow1001 |
[analytics] |
06:49 |
<elukey> |
stop all services on an-coord1001 as prep step for reimage |
[analytics] |
06:45 |
<elukey> |
PURGE BINARY LOGS BEFORE '2021-04-14 00:00:00'; on an-coord1001 to free some space before the reimage |
[analytics] |
06:00 |
<elukey> |
stop timers on an-launcher1002 as prep step for an-coord1001 reimage |
[analytics] |