|
2020-09-24
|
| 13:24 |
<elukey> |
moved the hadoop cluster to puppet TLS certificates |
[analytics] |
| 13:20 |
<elukey> |
re-enable timers on an-launcher1002 after maintenance |
[analytics] |
| 09:51 |
<elukey> |
stop all timers on an-launcher1002 to ease maintenance |
[analytics] |
| 09:41 |
<elukey> |
force re-creation of jupyterhub's default venv on stat1006 after reimage |
[analytics] |
| 07:29 |
<klausman> |
Starting reimaging of stat1006 |
[analytics] |
| 06:48 |
<elukey> |
on an-launcher1002: sudo -u hdfs kerberos-run-command hdfs hdfs dfs -rm -r -skipTrash /var/log/hadoop-yarn/apps/mirrys/logs/* |
[analytics] |
| 06:45 |
<elukey> |
on an-launcher1002: sudo -u hdfs kerberos-run-command hdfs hdfs dfs -rm -r -skipTrash /var/log/hadoop-yarn/apps/analytics-privatedata/logs/* |
[analytics] |
| 06:39 |
<elukey> |
manually ran "/usr/bin/find /srv/backup/hadoop/namenode -mtime +15 -delete" on an-master1002 to free some space in the backup partition |
[analytics] |
|
2020-09-16
|
| 19:12 |
<joal> |
Manually kill the webrequest-hour oozie job that started before the restart could happen (it was waiting for the previous hour to finish) |
[analytics] |
| 19:00 |
<joal> |
Kill-restart data-quality-hourly bundle after deploy |
[analytics] |
| 18:57 |
<joal> |
Kill-restart webrequest after deploy |
[analytics] |
| 18:44 |
<joal> |
Kill-restart mediawiki-history-reduced job after deploy |
[analytics] |
| 17:59 |
<joal> |
Deploy refinery onto HDFS |
[analytics] |
| 17:46 |
<joal> |
Deploy refinery using scap |
[analytics] |
| 15:27 |
<elukey> |
update the TLS backend certificate for Analytics UIs (unified one) to include hue-next.w.o as SAN |
[analytics] |
| 12:11 |
<klausman> |
stat1008 updated to use rock/rocm DKMS driver and back in operation |
[analytics] |
| 11:28 |
<klausman> |
starting to upgrade to rock-dkms driver on stat1008 |
[analytics] |
| 08:11 |
<elukey> |
superset 0.37.1 deployed to an-tool1005 (staging env) |
[analytics] |