2021-05-03
§
|
14:23 |
<ottomata> |
stopping all venv based jupyter singleuser servers - T262847 |
[analytics] |
13:59 |
<ottomata> |
dropped all obselete (upper cased location) event_santizied.*_T280813 tables created for T280813 |
[analytics] |
10:43 |
<joal> |
Add _SUCCESS flag to /wmf/data/raw/mediawiki_private/tables/cu_changes/month=2021-04 after having manually sqooped missing tables |
[analytics] |
09:57 |
<joal> |
restart refinery-sqoop-mediawiki-private timer after patch |
[analytics] |
09:56 |
<joal> |
Reset refinery-sqoop-mediawiki-private timer |
[analytics] |
09:38 |
<joal> |
Drop already sqooped data to restart jobs |
[analytics] |
08:53 |
<joal> |
Deploy refinery for sqoop hotfix |
[analytics] |
08:33 |
<elukey> |
clean up libmariadb-java from hadoop workers and clients |
[analytics] |
07:46 |
<joal> |
Kill prod sqoop job to restart after fix |
[analytics] |
2021-04-29
§
|
15:55 |
<razzi> |
restart hadoop-yarn-nodemanager and hadoop-hdfs-datanode on an-worker1100 for hadoop to recognize new disk /dev/sdl |
[analytics] |
15:38 |
<ottomata> |
enabling event_sanitized_main jobs - T273789 |
[analytics] |
14:57 |
<elukey> |
run mysql_upgrade on an-coord1001 to complete the buster upgrade - T278424 |
[analytics] |
14:44 |
<hnowlan> |
restored all eventlogging jobs to eventlog1003 |
[analytics] |
14:21 |
<hnowlan> |
bump eventlog1003 CPUs to 6 |
[analytics] |
13:53 |
<joal> |
Rerun failed pageview-hourly-wf-2021-4-29-11 and pageview-hourly-wf-2021-4-29-12 |
[analytics] |
13:09 |
<joal> |
Rerun failed pageview-hourly-wf-2021-4-29-11 |
[analytics] |
12:35 |
<hnowlan> |
restarting 2 processors on eventlog1002 |
[analytics] |
12:02 |
<hnowlan> |
stopping processors on eventlog1002 to migrate to eventlog1003 |
[analytics] |
11:50 |
<elukey> |
manual stop of one of the eventlog processors on eventlog1002 to see if 1003 takes it over |
[analytics] |
02:59 |
<milimetric> |
deployed hotfix for referrer job |
[analytics] |
2021-04-28
§
|
17:46 |
<hnowlan> |
eventlog1003 joined to groups successfully |
[analytics] |
17:36 |
<razzi> |
sudo mkdir /srv/log/eventlogging and sudo chown eventlogging:eventlogging /srv/log/eventlogging to workaround missing directory puppet error (to be puppetized later) |
[analytics] |
17:31 |
<razzi> |
remove deployment cache on eventlogging1003: sudo rm -fr /srv/deployment/eventlogging/analytics-cache/ |
[analytics] |
17:26 |
<razzi> |
manually change /srv/deployment/eventlogging/analytics/.git/DEPLOY_HEAD to deployment1002 on deployment1002 to fix puppet scap error |
[analytics] |
16:53 |
<hnowlan> |
stopping deployment-eventlog05 in deployment-prep |
[analytics] |
14:42 |
<milimetric> |
deployed refinery with 0.1.9 jars and synced to hdfs |
[analytics] |
14:30 |
<elukey> |
chown -R analytics-deploy:analytics-deploy /srv/deployment/analytics on an-coord1001 |
[analytics] |
12:50 |
<ottomata> |
applied data_purge jobs in analytics test cluster; old data will now be dropped there - T273789 |
[analytics] |