2022-09-14
§
|
17:11 |
<aqu> |
Sep 14 15:23:34 UTC sudo systemctl start check_webrequest_partitions.service |
[analytics] |
12:56 |
<aqu> |
~1h ago: sudo systemctl start refinery-sqoop-mediawiki-production-daily.service ; sudo systemctl start refinery-import-siteinfo-dumps.service ; sudo systemctl start refinery-import-page-current-dumps.service ; sudo systemctl start refinery-import-page-history-dumps.service |
[analytics] |
11:34 |
<btullis> |
remounted all remaining /mnt/hdfs mount points, except stat1005 which is busy |
[analytics] |
11:12 |
<btullis> |
remounted /mnt/hdfs on an-coord100[1-2] |
[analytics] |
11:09 |
<btullis> |
remounted /mnt/hdfs on an-airflow1001 |
[analytics] |
09:14 |
<joal> |
Restart oozie virtualpageview job |
[analytics] |
09:10 |
<btullis> |
re-mounted /mnt/hdfs on an-launcher1002. |
[analytics] |
07:11 |
<joal> |
restart webrequest oozie bundle |
[analytics] |
2022-09-13
§
|
17:22 |
<joal> |
rerun refine_eventlogging_legacy |
[analytics] |
17:14 |
<joal> |
rerun refine_event |
[analytics] |
17:14 |
<joal> |
rerun refine_netflow |
[analytics] |
16:53 |
<joal> |
Rerun refine_eventlogging_analytics |
[analytics] |
16:45 |
<joal> |
Kill-rerun suspended oozie jobs (virtual-pageview and predictions-actor) |
[analytics] |
16:34 |
<joal> |
rerun failed webrequest oozie jobs |
[analytics] |
16:30 |
<btullis> |
restarting hive-server2 and hive-metastore on an-coord1001 (currently standby) |
[analytics] |
16:29 |
<btullis> |
restarting oozie on an-coord1001 |
[analytics] |
16:10 |
<joal> |
Rerun failed oozie webrequest jobs |
[analytics] |
15:57 |
<btullis> |
rolling out updated hadoop packages to an-airflow1003 |
[analytics] |
15:55 |
<btullis> |
rolling out upgraded hadoop client packages to stat servers. |
[analytics] |
15:51 |
<btullis> |
restarting eventlogging_to_druid_network_flows_internal_hourly.service eventlogging_to_druid_prefupdate_hourly.service refine_event_sanitized_analytics_immediate.service refine_event_sanitized_main_immediate.service |
[analytics] |
15:49 |
<btullis> |
restarting eventlogging_to_druid_navigationtiming_hourly.service on an-launcher1002 |
[analytics] |
15:46 |
<btullis> |
restarting eventlogging_to_druid_editattemptstep_hourly.service on an-launcher1002 |
[analytics] |
15:44 |
<btullis> |
cancel that last message. Upgrading hadoop packages on an-launcher instead. They were inadvertently omitted last time. |
[analytics] |
15:39 |
<btullis> |
Going to downgrade hadoop on all hadoop-worker nodes to 2.10.1 |
[analytics] |
15:21 |
<btullis> |
failed over hive to an-coord1002 via DNS https://gerrit.wikimedia.org/r/c/operations/dns/+/831906 |
[analytics] |
15:20 |
<btullis> |
restarted yarn service on an-master1002 to make the active host an-master1001 again. |
[analytics] |
15:11 |
<btullis> |
restart hive-server2 and hive-metastore service on an-coord1002 to pick up new version of hadoop |
[analytics] |
14:55 |
<btullis> |
rolling out updated hadoop packages to analytics-airflow (cumin alias) hosts |
[analytics] |