251-300 of 4940 results (22ms)
2022-09-14 §
11:34 <btullis> remounted all remaining /mnt/hdfs mount points, except stat1005 which is busy [analytics]
11:12 <btullis> remounted /mnt/hdfs on an-coord100[1-2] [analytics]
11:09 <btullis> remounted /mnt/hdfs on an-airflow1001 [analytics]
09:14 <joal> Restart oozie virtualpageview job [analytics]
09:10 <btullis> re-mounted /mnt/hdfs on an-launcher1002. [analytics]
07:11 <joal> restart webrequest oozie bundle [analytics]
2022-09-13 §
17:22 <joal> rerun refine_eventloggin_legacy [analytics]
17:14 <joal> rerun refine_event [analytics]
17:14 <joal> rerun refine_netflow [analytics]
16:53 <joal> Rerun refine_eventlogging_analytics [analytics]
16:45 <joal> Kill-rerun suspended oozie jobs (virtual-pagview and predictions-actor [analytics]
16:34 <joal> rerun failed webrequest oozie jobs [analytics]
16:30 <btullis> restarting hive-server2 and hive-metastore on an-coord1001 (currently standby) [analytics]
16:29 <btullis> restarting oozie on an-coord1001 [analytics]
16:10 <joal> Rerun failed oozie webrequest jobs [analytics]
15:57 <btullis> rolling out updated hadoop packages to an-airflow1003 [analytics]
15:55 <btullis> rolling out upgraded hadoop client packages to stat servers. [analytics]
15:51 <btullis> restarting eventlogging_to_druid_network_flows_internal_hourly.service eventlogging_to_druid_prefupdate_hourly.service refine_event_sanitized_analytics_immediate.service refine_event_sanitized_main_immediate.service [analytics]
15:49 <btullis> restarting eventlogging_to_druid_navigationtiming_hourly.service on an-launcher1002 [analytics]
15:46 <btullis> restarting eventlogging_to_druid_editattemptstep_hourly.service on an-launcher1002 [analytics]
15:44 <btullis> cancel that last message. Upgrading hadoop packages on an-launcher instead. They were inadvertently omitted last time. [analytics]
15:39 <btullis> Going to downgrade hadoop on ann hadoop-worker nodes to 2.10.1 [analytics]
15:21 <btullis> failed over hive to an-coord1002 via DNS https://gerrit.wikimedia.org/r/c/operations/dns/+/831906 [analytics]
15:20 <btullis> restarted yarn service on an-master1002 to make the active host an-master1001 again. [analytics]
15:11 <btullis> restart hive-server2 and hive-metastore service on an-coord1002 to pick up new version of hadoop [analytics]
14:55 <btullis> rolling out updated hadoop packages to analytics-airflow (cumin alias) hosts [analytics]
14:42 <btullis> sudo systemctl restart analytics-reportupdater-logs-rsync.service on an-launcher1002 [analytics]
13:21 <joal> Manual launch of refinery-drop-mediawiki-snapshots with new tables in patch https://gerrit.wikimedia.org/r/831866 [analytics]
10:51 <btullis> attempting failback operation on hadoop namenodes [analytics]
09:42 <btullis> roll-restarting the hadoop masters via the cookbook [analytics]
2022-09-12 §
08:37 <btullis> cold-reset BMC device on analytics1073 [analytics]
2022-09-08 §
17:32 <joal> make ops reboot stat1008 [analytics]
2022-09-07 §
13:36 <joal> rerun failed airflow tasks [analytics]
2022-09-06 §
22:18 <milimetric> restarted webrequest druid daily and hourly jobs [analytics]
22:18 <milimetric> restarted referrer daily coordinator [analytics]
22:18 <milimetric> restarted webrequest load bundle [analytics]
21:57 <milimetric> finished cleaning up bad state and re-deploying refinery [analytics]
21:45 <milimetric> cleared logs earlier than September 1st from an-launcher1002:/srv/airflow-analytics/logs/scheduler [analytics]
18:49 <milimetric> finished refinery-source 0.2.6 deploy, waiting 5 minutes and starting refinery deploy [analytics]
18:28 <milimetric> weekly deployment train starting [analytics]
09:55 <btullis> merged and deployed https://gerrit.wikimedia.org/r/c/operations/puppet/+/821695 [analytics]
2022-09-04 §
12:49 <elukey> pkill remaining processes of user effeietsanders on stat1008 to unblock puppet [analytics]
2022-09-02 §
08:25 <joal> Restart mediawiki_history_denormalize job manually [analytics]
2022-08-30 §
17:49 <joal> Deploying refinery onto HDFS [analytics]
17:11 <joal> deploy refinery using scap [analytics]
17:11 <joal> release refinery-source v0.2.5 to archiva [analytics]
2022-08-29 §
16:44 <mforns> killed mediawiki-history-dumps oozie after migration to airflow [analytics]
08:04 <joal> Rerun refine_eventlogging_legacy failed hours [analytics]
07:54 <joal> rerun pageview-hourly-wf-2022-8-28-15 oozie workflow [analytics]
2022-08-22 §
16:25 <btullis> btullis@an-airflow1004:~$ sudo systemctl reset-failed ifup@ens13.service [analytics]