1-50 of 5193 results (24ms)
2023-05-23 §
10:01 <stevemunene> reboot an-test-master1001.eqiad.wmnet December 2022 Buster reboots T325132 [analytics]
09:33 <stevemunene> reboot an-test-coord1001.eqiad.wmnetDecember 2022 Buster reboots T325132 [analytics]
08:22 <btullis> installing conda-analytics-0.0.17.dev_amd64.deb to an-test-worker1001 for T332765 [analytics]
2023-05-22 §
22:12 <btullis> installing conda-analytics-0.0.17.dev_amd64.deb to an-test-client1001 for T332765 [analytics]
2023-05-19 §
13:23 <btullis> restart monitor_refine_eventlogging_analytics.service on an-launcher1002 [analytics]
2023-05-18 §
16:54 <btullis> systemctl reset-failed services on stat1008 [analytics]
16:53 <btullis> installing conda-analytics 0.0.15 to an-test-worker1001 for T332765 [analytics]
15:49 <mforns> deployed airflow analytics_test [analytics]
14:22 <btullis> systemctl reset-failed user manager services on stat1004 [analytics]
12:46 <elukey> clean up old jupyterhub.service references (crash looping) on stat* nodes that had it [analytics]
10:31 <btullis> cold booting an-worker1110 to troubleshoot drive failure T336929 [analytics]
2023-05-17 §
17:58 <ottomata> Deployed refinery-source using jenkins [analytics]
13:22 <btullis> roll-rebooting dse-k8s-workers via cookbook [analytics]
13:16 <btullis> roll-rebooting an-worker1[096-101] for T335835 [analytics]
2023-05-16 §
17:59 <joal> rerun druid_load_pageviews_daily_aggregated_monthly [analytics]
17:34 <joal> Stop, delete then restart airflow druid_load_banner_activity jobs [analytics]
17:34 <joal> deploy fix for airflow druid_load_banner_activity jobs [analytics]
15:58 <joal> Kill oozie banner_activity-druid-monthly-coord job [analytics]
15:57 <joal> Start airflow druid_load_banner_activity_minutely_aggregated_monthly [analytics]
15:55 <joal> Kill oozie banner_activity_daily job [analytics]
15:55 <joal> Start airflow duid_load_banner_activity_minutely [analytics]
15:51 <joal> Kill oozie mediawiki_history_reduced job [analytics]
15:50 <joal> Start airflow mediawiki_history_reduced job with start-date to 2023-05-01 [analytics]
15:45 <joal> Clear failed wikidata_item_page_link sensor task after deploy - due to datacenter switcover [analytics]
15:41 <joal> Deploying analytics airflow dags [analytics]
14:00 <joal> Deploy refinery onto HDFS [analytics]
13:40 <btullis> pooled schema2004 for T335042 [analytics]
11:45 <joal> Deploy refinery using scap [analytics]
11:04 <btullis> depooled schema2004 for T335042 [analytics]
2023-05-15 §
07:16 <joal> Rerun failed refine_eventlogging_legacy job for universallanguageselector [analytics]
07:02 <joal> Rerun failed refine_event job for content_translation_event [analytics]
2023-05-12 §
16:05 <mforns> dropped mobile_apps_* hive tables because of https://phabricator.wikimedia.org/T329310 [analytics]
2023-05-11 §
14:55 <xcollazo> replaced /user/spark/share/lib/spark-3.1.2-assembly.jar in HDFS with new version that includes Iceberg. [analytics]
2023-05-10 §
20:37 <milimetric> deployed refinery (except to an-airflow1001) [analytics]
16:50 <stevemunene> deploy conda-analytics v0.0.13 T335721 [analytics]
16:36 <btullis> installing airflow 2.6.0 on an-test-client1001 for T336286 [analytics]
15:51 <mforns> stopped Airflow DAG mobile_app_session_metrics_weekly because of https://phabricator.wikimedia.org/T329310 [analytics]
15:50 <mforns> killed oozie job mobile_apps-uniques-monthly-coord because of https://phabricator.wikimedia.org/T329310 [analytics]
2023-05-09 §
13:02 <btullis> rebooting an-worker1088 after firmware upgrade for T336077 [analytics]
12:59 <btullis> upgrading SAS RAID controller firmware on an-worker1088 for T336077 [analytics]
12:27 <btullis> rebooting eventlog1003 for T325132 [analytics]
2023-05-08 §
21:22 <mforns> deployed airflow analytics for a quick fix [analytics]
2023-05-05 §
15:44 <mforns> re-ran projectview_hourly DAG for 2023-05-05T13 [analytics]
15:06 <mforns> deployed airflow analytics [analytics]
14:26 <btullis> roll-rebooting presto workers for T335835 [analytics]
2023-05-04 §
20:12 <btullis> executed `sudo apt clean` on stat1005 to free up some space. [analytics]
20:09 <btullis> restarting hive-server2 and hive-metastore on an-coord1002 [analytics]
14:07 <btullis> failing back hive service to an-coord1001 [analytics]
2023-05-03 §
21:43 <milimetric> deployed refinery-source and refinery to prepare for launching new airflow druid jobs [analytics]
2023-05-02 §
18:20 <milimetric> deployed refinery as part of weekly train [analytics]