201-250 of 5901 results (20ms)
2023-12-19 §
18:29 <mforns> starting refinery deploy (weekly train) [analytics]
11:10 <btullis> restarted the jupyterhub-conda service on stat servers. [analytics]
10:24 <btullis> deploying version 0.0.27 of conda-analytics [analytics]
2023-12-18 §
10:54 <btullis> deploy conda-analytics v 0.0.27 to the hadoop-test-analytics cluster for T345482 [analytics]
09:43 <btullis> cleared some space on an-test-worker1001 by deleting old refinery jars from /tmp `btullis@an-test-worker1001:/tmp$ sudo find . -type f -mtime +60 -name *.jar -delete` [analytics]
09:22 <btullis> deploying refinery version 0.02.27 to production refinery jobs with https://gerrit.wikimedia.org/r/c/operations/puppet/+/980923 for T349121 [analytics]
2023-12-15 §
13:53 <brouberol> deploying spark-history-analytics-hadoop.spark-history.dse-k8s-eqiad.wmnet - T351816 [analytics]
12:55 <brouberol> deploying spark-history-analytics-test-hadoop.spark-history-test.dse-k8s-eqiad.wmnet - T351816 [analytics]
2023-12-12 §
17:13 <btullis> executed `apt clean` on an-coord1001 to free up 7GB. [analytics]
2023-12-11 §
14:43 <btullis> roll-restarting the aqs (nodejs based) services with https://gerrit.wikimedia.org/r/c/operations/puppet/+/982097 [analytics]
2023-12-07 §
21:45 <xcollazo> Deployed latest changes to Airflow Analytics instance to pickup T352890 [analytics]
16:12 <milimetric> finished deploying and syncing refinery [analytics]
15:45 <milimetric> deploying refinery for the sqoop fix [analytics]
12:31 <btullis> deploying conda-analytics v0.0.26 to hadoop-test [analytics]
11:48 <btullis> deploying refinery to hadoop-test only [analytics]
2023-12-06 §
18:27 <btullis> restarted hadoop-yarn-nodemanager and hadoop-hdfs-datanode services on an-worker1086 for T352168 [analytics]
17:19 <btullis> deployed https://gerrit.wikimedia.org/r/c/operations/puppet/+/979118 for airflow metrice update to airflow_test instance for T349532 [analytics]
14:18 <btullis> killed a stalled sqoop process on an-launcher1002 [analytics]
14:16 <btullis> killed a stalled sqoop process on an-launcher1002 [analytics]
2023-12-05 §
15:21 <btullis> I have pushed out version 0.0.25 of conda-analytics to the test cluster. No user facing changes expected. [analytics]
08:29 <stevemunene> depool druid10[04-06] T336043 [analytics]
2023-12-04 §
14:47 <btullis> re-running refine_event for mediawiki_cirrussearch_request failure [analytics]
14:42 <btullis> restarted archiva service on archiva1002 [analytics]
14:38 <btullis> cleared some space on -atest-worker1002 by running: `sudo find /tmp -type f -mtime +30 -delete` [analytics]
13:53 <btullis> bringing an-coord1003 into service as an `analytics_cluster::coordinator` for T336045 [analytics]
13:41 <btullis> starting a rolling restart of the daemons on the analytics druid cluster, to make sure that they restart cleanly after the puppet 7 upgrade [analytics]
12:14 <stevemunene> pool druid1010 T336043 [analytics]
11:03 <btullis> re-ran refine_eventlogging_analytics for MobileWikiAppiOSSessions [analytics]
10:01 <btullis> Marked TaskInstance: projectview_geo.move_data_to_archive scheduled__2023-12-02T04:00:00 as succeeded in airflow analytics. [analytics]
2023-12-01 §
11:13 <stevemunene> pool druid1010 after reimage T336043 [analytics]
10:04 <btullis> marked TaskInstance: pageview_hourly.move_data_to_archive scheduled__2023-12-01T06:00:00+00:00 as succeeded in airflow analytics [analytics]
2023-11-30 §
17:41 <btullis> reran refine_event for mediawiki_cirrussearch_request [analytics]
08:28 <stevemunene> reimage druid1010 to pick up the right raid config and corresponding partman recipe T336043 [analytics]
2023-11-29 §
17:10 <btullis> depool schema2004 for reimage to bookworm for T349286 [analytics]
17:07 <btullis> pooled schema2003 after reimages a bookworm [analytics]
15:30 <btullis> depool schema2003 for upgrade to bookworm [analytics]
15:24 <btullis> pooled schema1004 after upgrade to bookworm for T349286 [analytics]
14:44 <btullis> reimaging schema1004 to bookworm for T349286 [analytics]
14:43 <btullis> depooling schema1004 for reimage T349286 [analytics]
14:41 <btullis> pooled schema1003 after upgrade to bookeworm [analytics]
14:10 <btullis> reimaging schema1003 to bookworm for T349286 [analytics]
14:04 <btullis> depooling schema1003 for reimage T349286 [analytics]
14:01 <btullis> increased the size of the vg0/srv logical volume on an-web1001 by 350 GB for T349889 [analytics]
2023-11-28 §
18:30 <milimetric> deployed refinery to hdfs [analytics]
2023-11-27 §
21:03 <btullis> deploying airflow-dags to analytics_test instance [analytics]
15:05 <stevemunene> pool druid1007 after bullseye reimage T332589 [analytics]
13:27 <stevemunene> reimage druid1007 to upgrade to bullseye T332589 [analytics]
2023-11-24 §
12:34 <joal> Rerun webrequest refine text for 2023-11-23T17 [analytics]
06:07 <stevemunene> pool druid1008 after reimage T332589 [analytics]
2023-11-23 §
14:58 <btullis> merging 974649: Remove all remaining references to oozie and clean up | https://gerrit.wikimedia.org/r/c/operations/puppet/+/974649 for T341893 [analytics]