1-50 of 4815 results (31ms)
2022-11-25 §
15:29 <btullis> reset the bmc on an-coord1002 [analytics]
11:24 <elukey> restart turnilo on an-tool1007 to pick up new settings for webrequest_sampled_live [analytics]
10:07 <elukey> refresh the webrequest-sampled-live druid supervisor after https://gerrit.wikimedia.org/r/c/analytics/refinery/+/859463 [analytics]
2022-11-24 §
16:21 <SandraEbele> restarted webrequest-druid-daily-coord as part of weekly deployment train. [analytics]
16:15 <SandraEbele> killed webrequest-druid-daily-coord for restart as part of weekly deployment train. [analytics]
16:13 <SandraEbele> successfully restarted webrequest-druid-hourly-coord for restart as part of weekly deployment train. [analytics]
16:11 <SandraEbele> killed webrequest-druid-hourly-coord for restart as part of weekly deployment train. [analytics]
15:30 <SandraEbele> Started deployment of refinery as part of weekly deployment train [analytics]
2022-11-23 §
15:38 <btullis> roll-restarting kafka-jumbo brokers to pick up new certificates. T323697 [analytics]
2022-11-18 §
18:56 <mforns> re-ran refine_event_sanitized_analytics_immediate from 2022-11-17T13 to 2022-11-18T18 to fix the issues caused by a bug (allow-list typo) deployed yesterday. [analytics]
2022-11-17 §
17:14 <mforns> restarted mediawiki-denormalize-coord as part of weekly deployment train [analytics]
16:07 <mforns> finished refinery deployment [analytics]
15:53 <mforns> started refinery deployment for weekly train (accompanying refinery-source 0.2.9) [analytics]
14:52 <btullis> deploying updated hadoop packages to druid-public [analytics]
14:51 <btullis> deploying updated hadoop packages to druid-analytics [analytics]
14:37 <btullis> deploying updated hadoop packages to hue and yarn webservers [analytics]
14:34 <btullis> deploying updated hadoop packages to analytics-presto hosts [analytics]
2022-11-16 §
21:40 <mforns> deployed airflow up to e08e32e83b519dee214b7177bbe0fd3ac5a0be3c [analytics]
20:37 <mforns> deployed refinery-source 0.2.9 as part of weekly deployment train [analytics]
09:11 <elukey> update the webrequest sampled live supervisor on Druid Analytics after https://gerrit.wikimedia.org/r/857408 [analytics]
2022-11-15 §
14:24 <elukey> started webrequest_sampled supervisor on Druid Analytics - T314981 [analytics]
11:50 <elukey> `elukey@kafka-jumbo1001:~$ kafka topics --create --topic webrequest_sampled --partitions 3 --replication-factor 3` - T314981 [analytics]
2022-11-07 §
06:24 <aqu> sudo systemctl reset-failed monitor_refine_eventlogging_legacy.service [analytics]
06:00 <aqu> Rerunning on an-launcher1002 sudo -u analytics kerberos-run-command analytics refine_eventlogging_legacy --ignore_failure_flag=true --table_include_regex='homepagemodule' --since='2022-11-04T15:00:00.000Z' --until='2022-11-05T16:00:00.000Z' [analytics]
2022-11-04 §
10:14 <btullis> btullis@clouddumps1002:/srv/dumps/xmldatadumps/public/other/pageview_complete/2022/2022-11$ sudo systemctl restart analytics-dumps-fetch-pageview_complete_dumps.service [analytics]
10:14 <btullis> btullis@clouddumps1002:/srv/dumps/xmldatadumps/public/other/pageview_complete/2022/2022-11$ sudo chown dumpsgen:dumpsgen pageviews-20221102-automated.bz2 [analytics]
2022-11-03 §
08:55 <joal> Add _SUCCESS file to manually computed pageview-actor data for 2022-11-02T11:00 [analytics]
2022-10-27 §
17:24 <mforns> re-running webrequest-load-wf-text-2022-10-27-10 with lower thresholds [analytics]
2022-10-25 §
17:28 <mforns> deployed refinery to the test cluster [analytics]
2022-10-24 §
16:19 <btullis> `chown analytics-deploy /srv/deployment/analytics` on clouddumps100[1-2] [analytics]
15:30 <mforns> finished deploying refinery as part of the weekly train [analytics]
15:30 <mforns> deployed airflow-dags as part of weekly train [analytics]
15:12 <mforns> starting refinery regular weekly deploy [analytics]
07:32 <elukey> `elukey@stat1005:~$ sudo systemctl reset-failed session-c4122.scope session-c4123.scope session-c4124.scope session-c4447.scope session-c4450.scope session-c4449.scope session-c4638.scope jupyter-echetty-singleuser.service` [analytics]
07:30 <elukey> `elukey@stat1004:~$ sudo systemctl reset-failed jupyter-ntsako-singleuser.service` [analytics]
2022-10-23 §
13:31 <elukey> clean logs with 10d+ on an-airflow1001 to free some space [analytics]
13:26 <elukey> clean logs with 15d+ on an-airflow1001 to free some space [analytics]
2022-10-22 §
08:17 <joal> rerun webrequest-load-wf-text-2022-10-22-3 oozie job with higher error threshold [analytics]
2022-10-21 §
16:55 <btullis> restarting hive-server2 service on an-coord1001 [analytics]
16:49 <btullis> restarting hue on an-tool1009 [analytics]
15:18 <joal> restart hive-server2 service [analytics]
07:32 <joal> restart failed oozie jobs [analytics]
07:28 <joal> Restart HiveServer2 on an-coord1001 (I didn't even know I could do this) [analytics]
06:53 <joal> killing old mjolnit jobs [analytics]
06:50 <joal> Kill rerun stuck oozie job [analytics]
06:37 <joal> Kill skein test jobs in arn [analytics]
2022-10-19 §
17:14 <btullis> reset the BMC on analytics1075 [analytics]
2022-10-17 §
18:17 <mforns> deleted Airflow DAGs for backfilling of Cassandra loading of unique devices [analytics]
2022-10-15 §
09:24 <joal> Rerun failed refine_eventlogging_analytics job [analytics]
09:00 <joal> Rerun pageview-hourly-wf-2022-10-14-23 [analytics]