2023-05-04 §
20:12 <btullis> executed `sudo apt clean` on stat1005 to free up some space. [analytics]
20:09 <btullis> restarting hive-server2 and hive-metastore on an-coord1002 [analytics]
14:07 <btullis> failing back hive service to an-coord1001 [analytics]
2023-05-03 §
21:43 <milimetric> deployed refinery-source and refinery to prepare for launching new airflow druid jobs [analytics]
2023-05-02 §
18:20 <milimetric> deployed refinery as part of weekly train [analytics]
16:24 <btullis> roll-restarting AQS [analytics]
13:49 <btullis> deploying updated mediawiki history snapshot to aqs [analytics]
09:33 <btullis> depooled schema2003 for T334049 [analytics]
2023-04-26 §
11:12 <btullis> restart refine_netflow service on an-launcher1002. [analytics]
10:55 <btullis> deploying refinery to hdfs [analytics]
09:12 <btullis> deploying refinery [analytics]
2023-04-25 §
13:47 <btullis> rebooting an-test-worker1002 T335358 [analytics]
13:07 <btullis> restarted the gobblin-eventlogging_legacy_test on an-test-coord1001 [analytics]
13:06 <btullis> killed the gobblin-eventlogging_legacy_test on an-test-coord1001 [analytics]
2023-04-24 §
09:40 <btullis> upgrading RAID controller firmware on an-worker1110 T334832 [analytics]
2023-04-20 §
16:25 <SandraEbele> Deployed refinery using scap, then deployed onto hdfs as part of weekly deployment train. [analytics]
15:44 <SandraEbele> deploying weekly deployment train for analytics refinery. [analytics]
2023-04-18 §
15:49 <btullis> restarting refinery-drop-raw-netflow-event.service refinery-drop-webrequest-raw-partitions.service refinery-drop-webrequest-refined-partitions.service on an-launcher1002 [analytics]
15:48 <btullis> restart refinery-drop-raw-event.service on an-launcher1002 [analytics]
15:45 <btullis> restart refinery-drop-pageview-actor-hourly-partitions.service on an-launcher1002 [analytics]
15:44 <btullis> restart refinery-drop-eventlogging-legacy-raw-partitions.service on an-launcher1002 [analytics]
15:42 <btullis> restart drop-webrequest-actor-metrics-rollup-hourly.service on an-launcher1002 [analytics]
15:40 <btullis> restart drop-webrequest-actor-metrics-hourly.service on an-launcher1002 [analytics]
14:51 <btullis> restart drop-webrequest-actor-label-hourly.service on an-launcher1002 [analytics]
13:56 <btullis> re-enabling gobblin timers [analytics]
13:52 <btullis> pooled schema1004 [analytics]
13:51 <btullis> pooled aqs10[14,15,19] [analytics]
13:49 <btullis> re-enabling YARN queues [analytics]
13:43 <btullis> leaving HDFS safe mode on an-master1001 [analytics]
11:55 <btullis> entering safe mode for prod hadoop HDFS [analytics]
11:48 <btullis> depooled aqs10[14,15,19] [analytics]
11:45 <btullis> depooled schema1004 T333377 [analytics]
11:41 <btullis> refreshed yarn queues with `sudo cumin '(A:hadoop-master or A:hadoop-standby)' 'kerberos-run-command yarn /usr/bin/yarn rmadmin -refreshQueues'` [analytics]
11:36 <btullis> stopping YARN queues T333377 [analytics]
11:34 <btullis> disable gobblin timers T333377 [analytics]
08:39 <btullis> rebooting an-worker1110 to attempt upgrading RAID controller firmware [analytics]
2023-04-17 §
20:48 <joal> Restart AQS to pick up druid new datasource using scap [analytics]
18:34 <xcollazo> Removed old Airflow cached artifacts. Details at T334886. [analytics]
17:26 <SandraEbele> restarted turnilo with `sudo systemctl restart turnilo` [analytics]
17:13 <SandraEbele> restarted Oozie pageview-druid-daily job 0174450-220913162928808-oozie-oozi-C [analytics]
17:00 <xcollazo> scap deploy 'analytics: deploy Airflow ArchiveOperator should have a number of retries of 0. T332216' [analytics]
16:56 <SandraEbele> restarted Oozie pageview-druid-hourly job 0174449-220913162928808-oozie-oozi-C [analytics]
11:12 <btullis> running sre.hadoop.init-hadoop-workers an-worker1132.eqiad.wmnet [analytics]
10:32 <btullis> reimaging an-worker1132 [analytics]
2023-04-13 §
21:37 <SandraEbele> Successfully Deployed analytics refinery using scap, then deployed onto hdfs. [analytics]
15:42 <SandraEbele> paused Oozie pageview-druid-hourly job. [analytics]
15:27 <SandraEbele> deploying analytics refinery-update pageview druid table [analytics]
08:19 <steve_munene> Decommission an-worker1132 from the Hadoop cluster for T333091 reimage [analytics]
2023-04-12 §
15:16 <mforns> cleared airflow task aggregate_projectview_geographically from dag projectview_geo for 2023-04-12T08->09 [analytics]
14:50 <mforns> cleared airflow task aggregate_pageview_to_projectview from projectview_hourly dag for 2023-04-12T08->09 [analytics]