2021-03-24 §
18:49 <elukey> systemctl restart refinery-import-* failed jobs (/mnt/hdfs errors due to me umounting the mountpoint) [analytics]
18:43 <elukey> kill fuse hdfs mount process on an-launcher1002, re-mounted /mnt/hdfs, too many processes in D state [analytics]
15:46 <razzi> rebalance kafka partitions for webrequest_text partitions 3 and 4 [analytics]
05:40 <razzi> sudo chown analytics /var/log/refinery/sqoop-mediawiki.log.1 on an-launcher1002 and restart logrotate [analytics]
2021-03-22 §
18:12 <elukey> drop /srv/.hardsync* to clean up hardlinks not needed [analytics]
18:07 <elukey> run rm -rfv .hardsync.*/archive/public-datasets/* on thorium:/srv to clean up files to drop (didn't work) [analytics]
18:01 <elukey> drop /srv/.hardsync*trash* on thorium - old hardlinks that should have been trashed [analytics]
15:52 <razzi> rebalance kafka partitions for webrequest_text partition 2 [analytics]
09:28 <elukey> move the yarn scheduler in hadoop test to capacity [analytics]
2021-03-19 §
15:44 <razzi> rebalance kafka partitions for webrequest_text partition 1 [analytics]
2021-03-18 §
19:30 <razzi> rename /usr/lib/python2.7/dist-packages/cqlshlib/copyutil.so back [analytics]
19:29 <razzi> temporarily rename /usr/lib/python2.7/dist-packages/cqlshlib/copyutil.so on aqs1004 to fix https://issues.apache.org/jira/browse/CASSANDRA-11574 [analytics]
19:02 <ottomata> hdfs dfs -chgrp -R analytics-privatedata-users /wmf/camus - T275396 [analytics]
16:47 <razzi> rebalance kafka partitions for webrequest_text partition 0 [analytics]
06:32 <elukey> force a manual run of create_virtualenv.sh on an-tool1010 - superset down [analytics]
2021-03-17 §
20:45 <razzi> release wikistats 2.9.0 [analytics]
20:15 <ottomata> install anaconda-wmf 2020.02~wmf3 on analytics cluster clients and workers - T262847 [analytics]
18:10 <ottomata> started oozie/cassandra/coord_pageview_top_percountry_daily [analytics]
15:21 <razzi> rebalance kafka partitions for webrequest_upload partitions 22 and 23 [analytics]
13:54 <razzi> sudo cookbook sre.hosts.reboot-single an-conf1001.eqiad.wmnet [analytics]
13:47 <razzi> sudo cookbook sre.hosts.reboot-single an-conf1003.eqiad.wmnet [analytics]
13:41 <razzi> sudo cookbook sre.hosts.reboot-single an-conf1002.eqiad.wmnet [analytics]
13:39 <ottomata> deploying refinery for weekly train [analytics]
13:28 <ottomata> deploy aqs as part of train - T207171, T263697 [analytics]
01:28 <razzi> rebalance kafka partitions for webrequest_upload partition 21 [analytics]
2021-03-16 §
14:43 <razzi> rebalance kafka partitions for webrequest_upload partition 20 [analytics]
03:17 <razzi> rebalance kafka partitions for webrequest_upload partition 19 [analytics]
2021-03-15 §
16:53 <razzi> rebalance kafka partitions for webrequest_upload partition 18 [analytics]
08:25 <elukey> stop/start hdfs-balancer on an-launcher1002 with bw 200MB [analytics]
07:48 <joal> Manually start mediawiki-history-drop-snapshot.service to check the run succeeds [analytics]
07:47 <joal> Drop hive wmf.mediawiki_wikitext_history snapshot partitions (2020-08, 2020-09, 2020-10, 2020-11) [analytics]
2021-03-14 §
20:49 <joal> Manually clean some data (mediawiki-history-drop-snapshot.service seems not to be working) [analytics]
20:46 <joal> Force a run of mediawiki-history-drop-snapshot.service to clean up some data [analytics]
2021-03-12 §
17:20 <elukey> kill duplicate mediawiki-wikitext-history coordinator failing and sending emails to alerts@ [analytics]
07:21 <elukey> re-run monitor_refine_event_failure_flags [analytics]
2021-03-11 §
22:31 <razzi> rebalance kafka partitions for webrequest_upload partition 17 [analytics]
20:20 <razzi> disable maintenance mode for matomo1002 [analytics]
20:08 <razzi> starting reboot of matomo1002 for kernel upgrade [analytics]
18:52 <razzi> systemctl restart hadoop-hdfs-datanode on analytics1059 [analytics]
18:50 <razzi> systemctl restart hadoop-yarn-nodemanager on analytics1059 [analytics]
18:35 <razzi> apt-get install parted on analytics1059 [analytics]
15:34 <razzi> rebalance kafka partitions for webrequest_upload partition 17 [analytics]
10:52 <elukey> drop /home/bsitzmann on all stat100x hosts - T273712 [analytics]
08:25 <elukey> drop database dedcode cascade in hive - T276748 [analytics]
08:15 <elukey> hdfs dfs -rmr /user/dedcode on an-launcher1002 (data in trash for a month) - T276748 [analytics]
2021-03-10 §
23:15 <razzi> rebalance kafka partitions for webrequest_upload partition 16 [analytics]
18:44 <mforns> finished deployment of refinery (session length oozie job) [analytics]
18:16 <mforns> starting deployment of refinery (session length oozie job) [analytics]
16:54 <razzi> rebalance kafka partitions for webrequest_upload partition 15 [analytics]
07:05 <elukey> all hadoop worker nodes on Buster [analytics]