1001-1050 of 4632 results (24ms)
2021-03-15 §
07:48 <joal> Manually start mediawiki-history-drop-snapshot.service to check the run succeeds [analytics]
07:47 <joal> Drop hive wmf.mediawiki_wikitext_history snapshot partitions (2020-08, 2020-09, 2020-10, 2020-11) [analytics]
2021-03-14 §
20:49 <joal> Manually clean some data ( mediawiki-history-drop-snapshot.service seems not working) [analytics]
20:46 <joal> Force a run of mediawiki-history-drop-snapshot.service to clean up some data [analytics]
2021-03-12 §
17:20 <elukey> kill duplicate mediawiki-wikitext-history coordinator failing and sending emails to alerts@ [analytics]
07:21 <elukey> re-run monitor_refine_event_failure_flags [analytics]
2021-03-11 §
22:31 <razzi> rebalance kafka partitions for webrequest_upload partition 17 [analytics]
20:20 <razzi> disable maintenance mode for matomo1002 [analytics]
20:08 <razzi> starting reboot of matomo1002 for kernel upgrade [analytics]
18:52 <razzi> systemctl restart hadoop-hdfs-datanode on analytics1059 [analytics]
18:50 <razzi> systemctl restart hadoop-yarn-nodemanager on analytics1059 [analytics]
18:35 <razzi> apt-get install parted on analytics1059 [analytics]
15:34 <razzi> rebalance kafka partitions for webrequest_upload partition 17 [analytics]
10:52 <elukey> drop /home/bsitzmann on all stat100x hosts - T273712 [analytics]
08:25 <elukey> drop database dedcode cascade in hive - T276748 [analytics]
08:15 <elukey> hdfs dfs -rmr /user/dedcode on an-launcher1002 (data in trash for a month) - T276748 [analytics]
2021-03-10 §
23:15 <razzi> rebalance kafka partitions for webrequest_upload partition 16 [analytics]
18:44 <mforns> finished deployment of refinery (session length oozie job) [analytics]
18:16 <mforns> starting deployment of refinery (session length oozie job) [analytics]
16:54 <razzi> rebalance kafka partitions for webrequest_upload partition 15 [analytics]
07:05 <elukey> all hadoop worker nodes on Buster [analytics]
06:28 <elukey> force the re-run of refine_eventlogging_legacy - failed due to worker reimage in progress [analytics]
06:17 <elukey> reimage an-worker1111 to buster [analytics]
2021-03-09 §
22:00 <razzi> rebalance kafka partitions for webrequest_upload partition 14 [analytics]
20:42 <elukey> reimaged an-worker1091 to buster [analytics]
18:26 <elukey> reimage an-worker1087 to buster [analytics]
16:40 <elukey> reimage analytics1077 to buster [analytics]
15:36 <razzi> rebalance kafka partitions for webrequest_upload partition 13 [analytics]
15:18 <elukey> reimage analytics1072 (hadoop hdfs journal node) to buster [analytics]
14:29 <elukey> drain + reimage an-worker1090/89 to Buster [analytics]
13:26 <elukey> reimage an-worker1102 and an-worker1080 (hdfs journal node) to Buster [analytics]
12:59 <elukey> drain + reimage an-worker1103 to Buster [analytics]
09:14 <elukey> drain + reimage analytics1076 and an-worker1112 to Buster [analytics]
07:01 <elukey> drain + reimage an-worker109[4,5] to Buster [analytics]
2021-03-08 §
23:22 <razzi> rebalance kafka partitions for webrequest_upload partition 12 [analytics]
18:49 <razzi> rebalance kafka partitions for webrequest_upload partition 11 [analytics]
18:11 <elukey> drain + reimage an-worker11[15,16] to Buster [analytics]
17:12 <elukey> drain + reimage an-worker11[13,14] to Buster [analytics]
16:17 <elukey> drain + reimage an-worker1109/1110 to Buster [analytics]
14:54 <elukey> drain + reimage an-worker110[7,8] to Buster [analytics]
14:52 <ottomata> altered topics (eqiad|codfw).mediawiki.client.session_tick to have 2 partitions - T276502 [analytics]
13:51 <elukey> drain + reimage an-worker110[4,5] to Buster [analytics]
10:41 <elukey> drain + reimage an-worker1104/1089 to Debian Buster [analytics]
09:19 <elukey> drain + reimage an-worker108[3,4] to Buster [analytics]
08:20 <elukey> drain + reimage an-worker108[1,2] to Buster [analytics]
07:23 <elukey> drain + reimage analytics107[4,5] to Buster [analytics]
2021-03-07 §
08:00 <elukey> "megacli -LDSetProp -ForcedWB -Immediate -Lall -aAll" on analytics1066 [analytics]
07:49 <elukey> umount /var/lib/hadoop/data/e on analytics1059 and restart hadoop daemons to exclude failed disk - T276696 [analytics]
2021-03-05 §
18:30 <razzi> run again sudo -i wmf-auto-reimage-host -p T269211 clouddb1021.eqiad.wmnet --new [analytics]
18:18 <razzi> sudo cookbook sre.dns.netbox -t T269211 "Move clouddb1021 to private vlan" [analytics]