1051-1100 of 4750 results (22ms)
2021-04-08 §
07:44 <elukey> restart hadoop hdfs masters on an-master100[1,2] to apply the new log4j settings fro the audit log [analytics]
06:44 <elukey> re-deployed refinery to hadoop-test after fixing permissions on an-test-coord1001 [analytics]
2021-04-07 §
23:03 <ottomata> installing anaconda-wmf-2020.02~wmf5 on remaining nodes - T279480 [analytics]
22:51 <ottomata> installing anaconda-wmf-2020.02~wmf5 on stat boxes - T279480 [analytics]
22:47 <mforns> finished refinery deployment up to 1dbbd3dfa996d2e970eb1cbc0a63d53040d4e3a3 [analytics]
22:39 <mforns> deployment of refinery via scap to hadoop-test failed with Permission denied: '/srv/deployment/analytics/refinery-cache/.config' (deployemt to production went fine) [analytics]
21:44 <mforns> starting refinery deploy up to 1dbbd3dfa996d2e970eb1cbc0a63d53040d4e3a3 [analytics]
21:26 <mforns> deployed refinery-source v0.1.4 [analytics]
21:25 <razzi> sudo apt-get install --reinstall sudo apt-get install --reinstall anaconda-wmf on stat1008 [analytics]
20:15 <razzi> rebalance kafka partitions for webrequest_text partitions 15, 16 [analytics]
19:53 <ottomata> upgrade anaconda-wmf everywhere to 2020.02~wmf4 with fixes for T279480 [analytics]
14:03 <hnowlan> setting profile::aqs::git_deploy: true in aqs-test1001 hiera config [analytics]
2021-04-06 §
22:34 <razzi> rebalance kafka partitions for webrequest_text_13,14 [analytics]
09:37 <elukey> reimage an-coord1002 to Debian Buster [analytics]
2021-04-05 §
16:07 <razzi> remove old hive logs on an-coord1001: sudo rm /var/log/hive/hive-*.log.2021-02-* [analytics]
14:54 <razzi> remove empty /var/log/sqoop on an-launcher1002 (logs go in /var/log/refinery); sudo rmdir /var/log/sqoop [analytics]
14:51 <razzi> rebalance kafka partitions for webrequest_text partitions 11, 12 [analytics]
2021-04-02 §
16:28 <razzi> rebalance kafka partitions for webrequest_text partitions 9,10 [analytics]
16:19 <elukey> all the Hadoop test cluster on Debian Buster [analytics]
07:28 <elukey> manual fix for an-worker1080's interface in netbox (xe-4/0/11), moved by mistake to public-1b [analytics]
2021-04-01 §
20:27 <razzi> restore superset_production from backup superset_production_1617306805.sql [analytics]
20:14 <razzi> manually run bash /srv/deployment/analytics/superset/deploy/create_virtualenv.sh as analytics_deploy on an-tool1010, since somehow it didn't run with scap [analytics]
20:01 <razzi> sudo chown -R analytics_deploy:analytics_deploy /srv/deployment/analytics/superset/venv since it's owned by root and needs to be removed upon deployment [analytics]
19:54 <razzi> dump superset production to an-coord1001.eqiad.wmnet:/home/razzi/superset_production_1617306805.sql just in case [analytics]
16:50 <razzi> rebalance kafka partitions for webrequest_text partitions 7 and 8 [analytics]
2021-03-31 §
14:18 <hnowlan> starting copy of large tables from aqs1007 to aqs1011 [analytics]
2021-03-30 §
20:25 <joal> Kill-Restart data_quality_stats-hourly-bundle after deploy [analytics]
20:19 <joal> Deploying refinery onto HDFS [analytics]
19:57 <joal> Deploying refinery using scap [analytics]
19:57 <joal> Refinery-source released to archiva and new jars commited to refinery (v0.1.3) [analytics]
17:07 <razzi> rebalance kafka partitions for webrequest_text partitions 5 and 6 [analytics]
12:35 <hnowlan> Depooling aqs1004 for another transfer of local_group_default_T_pageviews_per_article_flat [analytics]
12:30 <elukey> restart reportupdater-codemirror on an-launcher1002 fro T275757 [analytics]
11:30 <elukey> ERRATA: upgrade to 2.3.6-2 [analytics]
11:29 <elukey> upgrade hive client packages to 2.3.6-1 on an-launcher1002 (already applied to all stat100x) [analytics]
2021-03-25 §
15:58 <elukey> disable vmemory checks in Yarn nodemanagers on Hadoop [analytics]
13:53 <elukey> systemctl restart performance-asotranking on stat1007 for T276121 [analytics]
08:14 <elukey> upgrade hive packages on stat100x to 2.6.3-2 - T276121 [analytics]
08:12 <elukey> upgrade hive packages in thirdparty/bigtop15 to 2.3.6-2 for buster-wikimedia [analytics]
2021-03-24 §
18:49 <elukey> systemctl restart refinery-import-* failed jobs (/mnt/hdfs errors due to me umounting the mountpoint) [analytics]
18:43 <elukey> kill fuse hdfs mount process on an-launcher1002, re-mounted /mnt/hdfs, too many processes in D state [analytics]
15:46 <razzi> rebalance kafka partitions for webrequest_text partitions 3 and 4 [analytics]
05:40 <razzi> sudo chown analytics /var/log/refinery/sqoop-mediawiki.log.1 on an-launcher1002 and restart logrotate [analytics]
2021-03-22 §
18:12 <elukey> drop /srv/.hardsync* to clean up hardlinks not needed [analytics]
18:07 <elukey> run rm -rfv .hardsync.*/archive/public-datasets/* on thorium:/srv to clean up files to drop (didn't work) [analytics]
18:01 <elukey> drop /srv/.hardsync*trash* on thorium - old hardlinks that should have been trashed [analytics]
15:52 <razzi> rebalance kafka partitions for webrequest_text partition 2 [analytics]
09:28 <elukey> move the yarn scheduler in hadoop test to capacity [analytics]
2021-03-19 §
15:44 <razzi> rebalance kafka partitions for webrequest_text partition 1 [analytics]
2021-03-18 §
19:30 <razzi> rename /usr/lib/python2.7/dist-packages/cqlshlib/copyutil.so back [analytics]