1751-1800 of 4855 results (23ms)
2020-10-22 §
06:54 <elukey> restart httpd on matomo1002, errors while connecting [analytics]
06:31 <elukey> restart turnilo to apply new settings for wmf_netflow [analytics]
06:06 <elukey> execute "sudo -u hdfs kerberos-run-command hdfs hdfs dfs -chown -R analytics /wmf/data/archive/geoip" on an-launcher1002 - permission issues for 'analytics' and /wmf/data/archive/geoip [analytics]
02:37 <ottomata> re-run webrequest-load-wf-{text,upload}-2020-10-21-{19,20} oozie jobs after they timed out waiting for data due to camus misconfiguration (fixed in https://gerrit.wikimedia.org/r/c/operations/puppet/+/635678) [analytics]
2020-10-21 §
20:12 <razzi> stop nginx on analytics-tool1001.eqiad.wmnet to switch to envoy (hue-next) [analytics]
20:10 <razzi> stop nginx on analytics-tool1001.eqiad.wmnet to switch to envoy (hue) [analytics]
20:07 <razzi> stop nginx on analytics-tool1007.eqiad.wmnet to switch to envoy (turnilo) [analytics]
20:05 <razzi> stop nginx on analytics-tool1004.eqiad.wmnet to switch to envoy (superset) [analytics]
20:02 <razzi> stop nginx on matomo1002.eqiad.wmnet to switch to envoy [analytics]
10:41 <elukey> decommission analytics1052 from the hadoop cluster [analytics]
10:26 <elukey> move journalnode from analytics1052 (to be decommed) to an-worker1080 [analytics]
2020-10-20 §
20:59 <mforns> Deploying refinery with refinery-deploy-to-hdfs (for 0.0.137) [analytics]
20:24 <mforns> Deploying refinery with scap for v0.0.137 [analytics]
20:00 <mforns> Deployed refinery-source v0.0.137 [analytics]
15:00 <ottomata> disabling sending EventLogging events to eventlogging-valid-mixed topic - T265651 [analytics]
13:34 <elukey> upgrade superset's presto TLS config after the above changes [analytics]
13:33 <elukey> move presto to pupet host TLS certificates [analytics]
10:29 <klausman> rocm38 install on an-worker1101 successful, rebooting to make sure everything is in place [analytics]
06:41 <elukey> decom analytics1056 from the hadoop cluster [analytics]
2020-10-19 §
14:40 <ottomata> restarted eventlogging-processor with filter to skip events already migrated to event platform - T262304 [analytics]
10:09 <elukey> add pps/bps measures to wmf_netflow in turnilo [analytics]
07:27 <elukey> decom analytics1055 from the hadoop cluster [analytics]
06:47 <elukey> turnilo upgraded to 1.27.0 [analytics]
2020-10-18 §
07:01 <elukey> decom analytics1054 from hadoop [analytics]
2020-10-17 §
06:08 <elukey> decom analytics1053 from the hadoop cluster [analytics]
2020-10-15 §
17:57 <razzi> taking yarn.wikimedia.org offline momentarily to test new tls configuration: T240439 [analytics]
14:51 <elukey> roll restart druid-historical daemons on druid1004-1008 to pick up new conn pooling changes [analytics]
07:03 <elukey> restart oozie to pick up the analytics team's admin list [analytics]
06:09 <elukey> decommission analytics1050 from the hadoop cluster [analytics]
2020-10-14 §
17:39 <joal> Rerun refine for mediawiki_api_request failed hour [analytics]
15:59 <elukey> drain + reboot an-worker1100 to pick up GPU settings [analytics]
15:29 <elukey> drain + reboot an-worker110[1,2] to pick up GPU settings [analytics]
14:56 <elukey> drain + reboot an-worker109[8,9] to pick up GPU settings [analytics]
05:48 <elukey> decom analytics1049 from the Hadoop cluster [analytics]
2020-10-13 §
12:38 <elukey> drop /srv/backup/mysql from an-master1002 (not used anymore) [analytics]
08:59 <klausman> Regenned the jupyterhub venvs on stat1004 [analytics]
07:56 <klausman> re-imaging stat1004 to Buster [analytics]
06:20 <elukey> decom analytics1048 from the Hadoop cluster [analytics]
2020-10-12 §
11:36 <joal> Clean druid test-datasources [analytics]
11:32 <elukey> remove analytics-meta lvm backup settings from an-coord1001 [analytics]
11:23 <elukey> remove analytics-meta lvm backup settings from an-master1002 [analytics]
07:02 <elukey> reduce hdfs block replication factor on Hadoop test to 2 [analytics]
05:37 <elukey> decom analytics1047 from the Hadoop cluster [analytics]
2020-10-11 §
08:33 <elukey> drop some old namenode backups under /srv on an-master1002 to free some space [analytics]
08:24 <elukey> decommission analytics1046 from the hadoop cluster [analytics]
08:12 <elukey> clean up logs on an-launcher1002 (disk space full) [analytics]
2020-10-10 §
12:01 <elukey> decommission analytics1045 from the Hadoop cluster [analytics]
2020-10-09 §
13:17 <elukey> execute "cumin 'stat100[5,8]* or an-worker109[6-9]* or an-worker110[0,1]*' 'apt-get install -y linux-headers-amd64'" [analytics]
11:15 <elukey> bootstrap the Analytics Hadoop test cluster [analytics]
09:47 <elukey> roll restart of hadoop-yarn-nodemanager on all hadoop workers to pick up new settings [analytics]