151-200 of 1006 results (16ms)
2017-08-23 §
10:12 <joal> Restart oozie webrequest-load bundle after deploy and updates [analytics]
10:09 <joal> Alter webrequest table before restarting oozie load bundle [analytics]
10:06 <joal> Deploying refinery onto hdfs [analytics]
09:59 <joal> Deploying refinery [analytics]
09:59 <joal> Kill oozie webrequest-load bundle for restart after deploy [analytics]
08:25 <joal> Deploying refinery-source v0.0.50 using jenkins [analytics]
2017-08-22 §
19:52 <joal> Drop / recreate wmf.mediawiki_history table for naming correction [analytics]
13:57 <ottomata> sudo -u hdfs hdfs dfs -rm /tmp/druid-indexing/classpath/guava.jar (guava 11.0.2 is conflicting with guava 16.0.1. from druid-hdfs-storage-cdh extension). Not sure how guava 11.0.2 got there, but let's see if it doesn't come back [analytics]
08:27 <joal> Rerun druid loading jobs after night failures [analytics]
2017-08-21 §
13:46 <ottomata> adding index on (database, rev_timestamp) on mediawiki_page_create_2 table on db1047: T170990 [analytics]
13:26 <ottomata> adding index on (database, rev_timestamp) on mediawiki_page_create_2 table on dbstore1002: T170990 [analytics]
2017-08-14 §
16:40 <elukey> analytics1034 back in service after swapping the eth cable - T172633 [analytics]
2017-08-10 §
20:06 <milimetric> stopped Wikimetrics web and queue on wikimetrics-01.eqiad.wmflabs because the queue ran into errors connecting to the database (max 10 connections limit reached) [analytics]
08:59 <elukey> updated librdkafka1 to 0.9.4.1 on eventlog1001 [analytics]
2017-08-08 §
18:39 <elukey> restart projectview-hourly-wf-2017-8-8-14, pageview-druid-hourly-wf-2017-8-8-14, pageview-hourly-wf-2017-8-8-14 via Hue (analytics1055 disk failure) [analytics]
14:20 <elukey> restart varnishkafka statsv/eventlogging instances to pick up https://gerrit.wikimedia.org/r/#/c/370637/ (kafka protocol explicitly set to 0.9.0.1) [analytics]
2017-08-06 §
11:02 <elukey> stop yarn on analytics1034 to reload the tg3 driver - T172633 [analytics]
2017-08-03 §
16:15 <ottomata> druid cluster restarted with 0.9.2 mysql-metadata-storage extension, un-suspending oozie druid jobs [analytics]
14:11 <ottomata> pausing oozie druid jobs and doing a cluster upgrade/restart again to make sure updated version of mysql-metadata-storage jar is properly loaded [analytics]
09:56 <elukey> set piwik in maintenance mode to allow mysql updates [analytics]
08:08 <elukey> restarted Druid jobs failed over night (drud_loader.py error) and due to Hive metastore restart [analytics]
08:03 <elukey> restart hive-metastore to pick up new JVM Xms settings [analytics]
2017-08-02 §
14:34 <ottomata> beginning druid upgrade to 0.92 (take 2 :) ) [analytics]
14:23 <elukey> restart hive-server to pick up JVM Xms4g change [analytics]
14:22 <ottomata> suspending druid oozie jobs [analytics]
2017-08-01 §
18:57 <madhuvishy> Bumped instance quota to 24 instances (nova quota-update analytics --instances 24) [analytics]
17:24 <ottomata> beginning druid upgrade to 0.9.2 http://druid.io/docs/0.9.2/operations/rolling-updates.html [analytics]
17:10 <ottomata> pausing all druid oozie coordinators [analytics]
12:49 <elukey> restart hive daemons on analytics1003 to pick up new jvm settings (bigger Xmx, JMX ports) [analytics]
10:05 <elukey> suspended again webrequest-load-bundle as prep step to restart the hive daemons [analytics]
07:58 <elukey> suspended webrequest-load-bundle as prep step to restart the hive daemons [analytics]
07:03 <elukey> restarted mobile_apps-session_metrics-coord-global-30days failed job via Hue [analytics]
2017-07-31 §
13:45 <elukey> suspended webrequest-load-bundle as prep step to restart hive metastore/server [analytics]
10:34 <elukey> restart hive-server on an1003 - beeline not connecting, thrift errors [analytics]
2017-07-28 §
07:55 <elukey> update nodejs to 6.11 on aqs1004 (testing prod node after beta qa) [analytics]
07:54 <elukey> re-run webrequest-load-wf-upload-2017-7-28-6 from Hue (was playing with eth0 issues on an1034) [analytics]
02:08 <ottomata> stat1002: disabled puppet, umounted /tmp, /home and /a, poweroff [analytics]
2017-07-26 §
21:01 <mforns> Deployed refinery using scap, then deployed onto hdfs [analytics]
18:57 <mforns> Deployed refinery-source using jenkins [analytics]
2017-07-25 §
17:43 <bd808> Forced puppet run on zk1-1.analytics.eqiad.wmflabs after elukey fixed hiera settings [analytics]
17:34 <bd808> Puppet broken on zk1-1.analytics.eqiad.wmflabs with "$clusters[$cluster_name] is :undef, not a hash or array at /etc/puppet/modules/profile/manifests/zookeeper/server.pp:22" [analytics]
15:24 <elukey> restart cassandra loading after maintenance via hue [analytics]
13:06 <elukey> stop cassandra load bundle, restarting AQS for jvm updates [analytics]
12:13 <elukey> executed sudo apt-get remove openjdk-8-jre openjdk-8-jre-headless on druid nodes [analytics]
2017-07-24 §
14:24 <ottomata> restarted mysql-eventbus eventlogging consumer with new consumer group [analytics]
2017-07-20 §
20:31 <nuria_> restaring eventlogging on eventlog1001 [analytics]
20:30 <nuria_> deploying eventlogging c1c2c39411ccd002ff8cea197bc535155213f5fb and restarting [analytics]
18:18 <ottomata> deleted instance deployment-eventlogging03 in favor of new instance deployment-eventlog02 [analytics]
17:14 <ottomata> killed tranquility instances tranq-banners and tranq-netflow running on druid1003 in joal's screen sessions [analytics]
2017-07-18 §
13:04 <ottomata> adding unique index on meta_id and index on meta_dt to mediawiki_page_{create,delete,move,undelete}_1 on db1046 MySQL eventlogging master [analytics]