201-250 of 10000 results (30ms)
2018-01-17 ยง
15:15 <andrewbogott> repooling exec-manage tools-exec-1430. [tools]
15:04 <andrewbogott> depooling exec-manage tools-exec-1430. Experimenting with purge-old-kernels [tools]
14:57 <moritzm> resetting RAC on labsdb1007 (serial console inaccessible) [production]
14:53 <moritzm> resetting RAC on labsdb1006 (serial console inaccessible) [production]
14:42 <elukey> restart druid middlemanager on druid1003 as attempt to unblock realtime streaming [analytics]
14:38 <chasemp> labstore1001:~# /sbin/reboot [production]
14:27 <zeljkof> EU SWAT finished [production]
14:23 <zfilipin@tin> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:404327|Create "eliminator" user group on ur.wikipedia (T184607)]] (duration: 01m 12s) [production]
14:21 <elukey> forced kill of banner impression data streaming job to get it restarted [analytics]
14:14 <moritzm> repooling chromium [production]
14:14 <zfilipin@tin> Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:404624|Add Draft Namespace in enwikiversity (T184957)]] (duration: 01m 12s) [production]
14:09 <arturo> T181647 aborrero@tools-clushmaster-01:~$ clush -w @all 'sudo puppet agent --test' [tools]
14:07 <moritzm> rebooting chromium for kernel security update [production]
14:04 <gehel> restart of elasticsearch / cirrus eqiad completed (cluster still recovering) [production]
14:03 <moritzm> depooling chromium [production]
13:51 <chasemp> reboot labstore2003 [production]
13:46 <akosiaris> reboot sca2003 webperf2001 planet2001 poolcounter2002 mx2001 kubetcd200{1,2,3} install2002 dbmonitor2001 alsafi acrux hassaleh diadem nihal pybal-test200{1,2,3} releases2001 tureis for PCID, INVPCID [production]
13:45 <chasemp> labstore2002:~# sudo update-grub && /sbin/reboot [production]
13:40 <chasemp> labstore2001:~# /sbin/reboot [production]
13:39 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Slowly repool db1104 (duration: 01m 13s) [production]
13:31 <akosiaris> reboot acrab for PCID,INVPCID enabling [production]
13:22 <marostegui> Deploy schema change on db1099:3318 - https://phabricator.wikimedia.org/T174569 [production]
13:22 <marostegui@tin> Synchronized wmf-config/db-eqiad.php: Depool db1099:3318 - T174569 (duration: 01m 12s) [production]
13:17 <moritzm> upgrading app server canaries to 3.18.5+wmf4 [production]
13:12 <marostegui> Fixing drifts on db1065 - T162807 [production]
13:10 <hashar> nodepool: updating snapshot to get hhvm +wmf4 for T185024 : nodepool image-update wmflabs-eqiad snapshot-ci-jessie [releng]
12:28 <moritzm> uploading HHVM 3.18.5+wmf4 for jessie-wikimedia to apt.wikimedia.org (3.18.7 with the patch https://github.com/facebook/hhvm/commit/bd7b2bcfe70b053a3a001480653012f68599250f backed out) [production]
12:10 <moritzm> updating HHVM in deployment-prep to 3.18.5+wmf4 [production]
11:44 <elukey> re-run pageview-druid-hourly-wf-2018-1-17-9 and pageview-druid-hourly-wf-2018-1-17-8 (failed due to druid1002's middlemanager being in a weird state after reboot) [analytics]
11:44 <elukey> restart druid middlemanager on druid1002 [analytics]
11:40 <godog> bootstrap cassandra-b on restbase1016 [production]
11:28 <moritzm> rearmed keyholder on neodymium [production]
11:24 <moritzm> rebooting neodymium for kernel security update [production]
11:19 <_joe_> restarted nginx on mw1346, was in a bad state [production]
10:51 <moritzm> reset RAC on chromium, serial console is inaccessible [production]
10:42 <moritzm> repooling hydrogen [production]
10:39 <moritzm> rebooting hydrogen for kernel security update [production]
10:38 <elukey> stopped all crons on hadoop-coordinator-1 [analytics]
10:37 <elukey> re-run webrequest-druid-hourly-wf-2018-1-17-8 (failed due to druid1002's reboot) [analytics]
10:34 <moritzm> depooling hydrogen again [production]
10:22 <moritzm> repooling hydrogen (and pdns-recursor restarted), experiment concluded [production]
10:22 <elukey> reboot druid1002 for kernel upgrades [analytics]
10:14 <moritzm> depooling hydrogen (and keeping pdns-recursor stopped for a few minutes to check whether problems with load-balanced recdns traffic are still an issue) [production]
10:11 <moritzm> reset RAC on hydrogen, serial console was inaccessible [production]
10:01 <godog> start cassandra-a on restbase1016 [production]
09:53 <elukey> disable druid middlemanager on druid1002 as prep step for reboot [analytics]
09:52 <elukey> reboot druid1005 for kernel upgrades [production]
09:46 <elukey> rebooted analytics1003 [analytics]
09:46 <elukey> removed upstart config for brrd on eventlog1001 (failing and spamming syslog, old leftover?) [analytics]
09:46 <elukey> removed upstart config for brrd on eventlog1001 (failing and spamming syslog, old leftover?) [production]