3951-4000 of 10000 results (44ms)
2016-03-19 §
22:28 <jynus> powercycling oxygen, looks kernel-dead [production]
22:16 <urandom> removing 22G of heap dumps from restbase2004.codfw.wmnet [production]
22:16 <urandom> removing 22G of heap dumps [production]
22:07 <urandom> clearing snapshots on restbase2004.codfw.wmnet [production]
15:43 <reedy@tin> Synchronized wmf-config/throttle.php: Throttle rules for event T130447 (duration: 00m 26s) [production]
11:38 <godog> restart slapd on seaborgium, oom-killed [production]
10:51 <hashar> Labs LDAP is probably down. T130446 Cant log to tools-login.wmflabs.org / Jenkins interface and Nodepool yields error 500 communicating with OpenStack API [production]
02:31 <l10nupdate@tin> ResourceLoader cache refresh completed at Sat Mar 19 02:31:46 UTC 2016 (duration 8m 31s) [production]
02:23 <mwdeploy@tin> sync-l10n completed (1.27.0-wmf.17) (duration: 10m 07s) [production]
01:54 <urandom> bootstrapping restbase1013-b.eqiad.wmnet : T125842 [production]
2016-03-18 §
23:35 <krinkle@tin> Synchronized php-1.27.0-wmf.17/extensions/WikimediaEvents/modules/ext.wikimediaEvents.deprecate.js: (no message) (duration: 00m 35s) [production]
21:11 <ostriches> cleaned up stale /srv/mediawiki/php-1.27.0-wmf.{10,11} from the apaches. [production]
21:09 <krinkle@tin> Synchronized wmf-config/missing.php: (no message) (duration: 00m 25s) [production]
20:53 <ottomata> reenabling puppet on krypton [production]
19:52 <ottomata> temporarily disabling puppet on krypton [production]
19:21 <ori> rebooting bohrium [production]
19:20 <ori> upgraded bohrium VM: vcpus 2 => 8, ram 4 => 8g [production]
19:06 <ori@tin> Synchronized wmf-config/logging.php: Iabca8858e: Allow finer-grained control over debug logging via XWD (duration: 00m 32s) [production]
18:56 <demon@tin> Synchronized .arclint: no op really, co master sync (duration: 00m 39s) [production]
18:08 <gehel> restarting elasticsearch server elastic1031.eqiad.wmnet [production]
17:59 <mutante> netmon1001: failed torrus service - recovery steps as outlined on wikitech [[Torrus]] [production]
17:55 <ori> on bohrium: /etc/apache2/sites-enabled/.links2 ; was causing puppet to refresh apache2 on each run [production]
17:30 <gehel> restarting elasticsearch server elastic1030.eqiad.wmnet [production]
17:05 <gehel> restarting elasticsearch server elastic1029.eqiad.wmnet [production]
16:53 <jynus> starting enwiki import to labs from dbstore1002 (expect lag and consistency problems during the hot import) [production]
16:37 <moritzm> restarted hhvm on mw1205 [production]
16:30 <moritzm> bumped connection tracking table size on mw1161-mw1169 to 524288 to cope with currently elevated connections on those (T130364) [production]
16:19 <godog> reboot ms-be2010 to pick up new disk ordering [production]
15:23 <elukey@tin> Synchronized wmf-config/jobqueue-eqiad.php: REVERT - Re-enabled persistence between Job Queues and Job Runners. (duration: 00m 19s) [production]
15:03 <elukey@tin> Synchronized wmf-config/jobqueue-eqiad.php: Re-enabled persistence between Job Queues and Job Runners. (duration: 00m 30s) [production]
15:02 <godog> bootstrap restbase1013-a [production]
14:36 <gehel> restarting elasticsearch server elastic1028.eqiad.wmnet [production]
14:02 <elukey> restarted eventlog1001.eqiad.wmnet and eventlog2001.codfw.wmnet for kernel upgrade [production]
13:43 <gehel> restarting elasticsearch server elastic1027.eqiad.wmnet [production]
13:24 <gehel> restarting pybal on lvs2003.codfw.wmnet [production]
13:22 <gehel> enabling all nodes for service search.svc.codfw.wmnet:9243 (elastic-https) on codfw [production]
13:22 <gehel> restarting pybal on lvs2006.codfw.wmnet [production]
13:06 <gehel> restarting elasticsearch server elastic1026.eqiad.wmnet [production]
12:43 <gehel> restarting elasticsearch server elastic1025.eqiad.wmnet [production]
12:35 <godog> finished ms-fe1* rolling reboot [production]
12:15 <godog> finished ms-be1* rolling reboot [production]
12:00 <elukey> Forcing puppet agent run on all the Jobrunners and videoscalers since rdb1005 is now back in service. Will also restart jobchron as well. [production]
11:58 <elukey> Added rdb1005 back to the jobrunners puppet config after maintenance. [production]
11:57 <gehel> restarting elasticsearch server elastic1024.eqiad.wmnet [production]
11:46 <gehel> restarting pybal on lvs1003 [production]
11:43 <elukey@tin> Synchronized wmf-config/jobqueue-eqiad.php: Add rdb1005 back to the Redis Job Queues after maintenance (duration: 01m 22s) [production]
11:23 <moritzm> powercycled mw1163, hung on reboot and serial console stuck [production]
11:05 <moritzm> rolling reboot of mw1161 to mw1169 for kernel upgrade [production]
11:04 <gehel> restarting pybal on lvs1012 [production]
11:04 <gehel> restarting pybal on lvs1009 [production]