251-300 of 10000 results (14ms)
2016-02-06 §
05:43 <bblack> rebooted cp2006 via racadm after crash - no crash data in logs... [production]
2016-02-05 §
23:54 <chasemp> nfs shaping is really writes :) [production]
23:54 <chasemp> tc to shape some nfs read traffic in tools for labs (also logged there) can be cancelled with: /sbin/tc qdisc del dev eth0 root [production]
23:51 <YuviPanda> dropped old nfs snapshots from labstore1001 [production]
23:30 <maxsem@mira> Synchronized portals: (no message) (duration: 01m 18s) [production]
23:29 <maxsem@mira> Synchronized portals/prod/wikipedia.org/assets: (no message) (duration: 01m 19s) [production]
22:56 <jynus> reimaging db1018 [production]
22:48 <jynus> restarting slave on m2/codfw (db2011) [production]
22:41 <krenair@mira> Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/268818/ (duration: 01m 22s) [production]
22:10 <bblack> cache rolling reboots stopped for the weekend, can pick up the other half monday [production]
20:36 <bblack> resuming rolling cache reboots [production]
20:07 <mutante> cygnus - reboot VM [production]
19:28 <bblack> halted rolling cache reboots, we seem to be having problems with a batch of them coming back... [production]
18:23 <demon@mira> Synchronized wmf-config/InitialiseSettings.php: comment stuff, gerrit 267994 (duration: 01m 19s) [production]
18:15 <jynus> stopping mysql@db1018 and starting to clone it for reimaging [production]
18:10 <jynus@mira> Synchronized wmf-config/db-eqiad.php: Depool db1018 for maintenance (duration: 02m 12s) [production]
17:31 <cmjohnson1> trouble shooting elastic1021 [production]
17:07 <bblack> rolling cpNNNN reboots are 27% complete, only two hosts so far failed to reboot on their own (but came up fine after manual racadm powercycle) [production]
16:20 <ottomata> reenabling kafka1012 in analytics-eqiad kafka cluster [production]
16:03 <jynus> reimaging db2030 to test jessie installer [production]
15:53 <oblivian@tin> sync-l10n completed (1.27.0-wmf.12) (duration: 00m 08s) [production]
15:47 <urandom> performing rolling restbase restart in staging env [production]
15:38 <_joe_> launched l10update cronjob manually, was not running since tin's reimaging [production]
15:35 <andrewbogott> rebooting silver for kernel update - wikitech outage will ensue [production]
15:33 <urandom> re-restarting restbase on restbase1002.eqiad.wmnet,restbase1005.eqiad.wmnet,restbase1006.eqiad.wmnet,restbase1009.eqiad.wmnet (prior restarts may have happened before puppet run) [production]
15:29 <andrewbogott> rebooting holmium for kernel update [production]
15:27 <andrewbogott> rebooting labcontrol1002 for kernel update [production]
15:24 <bblack> cp3005 didn't come back online during rolling reboot, investigating (remains depooled) [production]
15:22 <_joe_> initializing mediawiki repos on tin [production]
15:22 <andrewbogott> rebooting labnet1001 for kernel update [production]
15:15 <urandom> restbase rolling restart complete [production]
15:08 <urandom> performing rolling restbase restart to apply config change (https://gerrit.wikimedia.org/r/#/c/268611/) [production]
14:56 <urandom> forcing puppet run and bouncing restbase on restbase1001.eqiad.wmnet (https://gerrit.wikimedia.org/r/#/c/268611/) [production]
14:41 <elukey> confctl mw1228.eqiad.wmnet: weight changed 10 => 20 [production]
14:24 <moritzm> rebooting db2065 to db2070 for kernel update [production]
14:20 <jynus> reimporting nlwiktionary revision into labs (expect some temporary lag on labs-s3) [production]
14:06 <moritzm> rebooting db2060 to db2064 for kernel update [production]
13:34 <bblack> starting rolling reboots of cp* (traffic cache hosts) for kernel updates [production]
12:50 <moritzm> rebooting db2055 to db2059 for kernel update [production]
12:38 <elukey> repooled mw1228.eqiad.wmnet [production]
12:34 <moritzm> rebooting db2050 to db2054 for kernel update [production]
12:15 <moritzm> rebooting db2045 to db2049 for kernel update [production]
12:07 <jynus> reimporting nlwiktionary pages into labs [production]
12:05 <l10nupdate@tin> LocalisationUpdate failed: git pull of core failed [production]
12:05 <l10nupdate@tin> LocalisationUpdate failed: git clone of core failed [production]
11:54 <moritzm> rebooting db2041 to db2044 for kernel update [production]
11:37 <moritzm> rebooting db2038 to db2040 for kernel update [production]
11:36 <godog> start swiftrepl replication pass of common thumbs eqiad -> codfw [production]
10:15 <moritzm> rolling reboot of ocg* cluster [production]
02:27 <mobrovac> restbase deploy end of caae1f7 [production]