production SAL

251-300 of 10000 results (25ms)

2016-02-06 §
05:43	<bblack>	rebooted cp2006 via racadm after crash - no crash data in logs...	[production]
2016-02-05 §
23:54	<chasemp>	nfs shaping is really writes :)	[production]
23:54	<chasemp>	tc to shape some nfs read traffic in tools for labs (also logged there) can be cancelled with: /sbin/tc qdisc del dev eth0 root	[production]
23:51	<YuviPanda>	dropped old nfs snapshots from labstore1001	[production]
23:30	<maxsem@mira>	Synchronized portals: (no message) (duration: 01m 18s)	[production]
23:29	<maxsem@mira>	Synchronized portals/prod/wikipedia.org/assets: (no message) (duration: 01m 19s)	[production]
22:56	<jynus>	reimaging db1018	[production]
22:48	<jynus>	restarting slave on m2/codfw (db2011)	[production]
22:41	<krenair@mira>	Synchronized wmf-config/InitialiseSettings.php: https://gerrit.wikimedia.org/r/#/c/268818/ (duration: 01m 22s)	[production]
22:10	<bblack>	cache rolling reboots stopped for the weekend, can pick up the other half monday	[production]
20:36	<bblack>	resuming rolling cache reboots	[production]
20:07	<mutante>	cygnus - reboot VM	[production]
19:28	<bblack>	halted rolling cache reboots, we seem to be having problems with a batch of them coming back...	[production]
18:23	<demon@mira>	Synchronized wmf-config/InitialiseSettings.php: comment stuff, gerrit 267994 (duration: 01m 19s)	[production]
18:15	<jynus>	stopping mysql@db1018 and starting to clone it for reimaging	[production]
18:10	<jynus@mira>	Synchronized wmf-config/db-eqiad.php: Depool db1018 for maintenance (duration: 02m 12s)	[production]
17:31	<cmjohnson1>	trouble shooting elastic1021	[production]
17:07	<bblack>	rolling cpNNNN reboots are 27% complete, only two hosts so far failed to reboot on their own (but came up fine after manual racadm powercycle)	[production]
16:20	<ottomata>	reenabling kafka1012 in analytics-eqiad kafka cluster	[production]
16:03	<jynus>	reimaging db2030 to test jessie installer	[production]
15:53	<oblivian@tin>	sync-l10n completed (1.27.0-wmf.12) (duration: 00m 08s)	[production]
15:47	<urandom>	performing rolling restbase restart in staging env	[production]
15:38	<_joe_>	launched l10update cronjob manually, was not running since tin's reimaging	[production]
15:35	<andrewbogott>	rebooting silver for kernel update - wikitech outage will ensue	[production]
15:33	<urandom>	re-restarting restbase on restbase1002.eqiad.wmnet,restbase1005.eqiad.wmnet,restbase1006.eqiad.wmnet,restbase1009.eqiad.wmnet (prior restarts may have happened before puppet run)	[production]
15:29	<andrewbogott>	rebooting holmium for kernel update	[production]
15:27	<andrewbogott>	rebooting labcontrol1002 for kernel update	[production]
15:24	<bblack>	cp3005 didn't come back online during rolling reboot, investigating (remains depooled)	[production]
15:22	<_joe_>	initializing mediawiki repos on tin	[production]
15:22	<andrewbogott>	rebooting labnet1001 for kernel update	[production]
15:15	<urandom>	restbase rolling restart complete	[production]
15:08	<urandom>	performing rolling restbase restart to apply config change (https://gerrit.wikimedia.org/r/#/c/268611/)	[production]
14:56	<urandom>	forcing puppet run and bouncing restbase on restbase1001.eqiad.wmnet (https://gerrit.wikimedia.org/r/#/c/268611/)	[production]
14:41	<elukey>	confctl mw1228.eqiad.wmnet: weight changed 10 => 20	[production]
14:24	<moritzm>	rebooting db2065 to db2070 for kernel update	[production]
14:20	<jynus>	reimporting nlwiktionary revision into labs (expect some temporary lag on labs-s3)	[production]
14:06	<moritzm>	rebooting db2060 to db2064 for kernel update	[production]
13:34	<bblack>	starting rolling reboots of cp* (traffic cache hosts) for kernel updates	[production]
12:50	<moritzm>	rebooting db2055 to db2059 for kernel update	[production]
12:38	<elukey>	repooled mw1228.eqiad.wmnet	[production]
12:34	<moritzm>	rebooting db2050 to db2054 for kernel update	[production]
12:15	<moritzm>	rebooting db2045 to db2049 for kernel update	[production]
12:07	<jynus>	reimporting nlwiktionary pages into labs	[production]
12:05	<l10nupdate@tin>	LocalisationUpdate failed: git pull of core failed	[production]
12:05	<l10nupdate@tin>	LocalisationUpdate failed: git clone of core failed	[production]
11:54	<moritzm>	rebooting db2041 to db2044 for kernel update	[production]
11:37	<moritzm>	rebooting db2038 to db2040 for kernel update	[production]
11:36	<godog>	start swiftrepl replication pass of common thumbs eqiad -> codfw	[production]
10:15	<moritzm>	rolling reboot of ocg* cluster	[production]
02:27	<mobrovac>	restbase deploy end of caae1f7	[production]