production SAL

7201-7250 of 10000 results (47ms)

2015-07-13 §
08:50	<godog>	upgrade graphite to 0.9.13 on graphite1001 and bounce one instance of carbon/cache	[production]
07:29	<ori>	Synchronized php-1.26wmf13/includes/cache/LCStoreStaticArray.php: I3f63594a4: Fix variable name (follows Ib2c5856d) (duration: 00m 11s)	[production]
06:25	<LocalisationUpdate>	failed: git pull of core failed	[production]
06:25	<ori>	Experimenting with altering the localisation cache implementation for testwiki, operations/mediawiki-config on tin will have a local hack for a little bit	[production]
05:07	<LocalisationUpdate>	ResourceLoader cache refresh completed at Mon Jul 13 05:07:32 UTC 2015 (duration 7m 31s)	[production]
02:26	<LocalisationUpdate>	ResourceLoader cache refresh completed at Mon Jul 13 02:25:58 UTC 2015 (duration 25m 57s)	[production]
02:23	<LocalisationUpdate>	completed (1.26wmf13) at 2015-07-13 02:23:43+00:00	[production]
02:20	<l10nupdate>	Synchronized php-1.26wmf13/cache/l10n: (no message) (duration: 06m 16s)	[production]
02:10	<LocalisationUpdate>	completed (1.26wmf13) at 2015-07-13 02:10:25+00:00	[production]
02:10	<l10nupdate>	Synchronized php-1.26wmf13/cache/l10n: (no message) (duration: 00m 34s)	[production]
01:47	<springle>	restarted labsdb1002 mysqld while troubleshooting replication	[production]
2015-07-12 §
14:59	<bblack>	upgraded most packages on sodium	[production]
14:48	<bblack>	upgraded apache2 to 2.2.22-1ubuntu1.9 on: antimony argon caesium fluorine helium iodine logstash1001 logstash1003 magnesium neon netmon1001 rhodium stat1001 ytterbium	[production]
04:49	<LocalisationUpdate>	ResourceLoader cache refresh completed at Sun Jul 12 04:49:08 UTC 2015 (duration 49m 7s)	[production]
02:26	<LocalisationUpdate>	completed (1.26wmf13) at 2015-07-12 02:26:52+00:00	[production]
02:25	<LocalisationUpdate>	ResourceLoader cache refresh completed at Sun Jul 12 02:25:33 UTC 2015 (duration 25m 32s)	[production]
02:23	<l10nupdate>	Synchronized php-1.26wmf13/cache/l10n: (no message) (duration: 06m 12s)	[production]
02:10	<LocalisationUpdate>	completed (1.26wmf13) at 2015-07-12 02:10:00+00:00	[production]
02:09	<l10nupdate>	Synchronized php-1.26wmf13/cache/l10n: (no message) (duration: 00m 34s)	[production]
2015-07-11 §
19:48	<jynus>	stopping labsdb1002 after table corruption has been detected	[production]
19:37	<urandom>	from restbase1002, starting revision culling process (node thin_out_key_rev_value_data.js `hostname -i` local_group_wikimedia_T_parsoid_html 2>&1 \| tee >(gzip -c > local_group_wikimedia_T_parsoid_html.log.`date +%s`.gz))	[production]
19:33	<urandom>	restbase: setting gc_grace_seconds to 604800 (1 week) on local_group_wikipedia_T_parsoid_html.data	[production]
04:56	<LocalisationUpdate>	ResourceLoader cache refresh completed at Sat Jul 11 04:55:56 UTC 2015 (duration 55m 55s)	[production]
04:21	<bd808>	Logstash cluster upgrade complete! Kibana working again	[production]
04:21	<bd808>	Upgraded Elasticsearch to 1.6.0 on logstash1006	[production]
04:12	<bd808>	rebooting logstash1006	[production]
04:07	<bd808>	logstash1005 fully recovered all shards	[production]
03:21	<mattflaschen>	Synchronized php-1.26wmf13/extensions/Flow/includes/Parsoid/Utils.php: Bump Flow to encode page name when sending to Parsoid (duration: 00m 13s)	[production]
02:28	<LocalisationUpdate>	completed (1.26wmf13) at 2015-07-11 02:28:18+00:00	[production]
02:25	<l10nupdate>	Synchronized php-1.26wmf13/cache/l10n: (no message) (duration: 06m 07s)	[production]
02:25	<LocalisationUpdate>	ResourceLoader cache refresh completed at Sat Jul 11 02:25:19 UTC 2015 (duration 25m 18s)	[production]
02:09	<LocalisationUpdate>	completed (1.26wmf13) at 2015-07-11 02:09:45+00:00	[production]
02:09	<l10nupdate>	Synchronized php-1.26wmf13/cache/l10n: (no message) (duration: 00m 35s)	[production]
00:46	<bd808>	Upgraded Elasticsearch to 1.6.0 on logstash1005; replicas recovering now	[production]
00:35	<bd808>	rebooting logstash1005	[production]
00:30	<bd808>	logstash1004 fully recovered all shards	[production]
2015-07-10 §
22:51	<mutante>	tendril: very short maintenance downtime	[production]
20:10	<bd808>	`service elasticsearch start` not starting on logstash1004; investigating	[production]
20:07	<bd808>	ran apt-get upgrade on logstash1004	[production]
19:52	<mutante>	adminbot - built and imported 1.7.10 into APT repo	[production]
19:43	<bd808>	rebooting logstash1004	[production]
19:40	<bd808>	Kibana seems to be broken by mixed 1.6.0/1.3.9 cluster	[production]
19:32	<bd808>	kibana not seeing indices after upgrading elasticsearch to 1.6.0; investigating	[production]
19:26	<bd808>	Upgraded logstash1003 to elasticsearch 1.6.0	[production]
19:22	<bd808>	Upgraded logstash1002 to elasticsearch 1.6.0	[production]
19:19	<bd808>	Upgraded logstash1001 to elasticsearch 1.6.0	[production]
19:10	<krenair>	Synchronized php-1.26wmf13/extensions/VisualEditor/lib/ve/src/ce/nodes/ve.ce.TableNode.js: https://gerrit.wikimedia.org/r/#/c/224122/ (duration: 00m 12s)	[production]
18:11	<gwicke>	ansible -i production restbase -a 'nodetool setcompactionthroughput 120'	[production]
18:00	<gwicke>	ansible -i production restbase -a 'nodetool setcompactionthroughput 90'	[production]
17:49	<gwicke>	rolling restart of the cassandra cluster to apply https://gerrit.wikimedia.org/r/#/c/224114/	[production]