production SAL

2018-07-24
21:10	<XioNoX>	starting to see recoveries from cr1-eqsin upgrade	[production]
21:06	<XioNoX>	Install done, cr1-eqsin re-rebooting	[production]
21:00	<XioNoX>	restarting cr1-eqsin for software upgrade	[production]
20:32	<XioNoX>	depooling eqsin for cr1-eqsin software upgrade	[production]
19:09	<gehel>	resetting postgres data on maps1003 after failing replication - T200228	[production]
18:34	<mobrovac@deploy1001>	Finished deploy [eventstreams/deploy@690fdad]: Wait for the client to consume the message being sent before consuming the next one - T199813 (duration: 02m 18s)	[production]
18:32	<mobrovac@deploy1001>	Started deploy [eventstreams/deploy@690fdad]: Wait for the client to consume the message being sent before consuming the next one - T199813	[production]
17:40	<ema>	re-enable puppet on all cache nodes with alternate domains disabled T164609	[production]
17:33	<zfilipin@deploy1001>	rebuilt and synchronized wikiversions files: all wikis to 1.32.0-wmf.13	[production]
17:18	<thcipriani>	train window running long, services deploy delayed	[production]
17:18	<ema>	restart varnish-fe on cp1068 to clear "child restarted" alert T164609	[production]
17:17	<elukey>	restart eventstreams on scb2* nodes (hopefully last time before deploying the fix) to avoid mem leaks issues during the EU night	[production]
17:06	<ladsgroup@deploy1001>	Synchronized php-1.32.0-wmf.14/includes/page/PageArchive.php: [[gerrit:447636|PageArchive: Pass correct overrides to newRevisionFromArchiveRow() (T200072)]] (duration: 01m 01s)	[production]
16:58	<jynus>	finishing test on es3 hosts T199224	[production]
16:42	<ladsgroup@deploy1001>	Synchronized php-1.32.0-wmf.13/includes/page/PageArchive.php: [[gerrit:447636|PageArchive: Pass correct overrides to newRevisionFromArchiveRow() (T200072)]] (duration: 01m 03s)	[production]
16:07	<jynus>	test switchover from es2018 to es2017	[production]
16:02	<dcausse>	T156137: unbanning elastic1031	[production]
15:59	<dcausse>	T156137: restarting elasticsearch on elastic1031 to disable G1GC	[production]
15:55	<jynus>	test switchover from es2017 to es2018	[production]
15:46	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Repool db1103:3314 (duration: 01m 02s)	[production]
15:38	<jynus>	stopping puppet on es2017, es2018; changing mysql configuration for production testing	[production]
15:29	<gehel>	restart postgres on maps1001 - T200228	[production]
14:53	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Repool db1084 (duration: 01m 02s)	[production]
14:47	<dcausse>	T156137: banning elastic1031 due to high load (same "getEntryAfterMiss" symptoms)	[production]
14:09	<marostegui>	Deploy schema change on db1103:3314 T144010 T51190 T199368	[production]
13:45	<ema>	apply alternate domains patch to text-eqiad T164609	[production]
13:43	<marostegui>	Deploy schema change on db1084 T144010 T51190 T199368	[production]
13:42	<marostegui>	Stop replication in sync db1084 and db1103:3314	[production]
13:40	<marostegui>	Deploy schema change on db1081 T144010 T51190 T199368	[production]
13:38	<marostegui>	Stop replication in sync db1081 and db1103:3314	[production]
13:38	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Depool db1084 (duration: 01m 59s)	[production]
13:07	<ema>	repool cp1067 with alternate domains support T164609	[production]
12:59	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Depool db1103:3314 (duration: 00m 55s)	[production]
12:11	<gehel>	vacuum full of postgres on maps1001 to try to reclaim space - T200228	[production]
12:07	<ema>	depool cp1067 to test alternate domains patch T164609	[production]
11:58	<zfilipin@deploy1001>	scap failed: CalledProcessError Command '/usr/local/bin/mwscript rebuildLocalisationCache.php --wiki="testwiki" --outdir="/tmp/scap_l10n_4179557944" --threads=30 --lang en --quiet' returned non-zero exit status 1 (duration: 02m 50s)	[production]
11:55	<zfilipin@deploy1001>	Started scap: testwiki to php-1.32.0-wmf.14 and rebuild l10n cache	[production]
11:45	<zfilipin@deploy1001>	scap failed: CalledProcessError Command '/usr/local/bin/mwscript rebuildLocalisationCache.php --wiki="testwiki" --outdir="/tmp/scap_l10n_2212739269" --threads=30 --lang en --quiet' returned non-zero exit status 1 (duration: 03m 42s)	[production]
11:42	<zfilipin@deploy1001>	Started scap: testwiki to php-1.32.0-wmf.14 and rebuild l10n cache	[production]
11:35	<ema>	disable puppet on cp-text hosts to merge alternate domains patch T164609	[production]
11:21	<Amir1>	start of ladsgroup@mwmaint1001:~$ foreachwikiindblist s1 populateChangeTagDef.php (T193873)	[production]
11:19	<Amir1>	start of ladsgroup@mwmaint1001:~$ foreachwikiindblist s2 populateChangeTagDef.php --sleep 2 (T193873)	[production]
11:12	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Repool db1084 (duration: 00m 55s)	[production]
11:08	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Repool db1097:3314 (duration: 01m 06s)	[production]
10:44	<marostegui>	Stop replication in sync on db1084 and db1097:3314	[production]
10:44	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Depool db1084 (duration: 00m 55s)	[production]
09:17	<marostegui>	Deploy schema change on db1097:3314 T144010 T51190 T199368	[production]
09:17	<marostegui@deploy1001>	Synchronized wmf-config/db-eqiad.php: Depool db1097:3314 (duration: 00m 54s)	[production]
08:38	<ema>	restart varnish-fe on cache_text instances with cold, labeled VCL T200207	[production]
08:21	<elukey>	rolling restart of kafka jumbo/main-(eqiad|codfw) clusters to pick up the new max open files limit (infinity -> 128k)	[production]