production SAL

2401-2450 of 10000 results (58ms)

2018-05-29 §
14:13	<gehel>	comleted rolling restart of relforge for plugin upgrade - T193734	[production]
14:10	<addshore@tin>	Synchronized php-1.32.0-wmf.5/extensions/Wikibase: [[gerrit:436000\|track all wb_terms table access via statsd]] (duration: 02m 21s)	[production]
14:07	<addshore@tin>	Synchronized php-1.32.0-wmf.4/extensions/CentralNotice: [[gerrit:435817\|Convert numerical URL parameters to numbers]] for AndyRussG (was left on tin) (duration: 01m 25s)	[production]
14:03	<elukey>	swap zookeeper from conf1003 to conf1006	[production]
13:56	<XioNoX>	rolling back ns0 and ping1001 redirects - T187962	[production]
13:47	<zfilipin@tin>	Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:435693\|Create 2 extra namespaces for bdwikimedia (T195700)]] (duration: 01m 39s)	[production]
13:42	<moritzm>	upgrading remaining job runners in eqiad to hhvm-wikidiff 1.7.0	[production]
13:39	<zfilipin@tin>	Synchronized wmf-config/InitialiseSettings.php: SWAT: [[gerrit:435629\|Revert "Revert "Revert "Temp rate limit for arwiki due to mass vandalism""" (T192668)]] (duration: 01m 51s)	[production]
13:32	<volans>	restarted ircecho	[production]
13:28	<volans>	puppet run on failed hosts completed	[production]
13:25	<moritzm>	powered down mw2182 for hardware diagnosis	[production]
13:19	<gehel>	rolling restart of relforge for plugin upgrade - T193734	[production]
13:16	<volans>	running puppet on failed only hosts	[production]
13:12	<volans>	stopped ircecho temporarily	[production]
12:54	<moritzm>	installing xdg-utils security updates	[production]
11:21	<marostegui>	Restar db1125 mysql - T195595	[production]
11:14	<moritzm>	upgrading snapshot hosts to hhvm-wikidiff 1.7.0 (HHVM is unused, just for completeness)	[production]
11:08	<marostegui@tin>	Synchronized wmf-config/db-eqiad.php: Disable read only on s6 T194939 T187962 (duration: 01m 37s)	[production]
10:59	<marostegui@tin>	Synchronized wmf-config/db-eqiad.php: Enable read only on s6 T194939 T187962 (duration: 01m 35s)	[production]
10:55	<XioNoX>	Eqiad row C server move starting - T187962	[production]
10:53	<XioNoX>	Eqiad row C server move starting	[production]
10:35	<moritzm>	upgrading mw1308-mw1311 to hhvm-wikidiff 1.7.0 (HHVM bytecode cache needs to be pruned during rollout)	[production]
10:09	<mobrovac@tin>	Synchronized wmf-config/InitialiseSettings.php: Switch all jobs to EventBus file 2/2 - T190327 T195500 (duration: 01m 47s)	[production]
10:06	<mobrovac@tin>	Synchronized wmf-config/jobqueue.php: Switch all jobs to EventBus file 1/2 - T190327 T195500 (duration: 01m 39s)	[production]
10:05	<ppchelko@tin>	Finished deploy [cpjobqueue/deploy@c6dc83d]: Enable all jobs apart from exceptions for everything. T190327 (duration: 00m 58s)	[production]
10:04	<ppchelko@tin>	Started deploy [cpjobqueue/deploy@c6dc83d]: Enable all jobs apart from exceptions for everything. T190327	[production]
09:20	<XioNoX>	redirect ns0 to baham - T187962	[production]
09:16	<XioNoX>	disable ping1001 redirect - T187962	[production]
09:13	<marostegui>	Downtime s6 replicas for 4 hours - T195595	[production]
09:07	<marostegui@tin>	Synchronized wmf-config/db-eqiad.php: Depool all databases in row C - T187962 (duration: 01m 35s)	[production]
09:05	<moritzm>	upgrading labweb servers to hhvm-wikidiff 1.7.0 (HHVM bytecode cache needs to be pruned during rollout)	[production]
08:40	<jynus>	performing topology changes on s6 ahead of a possible failover	[production]
08:24	<moritzm>	upgrading remaining API servers in eqiad to hhvm-wikidiff 1.7.0 (HHVM bytecode cache needs to be pruned during rollout)	[production]
07:56	<moritzm>	upgrading mw1276-mw1290 to hhvm-wikidiff 1.7.0 (HHVM bytecode cache needs to be pruned during rollout)	[production]
07:49	<elukey>	reimage druid1002 to debian stretch	[production]
07:47	<gilles@tin>	Synchronized wmf-config/InitialiseSettings.php: T187299 Launch performance survey on ruwiki (duration: 01m 50s)	[production]
07:26	<moritzm>	upgrading remaining app servers in eqiad to hhvm-wikidiff 1.7.0 (HHVM bytecode cache needs to be pruned during rollout)	[production]
06:52	<elukey>	roll restart hadoop master daemons to pick up the new zookeeper settings	[production]
05:20	<marostegui>	Restart MySQL on db2045 (s8 codfw master) - T195598	[production]
05:13	<marostegui>	Stop MySQL on db2094 and db2095 for testing - T190704	[production]
04:12	<l10nupdate@tin>	ResourceLoader cache refresh completed at Tue May 29 04:12:10 UTC 2018 (duration 14m 32s)	[production]
03:57	<l10nupdate@tin>	scap sync-l10n completed (1.32.0-wmf.5) (duration: 14m 29s)	[production]
02:59	<l10nupdate@tin>	scap sync-l10n completed (1.32.0-wmf.4) (duration: 13m 18s)	[production]
2018-05-28 §
20:14	<twentyafterfour>	Test failures on https://gerrit.wikimedia.org/r/#/c/435825/ are preventing deployment of the fix for a critical deployment blocker (see T195514) 1.32.0-wmf.5 still blocked refs T191051	[production]
20:10	<twentyafterfour>	train still held up by test failures: https://gerrit.wikimedia.org/r/#/c/435825/	[production]
20:02	<elukey>	restart kafka on kafka1003 as attempt to solve the under-replicated partitions warning	[production]
19:22	<twentyafterfour@tin>	Synchronized php-1.32.0-wmf.5/extensions/CentralNotice/: sync wmf.5 CentralNotice for AndyRussG (duration: 01m 25s)	[production]
19:12	<elukey>	roll restart of kafka-mirror maker (main eqiad -> jumbo) on kafka-jumbo* for zookeeper conf updates	[production]
19:07	<twentyafterfour>	attempting to get the wmf.5 train back on track. Deploying a fix for T195514 (https://gerrit.wikimedia.org/r/c/435292/) to unblock T191051	[production]
18:16	<elukey>	restart kafka mirror maker on kafka1012->14 - failed after the last round of kafka restarts	[production]