production SAL

2651-2700 of 10000 results (80ms)

2019-10-30 §
14:32	<effie>	disable puppet on all mw* hosts	[production]
14:20	<gehel@cumin1001>	END (FAIL) - Cookbook sre.elasticsearch.rolling-upgrade (exit_code=99)	[production]
14:19	<ema@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0)	[production]
14:15	<ema@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
14:04	<gehel@cumin1001>	END (PASS) - Cookbook sre.elasticsearch.force-shard-allocation (exit_code=0)	[production]
14:04	<gehel@cumin1001>	START - Cookbook sre.elasticsearch.force-shard-allocation	[production]
14:04	<gehel@cumin1001>	START - Cookbook sre.elasticsearch.rolling-upgrade	[production]
13:39	<andrew@deploy1001>	Finished deploy [horizon/deploy@53028ab]: Rolling out improvments to the puppet git archiver (duration: 03m 38s)	[production]
13:36	<andrew@deploy1001>	Started deploy [horizon/deploy@53028ab]: Rolling out improvments to the puppet git archiver	[production]
12:59	<cdanis@cumin1001>	conftool action : set/pooled=inactive; selector: name=cp5008.eqsin.wmnet	[production]
12:58	<moritzm>	rolling restart of slapd to pick up LDAP schema change	[production]
12:57	<cdanis@cumin1001>	conftool action : set/pooled=no; selector: name=cp5008.eqsin.wmnet	[production]
12:50	<arturo>	updating package versions in install1002 for thirdparty/kubeadm-k8s stretch-wikimedia (T236824)	[production]
12:23	<ema@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99)	[production]
12:22	<ema@cumin1001>	START - Cookbook sre.hosts.downtime	[production]
11:49	<moritzm>	temporarily disabling puppet on LDAP servers for a schema change	[production]
11:42	<ema>	depool cp5008 and reimage as text_ats T227432	[production]
11:37	<gehel@cumin2001>	END (PASS) - Cookbook sre.elasticsearch.rolling-upgrade (exit_code=0)	[production]
11:31	<mlitn@deploy1001>	Synchronized wmf-config/InitialiseSettings.php: Increase rate limits for newbie non-ip users on Commons (duration: 01m 01s)	[production]
11:13	<Urbanecm>	EU SWAT done	[production]
11:12	<Urbanecm>	Synchronized wmf-config/InitialiseSettings.php: SWAT: 61cb77c: Re-apply: MCR: Set testwiki to use the new MCR-only schema (T198558) (duration: 00m 59s)	[production]
10:07	<jynus>	restarting bacula-dir, bacula-sd on backup1001 T236406	[production]
09:46	<vgutierrez>	Switch from nginx to ats-tls on cp4029 - T231627	[production]
09:34	<vgutierrez>	Switch from nginx to ats-tls on cp4028 - T231627	[production]
09:25	<gehel@cumin1001>	END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99)	[production]
08:51	<gehel@cumin1001>	START - Cookbook sre.wdqs.data-reload	[production]
08:45	<gehel@cumin2001>	START - Cookbook sre.elasticsearch.rolling-upgrade	[production]
08:25	<moritzm>	installing php7.0 security updates	[production]
07:58	<oblivian@deploy1001>	helmfile [CODFW] Ran 'apply' command on namespace 'blubberoid' for release 'production' .	[production]
07:57	<oblivian@deploy1001>	helmfile [EQIAD] Ran 'apply' command on namespace 'blubberoid' for release 'production' .	[production]
05:58	<vgutierrez>	Rolling restart of ats-tls to get rid of leaked sockets and benefit from the lower inactivity timeout - T236458	[production]
04:24	<vgutierrez>	restarting ats-tls on cp4027 with half open disabled - T236458	[production]
03:09	<vgutierrez>	Rolling restart of prometheus-exporter-trafficserver-tls - T236458	[production]
02:40	<vgutierrez>	restarting ats-tls on cp3050 with half open disabled - T236458	[production]
00:54	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=wtp1025.eqiad.wmnet,service=parsoid-php	[production]
2019-10-29 §
23:42	<dzahn@cumin1001>	conftool action : set/pooled=no; selector: name=wtp1025.eqiad.wmnet,service=parsoid-php	[production]
23:09	<mutante>	ganeti1003 - gnt-instance remove ununpentium.wikimedia.org (T236748)	[production]
23:05	<Urbanecm>	Evening SWAT done	[production]
23:05	<Urbanecm>	Purge https://en.wikipedia.org/static/images/project-logos/atjwiki* (T236777)	[production]
23:04	<urbanecm@deploy1001>	Synchronized static/images/project-logos/: SWAT: f7b9972: Revert "Milestone lobo for atjwiki" (T236777) (duration: 01m 01s)	[production]
22:26	<dzahn@cumin1001>	END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1)	[production]
22:24	<dzahn@cumin1001>	START - Cookbook sre.hosts.decommission	[production]
22:17	<mutante>	ununpentium - shutdown Ganeti VM - running decom script, schedule icinga downtime (T236748)	[production]
22:14	<mutante>	rsynced data dump and config from ununpentium to moscovium in /srv/ before shutting down the old server (T180641)	[production]
20:43	<papaul>	rebooting cp3056 for HW check	[production]
20:19	<Trey314159>	reindexing Slovak wikis on elastic@eqiad and elastic@codfw complete (T235654)	[production]
19:42	<andrew@deploy1001>	Finished deploy [horizon/deploy@dbe892e]: (no justification provided) (duration: 03m 59s)	[production]
19:38	<andrew@deploy1001>	Started deploy [horizon/deploy@dbe892e]: (no justification provided)	[production]
19:32	<jynus>	restarting bacula-fd on install1002 T236406	[production]
19:31	<andrew@deploy1001>	Finished deploy [horizon/deploy@bab5d37]: (no justification provided) (duration: 01m 35s)	[production]