production SAL

5901-5950 of 10000 results (45ms)

2021-07-14 §
19:26	<andrew@deploy1002>	Started deploy [horizon/deploy@156a984]: fix trove-dashboard bug	[production]
19:17	<nskaggs@cumin1001>	END (PASS) - Cookbook wmcs.wikireplicas.add_wiki (exit_code=0)	[production]
19:17	<nskaggs@cumin1001>	Added views for new wiki: dagwiki T284456	[production]
18:55	<nskaggs@cumin1001>	START - Cookbook wmcs.wikireplicas.add_wiki	[production]
18:54	<nskaggs@cumin1001>	END (ERROR) - Cookbook wmcs.wikireplicas.add_wiki (exit_code=97)	[production]
18:54	<nskaggs@cumin1001>	START - Cookbook wmcs.wikireplicas.add_wiki	[production]
18:36	<nskaggs@cumin1001>	END (PASS) - Cookbook wmcs.wikireplicas.add_wiki (exit_code=0)	[production]
18:36	<nskaggs@cumin1001>	Added views for new wiki: banwikisource T284390	[production]
18:30	<razzi@cumin1001>	END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid public cluster: Roll restart of Druid's jvm daemons. - razzi@cumin1001	[production]
18:14	<nskaggs@cumin1001>	START - Cookbook wmcs.wikireplicas.add_wiki	[production]
17:52	<jmm@cumin2002>	END (FAIL) - Cookbook sre.idm.logout (exit_code=99) Logging Muehlenhoff out of all services on: 10 hosts	[production]
17:52	<jmm@cumin2002>	START - Cookbook sre.idm.logout Logging Muehlenhoff out of all services on: 10 hosts	[production]
17:49	<jmm@cumin2002>	END (FAIL) - Cookbook sre.idm.logout (exit_code=99) Logging Muehlenhoff out of all services on: 10 hosts	[production]
17:49	<jmm@cumin2002>	START - Cookbook sre.idm.logout Logging Muehlenhoff out of all services on: 10 hosts	[production]
17:39	<razzi@cumin1001>	START - Cookbook sre.druid.roll-restart-workers for Druid public cluster: Roll restart of Druid's jvm daemons. - razzi@cumin1001	[production]
17:35	<root@cumin2002>	START - Cookbook sre.idm.logout Logging Muehlenhoff out of all services on: 1733 hosts	[production]
17:35	<dancy@deploy1002>	Synchronized php-1.37.0-wmf.14/extensions/CentralAuth/includes/specials/SpecialCentralAutoLogin.php: Backport: [[gerrit:704383\|Do not lock preferences row for a rememberpassword check (T286521)]] (duration: 01m 06s)	[production]
17:00	<dancy@deploy1002>	Synchronized php-1.37.0-wmf.12/extensions/CentralAuth/includes/specials/SpecialCentralAutoLogin.php: Backport: [[gerrit:704382\|Do not lock preferences row for a rememberpassword check (T286521)]] (duration: 01m 05s)	[production]
16:27	<root@cumin2002>	END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Muehlenhoff out of all services on: 1733 hosts	[production]
16:26	<root@cumin2002>	START - Cookbook sre.idm.logout Logging Muehlenhoff out of all services on: 1733 hosts	[production]
16:11	<dancy@deploy1002>	Synchronized php-1.37.0-wmf.12/extensions/Translate: Backport: [[gerrit:704404\|TranslationAid: Handle empty message definition (T285830)]] and [[gerrit:704405\|TranslationAid: Make sure to return successfully fetched definitions (T285830)]] (duration: 01m 09s)	[production]
16:07	<otto@deploy1002>	helmfile [codfw] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' .	[production]
15:37	<moritzm>	installing klibc security updates	[production]
15:36	<ottomata>	deploying eventgate-analytics with direct service-runner promethues support	[production]
15:34	<ryankemper>	[Elastic] Manually triggering readahead mitigation across whole fleet to prevent any further issues today: `ryankemper@cumin1001:~$ sudo cumin -b 12 'P{elastic*}' 'sudo systemctl restart elasticsearch-disable-readahead.service'` (still need to investigate why `elasticsearch-disable-readahead.timer` isn't re-firing every 30 mins as desired)	[production]
15:34	<moritzm>	installing apache security updates on otrs1001 (ticket.wikimedia.org)	[production]
15:34	<otto@deploy1002>	helmfile [eqiad] Ran 'sync' command on namespace 'eventgate-analytics' for release 'production' .	[production]
15:28	<urbanecm>	Start server-side upload of 3 large image files (T285708)	[production]
15:16	<moritzm>	installing apache security updates on lists1001 (lists.wikimedia.org)	[production]
14:51	<moritzm>	installing apache security updates on puppet masters	[production]
14:47	<jiji@cumin1001>	conftool action : set/pooled=inactive; selector: name=mw2384.codfw.wmnet	[production]
14:47	<effie>	set mw2384 as inactive to investigate mw2383 issue - T286463	[production]
14:44	<jiji@deploy1002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
14:44	<moritzm>	installing apache security updates on grafana*	[production]
14:43	<jiji@deploy1002>	helmfile [eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:43	<jiji@deploy1002>	helmfile [eqiad] START helmfile.d/admin 'apply'.	[production]
14:40	<jiji@deploy1002>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
14:40	<jiji@deploy1002>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
14:38	<dzahn@cumin1001>	conftool action : set/pooled=yes; selector: name=mw1422.eqiad.wmnet	[production]
14:33	<dcausse>	runnning elasticsearch-madvise-random ES_PID on elastic2045	[production]
14:31	<dcausse>	runnning elasticsearch-madvise-random 1022 on elastic2054	[production]
14:23	<jiji@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:19	<jiji@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
14:19	<jiji@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
14:19	<jiji@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
14:13	<elukey>	restart php-fpm on mw2370	[production]
13:43	<jmm@cumin2002>	END (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Muehlenhoff out of all services on: 1733 hosts	[production]
13:43	<jmm@cumin2002>	START - Cookbook sre.idm.logout Logging Muehlenhoff out of all services on: 1733 hosts	[production]
13:09	<kormat@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 18 hosts with reason: Deploying schema change to s1 T277118	[production]
13:09	<kormat@cumin1001>	START - Cookbook sre.hosts.downtime for 4:00:00 on 18 hosts with reason: Deploying schema change to s1 T277118	[production]