production SAL

7301-7350 of 10000 results (67ms)

2022-05-18 §
19:34	<cmooney@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
19:30	<cmooney@cumin1001>	START - Cookbook sre.dns.netbox	[production]
19:24	<jhathaway@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mx2001.wikimedia.org with reason: exim debug log capture	[production]
19:24	<jhathaway@cumin1001>	START - Cookbook sre.hosts.downtime for 1:00:00 on mx2001.wikimedia.org with reason: exim debug log capture	[production]
19:23	<jhathaway>	capturing debug logs on mx2001.wikimedia.org	[production]
19:12	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db1163.eqiad.wmnet with reason: Maint	[production]
19:11	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 10:00:00 on db1163.eqiad.wmnet with reason: Maint	[production]
18:17	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db1150.eqiad.wmnet with reason: Maintenance	[production]
18:17	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 10:00:00 on db1150.eqiad.wmnet with reason: Maintenance	[production]
18:16	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1113:3315 (T298555)', diff saved to https://phabricator.wikimedia.org/P27967 and previous config saved to /var/cache/conftool/dbconfig/20220518-181654-ladsgroup.json	[production]
18:01	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1113:3315', diff saved to https://phabricator.wikimedia.org/P27966 and previous config saved to /var/cache/conftool/dbconfig/20220518-180149-ladsgroup.json	[production]
17:46	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1113:3315', diff saved to https://phabricator.wikimedia.org/P27965 and previous config saved to /var/cache/conftool/dbconfig/20220518-174644-ladsgroup.json	[production]
17:40	<mforns@deploy1002>	Finished deploy [airflow-dags/analytics@ad59116]: (no justification provided) (duration: 00m 07s)	[production]
17:40	<mforns@deploy1002>	Started deploy [airflow-dags/analytics@ad59116]: (no justification provided)	[production]
17:31	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1113:3315 (T298555)', diff saved to https://phabricator.wikimedia.org/P27964 and previous config saved to /var/cache/conftool/dbconfig/20220518-173139-ladsgroup.json	[production]
16:43	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1113:3315 (T298555)', diff saved to https://phabricator.wikimedia.org/P27963 and previous config saved to /var/cache/conftool/dbconfig/20220518-164256-ladsgroup.json	[production]
16:42	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db1113.eqiad.wmnet with reason: Maintenance	[production]
16:42	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 10:00:00 on db1113.eqiad.wmnet with reason: Maintenance	[production]
16:42	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1110 (T298555)', diff saved to https://phabricator.wikimedia.org/P27962 and previous config saved to /var/cache/conftool/dbconfig/20220518-164248-ladsgroup.json	[production]
16:27	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1110', diff saved to https://phabricator.wikimedia.org/P27961 and previous config saved to /var/cache/conftool/dbconfig/20220518-162743-ladsgroup.json	[production]
16:22	<razzi@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on an-tool1011.eqiad.wmnet with reason: Setting up turnilo for the first time, there will be errors	[production]
16:22	<razzi@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on an-tool1011.eqiad.wmnet with reason: Setting up turnilo for the first time, there will be errors	[production]
16:12	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1110', diff saved to https://phabricator.wikimedia.org/P27960 and previous config saved to /var/cache/conftool/dbconfig/20220518-161238-ladsgroup.json	[production]
15:57	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1110 (T298555)', diff saved to https://phabricator.wikimedia.org/P27959 and previous config saved to /var/cache/conftool/dbconfig/20220518-155733-ladsgroup.json	[production]
15:44	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
15:40	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
15:40	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
15:36	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
15:36	<Amir1>	promoted user:Ladsgroup to admin of testcommonswiki	[production]
15:32	<ladsgroup@deploy1002>	Synchronized php-1.39.0-wmf.12/extensions/CommonsMetadata/src: Backport: [[gerrit:792659\|Return early if the ParserOutput doesn't have any text (T308663)]] (duration: 00m 52s)	[production]
15:15	<mforns@deploy1002>	Finished deploy [airflow-dags/analytics@3072d55]: (no justification provided) (duration: 00m 07s)	[production]
15:15	<mforns@deploy1002>	Started deploy [airflow-dags/analytics@3072d55]: (no justification provided)	[production]
15:10	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1006.eqiad.wmnet	[production]
15:07	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1110 (T298555)', diff saved to https://phabricator.wikimedia.org/P27957 and previous config saved to /var/cache/conftool/dbconfig/20220518-150722-ladsgroup.json	[production]
15:07	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db1110.eqiad.wmnet with reason: Maintenance	[production]
15:07	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 10:00:00 on db1110.eqiad.wmnet with reason: Maintenance	[production]
15:07	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1144:3315 (T298555)', diff saved to https://phabricator.wikimedia.org/P27956 and previous config saved to /var/cache/conftool/dbconfig/20220518-150714-ladsgroup.json	[production]
15:04	<btullis@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main	[production]
15:04	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host ganeti1006.eqiad.wmnet	[production]
15:04	<vgutierrez>	rolling upgrade to HAProxy 2.4.17 in eqiad - T307444	[production]
15:03	<btullis@deploy1002>	helmfile [eqiad] START helmfile.d/services/datahub: apply on main	[production]
14:56	<btullis@deploy1002>	helmfile [codfw] DONE helmfile.d/services/datahub: sync on main	[production]
14:56	<btullis@deploy1002>	helmfile [codfw] START helmfile.d/services/datahub: apply on main	[production]
14:56	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168 (T303603)', diff saved to https://phabricator.wikimedia.org/P27955 and previous config saved to /var/cache/conftool/dbconfig/20220518-145603-ladsgroup.json	[production]
14:55	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
14:54	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
14:52	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1144:3315', diff saved to https://phabricator.wikimedia.org/P27954 and previous config saved to /var/cache/conftool/dbconfig/20220518-145208-ladsgroup.json	[production]
14:45	<jnuche@deploy1002>	rebuilt and synchronized wikiversions files: Set commonswiki to 1.39.0-wmf.12	[production]
14:40	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P27952 and previous config saved to /var/cache/conftool/dbconfig/20220518-144058-ladsgroup.json	[production]
14:39	<jnuche@deploy1002>	scap failed: average error rate on 6/8 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org for details)	[production]