production SAL

2022-05-18
15:07	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 10:00:00 on db1110.eqiad.wmnet with reason: Maintenance	[production]
15:07	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1144:3315 (T298555)', diff saved to https://phabricator.wikimedia.org/P27956 and previous config saved to /var/cache/conftool/dbconfig/20220518-150714-ladsgroup.json	[production]
15:04	<btullis@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/datahub: sync on main	[production]
15:04	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host ganeti1006.eqiad.wmnet	[production]
15:04	<vgutierrez>	rolling upgrade to HAProxy 2.4.17 in eqiad - T307444	[production]
15:03	<btullis@deploy1002>	helmfile [eqiad] START helmfile.d/services/datahub: apply on main	[production]
14:56	<btullis@deploy1002>	helmfile [codfw] DONE helmfile.d/services/datahub: sync on main	[production]
14:56	<btullis@deploy1002>	helmfile [codfw] START helmfile.d/services/datahub: apply on main	[production]
14:56	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168 (T303603)', diff saved to https://phabricator.wikimedia.org/P27955 and previous config saved to /var/cache/conftool/dbconfig/20220518-145603-ladsgroup.json	[production]
14:55	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/datahub: sync on main	[production]
14:54	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/datahub: apply on main	[production]
14:52	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1144:3315', diff saved to https://phabricator.wikimedia.org/P27954 and previous config saved to /var/cache/conftool/dbconfig/20220518-145208-ladsgroup.json	[production]
14:45	<jnuche@deploy1002>	rebuilt and synchronized wikiversions files: Set commonswiki to 1.39.0-wmf.12	[production]
14:40	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P27952 and previous config saved to /var/cache/conftool/dbconfig/20220518-144058-ladsgroup.json	[production]
14:39	<jnuche@deploy1002>	scap failed: average error rate on 6/8 canaries increased by 10x (rerun with --force to override this check, see https://logstash.wikimedia.org for details)	[production]
14:37	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1144:3315', diff saved to https://phabricator.wikimedia.org/P27951 and previous config saved to /var/cache/conftool/dbconfig/20220518-143703-ladsgroup.json	[production]
14:25	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168', diff saved to https://phabricator.wikimedia.org/P27949 and previous config saved to /var/cache/conftool/dbconfig/20220518-142553-ladsgroup.json	[production]
14:21	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1144:3315 (T298555)', diff saved to https://phabricator.wikimedia.org/P27948 and previous config saved to /var/cache/conftool/dbconfig/20220518-142158-ladsgroup.json	[production]
14:15	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
14:10	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1168 (T303603)', diff saved to https://phabricator.wikimedia.org/P27947 and previous config saved to /var/cache/conftool/dbconfig/20220518-141048-ladsgroup.json	[production]
14:10	<vgutierrez>	rolling upgrade to HAProxy 2.4.17 in esams - T307444	[production]
14:09	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
14:09	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
14:08	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1168 (T303603)', diff saved to https://phabricator.wikimedia.org/P27946 and previous config saved to /var/cache/conftool/dbconfig/20220518-140812-ladsgroup.json	[production]
14:08	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1168.eqiad.wmnet with reason: Maintenance	[production]
14:08	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1168.eqiad.wmnet with reason: Maintenance	[production]
14:08	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180 (T303603)', diff saved to https://phabricator.wikimedia.org/P27945 and previous config saved to /var/cache/conftool/dbconfig/20220518-140804-ladsgroup.json	[production]
14:02	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
13:57	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
13:52	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P27944 and previous config saved to /var/cache/conftool/dbconfig/20220518-135259-ladsgroup.json	[production]
13:51	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
13:51	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
13:44	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
13:44	<jforrester@deploy1002>	Synchronized multiversion/MWMultiVersion.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 53s)	[production]
13:43	<jforrester@deploy1002>	Synchronized wmf-config/Wikibase.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 52s)	[production]
13:42	<jforrester@deploy1002>	Synchronized w/health-check.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 52s)	[production]
13:40	<jforrester@deploy1002>	Synchronized rpc/RunJobs.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 51s)	[production]
13:40	<mvernon@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2060.codfw.wmnet with OS bullseye	[production]
13:39	<jforrester@deploy1002>	Synchronized docroot/noc/conf/highlight.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 51s)	[production]
13:39	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
13:39	<volans@cumin1001>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ns-recursor1.openstack.codfw1dev.wikimediacloud.org on all recursors	[production]
13:39	<volans@cumin1001>	START - Cookbook sre.dns.wipe-cache ns-recursor1.openstack.codfw1dev.wikimediacloud.org on all recursors	[production]
13:39	<volans@cumin1001>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) ns-recursor0.openstack.codfw1dev.wikimediacloud.org on all recursors	[production]
13:39	<volans@cumin1001>	START - Cookbook sre.dns.wipe-cache ns-recursor0.openstack.codfw1dev.wikimediacloud.org on all recursors	[production]
13:38	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
13:38	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
13:38	<jforrester@deploy1002>	Synchronized docroot/wwwportal/w/search-redirect.php: Config: [[gerrit:740304|Make use of the ?? operator in more trivial situations]] (duration: 00m 51s)	[production]
13:37	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180', diff saved to https://phabricator.wikimedia.org/P27943 and previous config saved to /var/cache/conftool/dbconfig/20220518-133753-ladsgroup.json	[production]
13:37	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
13:36	<volans@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]