production SAL

5901-5950 of 10000 results (51ms)

2022-02-24 §
01:25	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2076.codfw.wmnet with reason: host reimage	[production]
01:13	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2075.codfw.wmnet with reason: host reimage	[production]
01:10	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2075.codfw.wmnet with reason: host reimage	[production]
01:08	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2076.codfw.wmnet with OS bullseye	[production]
01:01	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host elastic2075.codfw.wmnet with OS bullseye	[production]
01:01	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2075.codfw.wmnet with OS bullseye	[production]
00:59	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2074.codfw.wmnet with OS bullseye	[production]
00:53	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2075.codfw.wmnet with OS bullseye	[production]
00:51	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host elastic2073.codfw.wmnet with OS bullseye	[production]
00:49	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2074.codfw.wmnet with reason: host reimage	[production]
00:45	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2074.codfw.wmnet with reason: host reimage	[production]
00:41	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2073.codfw.wmnet with reason: host reimage	[production]
00:38	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2073.codfw.wmnet with reason: host reimage	[production]
00:28	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2074.codfw.wmnet with OS bullseye	[production]
00:23	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host elastic2079.mgmt.codfw.wmnet with reboot policy FORCED	[production]
00:21	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host elastic2073.codfw.wmnet with OS bullseye	[production]
00:06	<pt1979@cumin2002>	START - Cookbook sre.hosts.provision for host elastic2079.mgmt.codfw.wmnet with reboot policy FORCED	[production]
2022-02-23 §
23:39	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2069.codfw.wmnet with OS stretch	[production]
23:13	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2069.codfw.wmnet with reason: host reimage	[production]
23:09	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2069.codfw.wmnet with reason: host reimage	[production]
22:58	<mutante>	phabricator - disabled empty but active repo: wikidata-query-LDFServer (WQLD) created in 2018 by qchris (T296022)	[production]
22:51	<mutante>	phabricator - disabled empty but active repos: dibyaduttabook and xtools-H (T296022)	[production]
22:50	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host ms-be2069.codfw.wmnet with OS stretch	[production]
22:37	<mutante>	phabricator - disabling repository dibyaduttabook	[production]
22:09	<reedy@deploy1002>	Synchronized php-1.38.0-wmf.23/extensions/SecurePoll/cli/wm-scripts/ucoc/: (no justification provided) (duration: 00m 50s)	[production]
22:08	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
22:07	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
22:07	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
22:06	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
21:17	<sukhe@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 5 days, 0:00:00 on doh[6001-6002].wikimedia.org with reason: bird6 errors expected, not serving any traffic	[production]
21:17	<sukhe@cumin1001>	START - Cookbook sre.hosts.downtime for 5 days, 0:00:00 on doh[6001-6002].wikimedia.org with reason: bird6 errors expected, not serving any traffic	[production]
21:11	<dduvall@deploy1002>	Synchronized php: group1 wikis to 1.38.0-wmf.23 refs T300199 (duration: 01m 31s)	[production]
21:10	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
21:10	<dduvall@deploy1002>	rebuilt and synchronized wikiversions files: group1 wikis to 1.38.0-wmf.23 refs T300199	[production]
21:09	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
21:09	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
21:08	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
21:03	<mwdebug-deploy@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mwdebug: apply	[production]
21:02	<mwdebug-deploy@deploy1002>	helmfile [codfw] START helmfile.d/services/mwdebug: apply	[production]
21:02	<mwdebug-deploy@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply	[production]
21:01	<mwdebug-deploy@deploy1002>	helmfile [eqiad] START helmfile.d/services/mwdebug: apply	[production]
20:44	<taavi>	run CentralAuthUser::importLocalNames for FuzzyBot T302399	[production]
19:42	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158 (T302363)', diff saved to https://phabricator.wikimedia.org/P21414 and previous config saved to /var/cache/conftool/dbconfig/20220223-194254-ladsgroup.json	[production]
19:35	<dancy@deploy1002>	scap failed: CalledProcessError Command 'sudo -u mwbuilder /usr/bin/make -C /srv/mwbuilder/release/make-container-image -f Makefile build-and-push-all-images GIT_BASE=https://gerrit.wikimedia.org/r/ BRANCH=master workdir_volume=/srv/mediawiki-staging mv_image_name=docker-registry.discovery.wmnet/restricted/mediawiki-multiversion webserver_image_name=docker-registry.discovery.wmnet/restricted/mediawik	[production]
19:35	<dancy@deploy1002>	Started scap: testing scap container image building	[production]
19:33	<dancy@deploy1002>	scap failed: CalledProcessError Command 'make -f Makefile build-and-push-all-images GIT_BASE=https://gerrit.wikimedia.org/r/ BRANCH=master workdir_volume=/srv/mediawiki-staging mv_image_name=docker-registry.discovery.wmnet/restricted/mediawiki-multiversion webserver_image_name=docker-registry.discovery.wmnet/restricted/mediawiki-webserver' returned non-zero exit status 2. (duration: 00m 03s)	[production]
19:33	<dancy@deploy1002>	Started scap: testing scap container image building	[production]
19:32	<dancy@deploy1002>	Started scap: testing scap container image building	[production]
19:27	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1158', diff saved to https://phabricator.wikimedia.org/P21413 and previous config saved to /var/cache/conftool/dbconfig/20220223-192749-ladsgroup.json	[production]
19:27	<dancy@deploy1002>	scap failed: CalledProcessError Command 'make -f Makefile build-and-push-all-images GIT_BASE=https://gerrit.wikimedia.org/r/ BRANCH=master workdir_volume=/srv/mediawiki-staging mv_image_name=docker-registry.discovery.wmnet/restricted/mediawiki-multiversion webserver_image_name=docker-registry.discovery.wmnet/restricted/mediawiki-webserver' returned non-zero exit status 2. (duration: 00m 51s)	[production]