production SAL

3251-3300 of 10000 results (93ms)

2023-11-14 §
18:50	<eevans@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1011.eqiad.wmnet with reason: host reimage	[production]
18:46	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2122 (T348183)', diff saved to https://phabricator.wikimedia.org/P53453 and previous config saved to /var/cache/conftool/dbconfig/20231114-184637-arnaudb.json	[production]
18:42	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Depooling db2122 (T348183)', diff saved to https://phabricator.wikimedia.org/P53452 and previous config saved to /var/cache/conftool/dbconfig/20231114-184204-arnaudb.json	[production]
18:41	<arnaudb@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2122.codfw.wmnet with reason: Maintenance	[production]
18:41	<arnaudb@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2122.codfw.wmnet with reason: Maintenance	[production]
18:41	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2121 (T348183)', diff saved to https://phabricator.wikimedia.org/P53451 and previous config saved to /var/cache/conftool/dbconfig/20231114-184142-arnaudb.json	[production]
18:36	<eevans@cumin1001>	START - Cookbook sre.hosts.reimage for host aqs1011.eqiad.wmnet with OS bullseye	[production]
18:33	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1046.eqiad.wmnet with OS bookworm	[production]
18:32	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1048.eqiad.wmnet with reason: host reimage	[production]
18:27	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1048.eqiad.wmnet with reason: host reimage	[production]
18:26	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2121', diff saved to https://phabricator.wikimedia.org/P53450 and previous config saved to /var/cache/conftool/dbconfig/20231114-182636-arnaudb.json	[production]
18:22	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1047.eqiad.wmnet with reason: host reimage	[production]
18:19	<andrew@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1047.eqiad.wmnet with reason: host reimage	[production]
18:11	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2121', diff saved to https://phabricator.wikimedia.org/P53449 and previous config saved to /var/cache/conftool/dbconfig/20231114-181130-arnaudb.json	[production]
18:11	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1048.eqiad.wmnet with OS bookworm	[production]
18:04	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1047.eqiad.wmnet with OS bookworm	[production]
17:56	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2121 (T348183)', diff saved to https://phabricator.wikimedia.org/P53448 and previous config saved to /var/cache/conftool/dbconfig/20231114-175623-arnaudb.json	[production]
17:55	<hnowlan@deploy2002>	helmfile [staging] DONE helmfile.d/services/api-gateway: apply	[production]
17:54	<jbond@cumin1001>	END (FAIL) - Cookbook sre.puppet.migrate-role (exit_code=99) for role: wmcs::openstack::codfw1dev::control	[production]
17:52	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Depooling db2121 (T348183)', diff saved to https://phabricator.wikimedia.org/P53447 and previous config saved to /var/cache/conftool/dbconfig/20231114-175202-arnaudb.json	[production]
17:51	<arnaudb@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2121.codfw.wmnet with reason: Maintenance	[production]
17:51	<arnaudb@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2121.codfw.wmnet with reason: Maintenance	[production]
17:51	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2120 (T348183)', diff saved to https://phabricator.wikimedia.org/P53446 and previous config saved to /var/cache/conftool/dbconfig/20231114-175140-arnaudb.json	[production]
17:45	<hnowlan@deploy2002>	helmfile [staging] START helmfile.d/services/api-gateway: apply	[production]
17:43	<jbond@cumin1001>	START - Cookbook sre.puppet.migrate-role for role: wmcs::openstack::codfw1dev::control	[production]
17:36	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2120', diff saved to https://phabricator.wikimedia.org/P53445 and previous config saved to /var/cache/conftool/dbconfig/20231114-173634-arnaudb.json	[production]
17:21	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirt1043.eqiad.wmnet with OS bookworm	[production]
17:21	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2120', diff saved to https://phabricator.wikimedia.org/P53444 and previous config saved to /var/cache/conftool/dbconfig/20231114-172127-arnaudb.json	[production]
17:12	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1046.eqiad.wmnet with OS bookworm	[production]
17:12	<andrew@cumin1001>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirt1046.eqiad.wmnet with OS bookworm	[production]
17:06	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2120 (T348183)', diff saved to https://phabricator.wikimedia.org/P53442 and previous config saved to /var/cache/conftool/dbconfig/20231114-170621-arnaudb.json	[production]
17:03	<elukey@deploy2002>	helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: sync	[production]
17:02	<elukey@deploy2002>	helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: sync	[production]
17:02	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Depooling db2120 (T348183)', diff saved to https://phabricator.wikimedia.org/P53441 and previous config saved to /var/cache/conftool/dbconfig/20231114-170158-arnaudb.json	[production]
17:02	<arnaudb@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2120.codfw.wmnet with reason: Maintenance	[production]
17:01	<arnaudb@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2120.codfw.wmnet with reason: Maintenance	[production]
17:01	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2108 (T348183)', diff saved to https://phabricator.wikimedia.org/P53440 and previous config saved to /var/cache/conftool/dbconfig/20231114-170136-arnaudb.json	[production]
16:50	<ebernhardson@deploy2002>	Finished deploy [airflow-dags/search@0ae1184]: make cirrus index imports world readable in hdfs (duration: 00m 28s)	[production]
16:50	<ebernhardson@deploy2002>	Started deploy [airflow-dags/search@0ae1184]: make cirrus index imports world readable in hdfs	[production]
16:47	<elukey@deploy2002>	helmfile [codfw] DONE helmfile.d/services/changeprop: sync	[production]
16:47	<elukey@deploy2002>	helmfile [codfw] START helmfile.d/services/changeprop: sync	[production]
16:46	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2108', diff saved to https://phabricator.wikimedia.org/P53438 and previous config saved to /var/cache/conftool/dbconfig/20231114-164630-arnaudb.json	[production]
16:44	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1044.eqiad.wmnet with OS bookworm	[production]
16:37	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirt1046.eqiad.wmnet with OS bookworm	[production]
16:35	<ebernhardson@deploy2002>	Finished deploy [airflow-dags/search@017fbf1]: search: clean wcqs revision map (duration: 00m 29s)	[production]
16:34	<ebernhardson@deploy2002>	Started deploy [airflow-dags/search@017fbf1]: search: clean wcqs revision map	[production]
16:31	<arnaudb@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2108', diff saved to https://phabricator.wikimedia.org/P53437 and previous config saved to /var/cache/conftool/dbconfig/20231114-163123-arnaudb.json	[production]
16:30	<aokoth@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host vrts1002.eqiad.wmnet	[production]
16:26	<aokoth@cumin1001>	START - Cookbook sre.hosts.reboot-single for host vrts1002.eqiad.wmnet	[production]
16:17	<andrew@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1044.eqiad.wmnet with reason: host reimage	[production]