production SAL

2251-2300 of 10000 results (92ms)

2023-12-08 §
00:15	<jclark@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1035.eqiad.wmnet with reason: host reimage	[production]
00:15	<jclark@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on ganeti1038.eqiad.wmnet with reason: host reimage	[production]
00:01	<jclark@cumin1001>	START - Cookbook sre.hosts.reimage for host ganeti1038.eqiad.wmnet with OS bullseye	[production]
00:00	<jclark@cumin1001>	START - Cookbook sre.hosts.reimage for host ganeti1037.eqiad.wmnet with OS bullseye	[production]
00:00	<jclark@cumin1001>	START - Cookbook sre.hosts.reimage for host ganeti1036.eqiad.wmnet with OS bullseye	[production]
00:00	<jclark@cumin1001>	START - Cookbook sre.hosts.reimage for host ganeti1035.eqiad.wmnet with OS bullseye	[production]
2023-12-07 §
23:53	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1201 (T343198)', diff saved to https://phabricator.wikimedia.org/P54298 and previous config saved to /var/cache/conftool/dbconfig/20231207-235333-ladsgroup.json	[production]
23:53	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance	[production]
23:53	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1201.eqiad.wmnet with reason: Maintenance	[production]
23:53	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T343198)', diff saved to https://phabricator.wikimedia.org/P54297 and previous config saved to /var/cache/conftool/dbconfig/20231207-235310-ladsgroup.json	[production]
23:52	<jclark@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1061.eqiad.wmnet with OS bullseye	[production]
23:52	<jclark@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1062.eqiad.wmnet with OS bullseye	[production]
23:52	<jclark@cumin1001>	END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001"	[production]
23:52	<jclark@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1059.eqiad.wmnet with OS bullseye	[production]
23:52	<jclark@cumin1001>	END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001"	[production]
23:52	<jclark@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kubernetes1060.eqiad.wmnet with OS bullseye	[production]
23:52	<jclark@cumin1001>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001"	[production]
23:52	<jclark@cumin1001>	END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001"	[production]
23:47	<ebernhardson@deploy2002>	helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
23:47	<ebernhardson@deploy2002>	helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
23:38	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P54296 and previous config saved to /var/cache/conftool/dbconfig/20231207-233802-ladsgroup.json	[production]
23:23	<ebernhardson@deploy2002>	helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
23:23	<ebernhardson@deploy2002>	helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
23:23	<ryankemper@deploy2002>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
23:22	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187', diff saved to https://phabricator.wikimedia.org/P54295 and previous config saved to /var/cache/conftool/dbconfig/20231207-232256-ladsgroup.json	[production]
23:21	<ryankemper@deploy2002>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
23:21	<ryankemper@deploy2002>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
23:21	<ryankemper@deploy2002>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
23:17	<ryankemper@deploy2002>	helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
23:15	<ryankemper@deploy2002>	helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
23:07	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1187 (T343198)', diff saved to https://phabricator.wikimedia.org/P54294 and previous config saved to /var/cache/conftool/dbconfig/20231207-230749-ladsgroup.json	[production]
23:05	<jclark@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001"	[production]
22:58	<jclark@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001"	[production]
22:55	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hosts.dhcp (exit_code=99) for host cp4037.ulsfo.wmnet	[production]
22:53	<jclark@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001"	[production]
22:48	<jclark@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001"	[production]
22:38	<jclark@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1061.eqiad.wmnet with reason: host reimage	[production]
22:35	<jclark@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1060.eqiad.wmnet with reason: host reimage	[production]
22:35	<jclark@cumin1001>	END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on kubernetes1062.eqiad.wmnet with reason: host reimage	[production]
22:33	<jclark@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubernetes1059.eqiad.wmnet with reason: host reimage	[production]
22:31	<jclark@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1061.eqiad.wmnet with reason: host reimage	[production]
22:30	<jclark@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1062.eqiad.wmnet with reason: host reimage	[production]
22:30	<jclark@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1060.eqiad.wmnet with reason: host reimage	[production]
22:29	<jclark@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on kubernetes1059.eqiad.wmnet with reason: host reimage	[production]
22:26	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1187 (T343198)', diff saved to https://phabricator.wikimedia.org/P54293 and previous config saved to /var/cache/conftool/dbconfig/20231207-222656-ladsgroup.json	[production]
22:26	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance	[production]
22:26	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1187.eqiad.wmnet with reason: Maintenance	[production]
22:26	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1180 (T343198)', diff saved to https://phabricator.wikimedia.org/P54292 and previous config saved to /var/cache/conftool/dbconfig/20231207-222633-ladsgroup.json	[production]
22:22	<ebernhardson@deploy2002>	helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
22:22	<ebernhardson@deploy2002>	helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply	[production]