production SAL

2024-06-06
17:57	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-ctrl1001.eqiad.wmnet with OS bullseye	[production]
17:57	<kamila@cumin1002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - kamila@cumin1002"	[production]
17:56	<kamila@cumin1002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - kamila@cumin1002"	[production]
17:48	<topranks>	re-enabling pybal on lvs1017 after cable move T366361	[production]
17:31	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1247 (T364069)', diff saved to https://phabricator.wikimedia.org/P64211 and previous config saved to /var/cache/conftool/dbconfig/20240606-173121-marostegui.json	[production]
17:31	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1247.eqiad.wmnet with reason: Maintenance	[production]
17:31	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1247.eqiad.wmnet with reason: Maintenance	[production]
17:26	<cmooney@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on lvs1017.eqiad.wmnet with reason: moving lvs1017 link back to ssw1-e1-codfw	[production]
17:26	<topranks>	disabling pybal on lvs1017 to move traffic to lvs1020 in advance of cable move T366361	[production]
17:26	<cmooney@cumin1002>	START - Cookbook sre.hosts.downtime for 0:20:00 on lvs1017.eqiad.wmnet with reason: moving lvs1017 link back to ssw1-e1-codfw	[production]
17:23	<topranks>	re-enabling pybal on lvs1018 after cable move T366361	[production]
17:15	<cmooney@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on lvs1018.eqiad.wmnet with reason: moving lvs1018 link back to ssw1-e1-codfw	[production]
17:15	<cmooney@cumin1002>	START - Cookbook sre.hosts.downtime for 0:20:00 on lvs1018.eqiad.wmnet with reason: moving lvs1018 link back to ssw1-e1-codfw	[production]
17:15	<cmooney@cumin1002>	END (ERROR) - Cookbook sre.hosts.downtime (exit_code=97) for 0:20:00 on lvs1019.eqiad.wmnet with reason: moving lvs1018 link back to ssw1-e1-codfw	[production]
17:14	<cmooney@cumin1002>	START - Cookbook sre.hosts.downtime for 0:20:00 on lvs1019.eqiad.wmnet with reason: moving lvs1018 link back to ssw1-e1-codfw	[production]
17:14	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1186 (T352010)', diff saved to https://phabricator.wikimedia.org/P64210 and previous config saved to /var/cache/conftool/dbconfig/20240606-171359-ladsgroup.json	[production]
17:13	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1186.eqiad.wmnet with reason: Maintenance	[production]
17:13	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1186.eqiad.wmnet with reason: Maintenance	[production]
17:13	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1184 (T352010)', diff saved to https://phabricator.wikimedia.org/P64209 and previous config saved to /var/cache/conftool/dbconfig/20240606-171336-ladsgroup.json	[production]
17:11	<topranks>	disabling pybal on lvs1018 to move traffic to lvs1020 in advance of cable move T366361	[production]
17:11	<topranks>	re-enabling pybal on lvs1019 after cable move T366361	[production]
16:58	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1184', diff saved to https://phabricator.wikimedia.org/P64208 and previous config saved to /var/cache/conftool/dbconfig/20240606-165828-ladsgroup.json	[production]
16:52	<cmooney@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:20:00 on lvs1019.eqiad.wmnet with reason: moving lvs1019 link back to ssw1-f1-codfw	[production]
16:51	<cmooney@cumin1002>	START - Cookbook sre.hosts.downtime for 0:20:00 on lvs1019.eqiad.wmnet with reason: moving lvs1019 link back to ssw1-f1-codfw	[production]
16:50	<topranks>	disabling pybal on lvs1019 to move traffic to lvs1020 in advance of cable move T366361	[production]
16:43	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1184', diff saved to https://phabricator.wikimedia.org/P64207 and previous config saved to /var/cache/conftool/dbconfig/20240606-164320-ladsgroup.json	[production]
16:28	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1184 (T352010)', diff saved to https://phabricator.wikimedia.org/P64206 and previous config saved to /var/cache/conftool/dbconfig/20240606-162812-ladsgroup.json	[production]
16:28	<hashar@deploy1002>	Finished deploy [integration/docroot@eee90e6]: (no justification provided) (duration: 00m 05s)	[production]
16:28	<hashar@deploy1002>	Started deploy [integration/docroot@eee90e6]: (no justification provided)	[production]
16:25	<dancy@deploy1002>	Installation of scap version "4.86.1" completed for 285 hosts	[production]
16:25	<dancy@deploy1002>	Installing scap version "4.86.1" for 285 hosts	[production]
16:24	<dancy@deploy1002>	Installing scap version "4.86.1" for 286 hosts	[production]
16:13	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db2130 (T352010)', diff saved to https://phabricator.wikimedia.org/P64205 and previous config saved to /var/cache/conftool/dbconfig/20240606-161338-ladsgroup.json	[production]
16:13	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2130.codfw.wmnet with reason: Maintenance	[production]
16:13	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2130.codfw.wmnet with reason: Maintenance	[production]
16:13	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2116 (T352010)', diff saved to https://phabricator.wikimedia.org/P64204 and previous config saved to /var/cache/conftool/dbconfig/20240606-161312-ladsgroup.json	[production]
16:10	<kamila@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on wikikube-ctrl1001.eqiad.wmnet with reason: reimage still running	[production]
16:10	<kamila@cumin1002>	START - Cookbook sre.hosts.downtime for 3:00:00 on wikikube-ctrl1001.eqiad.wmnet with reason: reimage still running	[production]
16:00	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db2162 (T352010)', diff saved to https://phabricator.wikimedia.org/P64203 and previous config saved to /var/cache/conftool/dbconfig/20240606-160028-ladsgroup.json	[production]
16:00	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2162.codfw.wmnet with reason: Maintenance	[production]
16:00	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2162.codfw.wmnet with reason: Maintenance	[production]
16:00	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2161 (T352010)', diff saved to https://phabricator.wikimedia.org/P64202 and previous config saved to /var/cache/conftool/dbconfig/20240606-160004-ladsgroup.json	[production]
15:58	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2116', diff saved to https://phabricator.wikimedia.org/P64201 and previous config saved to /var/cache/conftool/dbconfig/20240606-155804-ladsgroup.json	[production]
15:44	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2161', diff saved to https://phabricator.wikimedia.org/P64199 and previous config saved to /var/cache/conftool/dbconfig/20240606-154457-ladsgroup.json	[production]
15:44	<swfrench@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/data-gateway: apply	[production]
15:42	<swfrench@deploy1002>	helmfile [eqiad] START helmfile.d/services/data-gateway: apply	[production]
15:42	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db2116', diff saved to https://phabricator.wikimedia.org/P64198 and previous config saved to /var/cache/conftool/dbconfig/20240606-154255-ladsgroup.json	[production]
15:40	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1203 (T352010)', diff saved to https://phabricator.wikimedia.org/P64197 and previous config saved to /var/cache/conftool/dbconfig/20240606-154028-ladsgroup.json	[production]
15:40	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1203.eqiad.wmnet with reason: Maintenance	[production]
15:40	<swfrench@deploy1002>	helmfile [codfw] DONE helmfile.d/services/data-gateway: apply	[production]