production SAL

651-700 of 10000 results (72ms)

2023-08-24 §
10:32	<fabfur>	stopping pybal and rebooting lvs1019 (T344587)	[production]
10:31	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1194', diff saved to https://phabricator.wikimedia.org/P51285 and previous config saved to /var/cache/conftool/dbconfig/20230824-103153-ladsgroup.json	[production]
10:28	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2170:3312 (T344589)', diff saved to https://phabricator.wikimedia.org/P51284 and previous config saved to /var/cache/conftool/dbconfig/20230824-102848-ladsgroup.json	[production]
10:22	<jiji@cumin1001>	conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=codfw	[production]
10:22	<mvolz@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/citoid: apply	[production]
10:22	<effie>	pool kartotherian on codfw	[production]
10:21	<mvolz@deploy1002>	helmfile [eqiad] START helmfile.d/services/citoid: apply	[production]
10:21	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
10:20	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
10:19	<jiji@deploy1002>	helmfile [codfw] DONE helmfile.d/services/tegola-vector-tiles: apply	[production]
10:18	<jiji@deploy1002>	helmfile [codfw] START helmfile.d/services/tegola-vector-tiles: apply	[production]
10:17	<mvolz@deploy1002>	helmfile [codfw] DONE helmfile.d/services/citoid: apply	[production]
10:16	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1194 (T343718)', diff saved to https://phabricator.wikimedia.org/P51283 and previous config saved to /var/cache/conftool/dbconfig/20230824-101647-ladsgroup.json	[production]
10:16	<mvolz@deploy1002>	helmfile [codfw] START helmfile.d/services/citoid: apply	[production]
10:15	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2170:3311 (T344589)', diff saved to https://phabricator.wikimedia.org/P51282 and previous config saved to /var/cache/conftool/dbconfig/20230824-101527-ladsgroup.json	[production]
10:15	<effie>	Disable puppet on thanos-fe (eqiad), rollout cfssl on thanos-fe in codfw	[production]
10:14	<mvolz@deploy1002>	helmfile [staging] DONE helmfile.d/services/citoid: apply	[production]
10:14	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1194 (T343718)', diff saved to https://phabricator.wikimedia.org/P51281 and previous config saved to /var/cache/conftool/dbconfig/20230824-101437-ladsgroup.json	[production]
10:14	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1194.eqiad.wmnet with reason: Maintenance	[production]
10:14	<mvolz@deploy1002>	helmfile [staging] START helmfile.d/services/citoid: apply	[production]
10:14	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1194.eqiad.wmnet with reason: Maintenance	[production]
10:14	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1191 (T343718)', diff saved to https://phabricator.wikimedia.org/P51280 and previous config saved to /var/cache/conftool/dbconfig/20230824-101405-ladsgroup.json	[production]
10:08	<mvolz@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/zotero: apply	[production]
10:08	<mvolz@deploy1002>	helmfile [eqiad] START helmfile.d/services/zotero: apply	[production]
10:06	<mvolz@deploy1002>	helmfile [codfw] DONE helmfile.d/services/zotero: apply	[production]
10:06	<mvolz@deploy1002>	helmfile [codfw] START helmfile.d/services/zotero: apply	[production]
10:04	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host pybal-test2003.codfw.wmnet	[production]
10:03	<mvolz@deploy1002>	helmfile [staging] DONE helmfile.d/services/zotero: apply	[production]
10:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2150 (T343718)', diff saved to https://phabricator.wikimedia.org/P51279 and previous config saved to /var/cache/conftool/dbconfig/20230824-100321-ladsgroup.json	[production]
10:03	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2150.codfw.wmnet with reason: Maintenance	[production]
10:03	<mvolz@deploy1002>	helmfile [staging] START helmfile.d/services/zotero: apply	[production]
10:03	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2150.codfw.wmnet with reason: Maintenance	[production]
10:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2122 (T343718)', diff saved to https://phabricator.wikimedia.org/P51278 and previous config saved to /var/cache/conftool/dbconfig/20230824-100259-ladsgroup.json	[production]
10:02	<fabfur>	end reboot of lvs1020 (pybal service enabled) (T344587)	[production]
10:00	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.reboot-single for host pybal-test2003.codfw.wmnet	[production]
10:00	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2170:3311', diff saved to https://phabricator.wikimedia.org/P51277 and previous config saved to /var/cache/conftool/dbconfig/20230824-100021-ladsgroup.json	[production]
09:58	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P51276 and previous config saved to /var/cache/conftool/dbconfig/20230824-095858-ladsgroup.json	[production]
09:57	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1001.eqiad.wmnet	[production]
09:57	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs1020.eqiad.wmnet	[production]
09:54	<fabfur@cumin1001>	START - Cookbook sre.hosts.reboot-single for host lvs1020.eqiad.wmnet	[production]
09:53	<vgutierrez@cumin1001>	START - Cookbook sre.hosts.reboot-single for host acmechief1001.eqiad.wmnet	[production]
09:52	<fabfur>	reboot lvs1020 to apply patch (T344587)	[production]
09:51	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1225.eqiad.wmnet with reason: Maintenance	[production]
09:51	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1225.eqiad.wmnet with reason: Maintenance	[production]
09:51	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1222 (T344589)', diff saved to https://phabricator.wikimedia.org/P51275 and previous config saved to /var/cache/conftool/dbconfig/20230824-095117-ladsgroup.json	[production]
09:49	<btullis@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host karapace1002.eqiad.wmnet	[production]
09:47	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2122', diff saved to https://phabricator.wikimedia.org/P51274 and previous config saved to /var/cache/conftool/dbconfig/20230824-094753-ladsgroup.json	[production]
09:45	<vgutierrez@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test2001.codfw.wmnet	[production]
09:45	<btullis@cumin1001>	START - Cookbook sre.hosts.reboot-single for host karapace1002.eqiad.wmnet	[production]
09:45	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2170:3311', diff saved to https://phabricator.wikimedia.org/P51273 and previous config saved to /var/cache/conftool/dbconfig/20230824-094515-ladsgroup.json	[production]