production SAL

2023-08-24
13:37	<marostegui>	failover m2-master to dbproxy1023	[production]
13:34	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2130', diff saved to https://phabricator.wikimedia.org/P51328 and previous config saved to /var/cache/conftool/dbconfig/20230824-133458-ladsgroup.json	[production]
13:33	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-eqiad-services/jaeger: apply	[production]
13:26	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1132', diff saved to https://phabricator.wikimedia.org/P51327 and previous config saved to /var/cache/conftool/dbconfig/20230824-132623-ladsgroup.json	[production]
13:25	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2168:3317 (T343718)', diff saved to https://phabricator.wikimedia.org/P51326 and previous config saved to /var/cache/conftool/dbconfig/20230824-132504-ladsgroup.json	[production]
13:23	<fabfur>	disabling puppet and pybal on lvs1017 for reboot (T344587)	[production]
13:19	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2130', diff saved to https://phabricator.wikimedia.org/P51325 and previous config saved to /var/cache/conftool/dbconfig/20230824-131952-ladsgroup.json	[production]
13:11	<jnuche@deploy1002>	Finished deploy [releng/jenkins-deploy@c579111] (releasing): (no justification provided) (duration: 00m 21s)	[production]
13:11	<jnuche@deploy1002>	Started deploy [releng/jenkins-deploy@c579111] (releasing): (no justification provided)	[production]
13:11	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1132 (T344589)', diff saved to https://phabricator.wikimedia.org/P51324 and previous config saved to /var/cache/conftool/dbconfig/20230824-131117-ladsgroup.json	[production]
13:08	<bblack>	cp3074: restart varnish frontend (changing malloc storage from https://gerrit.wikimedia.org/r/c/operations/puppet/+/952207/ )	[production]
13:05	<jnuche@deploy1002>	Finished deploy [releng/jenkins-deploy@c579111] (releasing): (no justification provided) (duration: 01m 27s)	[production]
13:05	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1132 (T344589)', diff saved to https://phabricator.wikimedia.org/P51323 and previous config saved to /var/cache/conftool/dbconfig/20230824-130519-ladsgroup.json	[production]
13:05	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1132.eqiad.wmnet with reason: Maintenance	[production]
13:05	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1132.eqiad.wmnet with reason: Maintenance	[production]
13:04	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1128 (T344589)', diff saved to https://phabricator.wikimedia.org/P51322 and previous config saved to /var/cache/conftool/dbconfig/20230824-130455-ladsgroup.json	[production]
13:04	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2130 (T344589)', diff saved to https://phabricator.wikimedia.org/P51321 and previous config saved to /var/cache/conftool/dbconfig/20230824-130446-ladsgroup.json	[production]
13:04	<fabfur>	puppet and pybal reenabled on lvs1018 (T344587)	[production]
13:04	<jnuche@deploy1002>	Started deploy [releng/jenkins-deploy@c579111] (releasing): (no justification provided)	[production]
13:04	<jiji@cumin1001>	conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=codfw	[production]
13:03	<marostegui>	failover m1-master to dbproxy1022	[production]
13:02	<jiji@cumin1001>	conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=eqiad	[production]
12:59	<sukhe>	running homer "asw1-b27-esams" commit "add doh300[34]"	[production]
12:58	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-eqiad-services/jaeger: apply	[production]
12:58	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-eqiad-services/jaeger: apply	[production]
12:57	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs1018.eqiad.wmnet	[production]
12:56	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2130 (T344589)', diff saved to https://phabricator.wikimedia.org/P51320 and previous config saved to /var/cache/conftool/dbconfig/20230824-125607-ladsgroup.json	[production]
12:56	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2130.codfw.wmnet with reason: Maintenance	[production]
12:55	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2130.codfw.wmnet with reason: Maintenance	[production]
12:55	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2116 (T344589)', diff saved to https://phabricator.wikimedia.org/P51319 and previous config saved to /var/cache/conftool/dbconfig/20230824-125542-ladsgroup.json	[production]
12:54	<fabfur@cumin1001>	START - Cookbook sre.hosts.reboot-single for host lvs1018.eqiad.wmnet	[production]
12:49	<jiji@cumin1001>	conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=eqiad	[production]
12:49	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1128', diff saved to https://phabricator.wikimedia.org/P51318 and previous config saved to /var/cache/conftool/dbconfig/20230824-124942-ladsgroup.json	[production]
12:48	<effie>	depool kartotherian in eqiad	[production]
12:48	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2168:3317 (T343718)', diff saved to https://phabricator.wikimedia.org/P51317 and previous config saved to /var/cache/conftool/dbconfig/20230824-124758-ladsgroup.json	[production]
12:47	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2168.codfw.wmnet with reason: Maintenance	[production]
12:47	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2168.codfw.wmnet with reason: Maintenance	[production]
12:47	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2159 (T343718)', diff saved to https://phabricator.wikimedia.org/P51316 and previous config saved to /var/cache/conftool/dbconfig/20230824-124737-ladsgroup.json	[production]
12:45	<btullis@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-coord1001.eqiad.wmnet	[production]
12:40	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2116', diff saved to https://phabricator.wikimedia.org/P51315 and previous config saved to /var/cache/conftool/dbconfig/20230824-124036-ladsgroup.json	[production]
12:39	<btullis@cumin1001>	START - Cookbook sre.hosts.reboot-single for host an-coord1001.eqiad.wmnet	[production]
12:35	<cgoubert@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply	[production]
12:34	<cgoubert@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-debug: apply	[production]
12:34	<cgoubert@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-debug: apply	[production]
12:34	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1128', diff saved to https://phabricator.wikimedia.org/P51314 and previous config saved to /var/cache/conftool/dbconfig/20230824-123436-ladsgroup.json	[production]
12:34	<cgoubert@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-debug: apply	[production]
12:32	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2159', diff saved to https://phabricator.wikimedia.org/P51313 and previous config saved to /var/cache/conftool/dbconfig/20230824-123231-ladsgroup.json	[production]
12:25	<fabfur>	errata corrige: not lvs1020 but lvs1018	[production]
12:25	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2116', diff saved to https://phabricator.wikimedia.org/P51312 and previous config saved to /var/cache/conftool/dbconfig/20230824-122530-ladsgroup.json	[production]
12:25	<fabfur>	disabling puppet and pybal on lvs1020 for reboot (T344587)	[production]