production SAL

3851-3900 of 10000 results (91ms)

2023-08-24 §
13:54	<cgoubert@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-debug: apply	[production]
13:54	<cgoubert@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-debug: apply	[production]
13:54	<cgoubert@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-debug: apply	[production]
13:54	<cgoubert@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-debug: apply	[production]
13:53	<oblivian@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply	[production]
13:53	<oblivian@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-api-int: apply	[production]
13:51	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
13:50	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
13:50	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2141.codfw.wmnet with reason: Maintenance	[production]
13:50	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2141.codfw.wmnet with reason: Maintenance	[production]
13:50	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2130 (T344589)', diff saved to https://phabricator.wikimedia.org/P51331 and previous config saved to /var/cache/conftool/dbconfig/20230824-135004-ladsgroup.json	[production]
13:48	<fabfur>	enabled puppet and pybal on lvs1017 (T344587)	[production]
13:47	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs1017.eqiad.wmnet	[production]
13:46	<bblack>	cp3075: restart varnish frontend (changing malloc storage from https://gerrit.wikimedia.org/r/c/operations/puppet/+/952207/ )	[production]
13:43	<oblivian@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mw-web: apply	[production]
13:43	<fabfur@cumin1001>	START - Cookbook sre.hosts.reboot-single for host lvs1017.eqiad.wmnet	[production]
13:43	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
13:43	<oblivian@deploy1002>	helmfile [eqiad] START helmfile.d/services/mw-web: apply	[production]
13:42	<oblivian@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mw-web: apply	[production]
13:42	<oblivian@deploy1002>	helmfile [codfw] START helmfile.d/services/mw-web: apply	[production]
13:41	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1132', diff saved to https://phabricator.wikimedia.org/P51330 and previous config saved to /var/cache/conftool/dbconfig/20230824-134129-ladsgroup.json	[production]
13:40	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2168:3317', diff saved to https://phabricator.wikimedia.org/P51329 and previous config saved to /var/cache/conftool/dbconfig/20230824-134010-ladsgroup.json	[production]
13:37	<marostegui>	failover m2-master to dbproxy1023	[production]
13:34	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2130', diff saved to https://phabricator.wikimedia.org/P51328 and previous config saved to /var/cache/conftool/dbconfig/20230824-133458-ladsgroup.json	[production]
13:33	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
13:26	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1132', diff saved to https://phabricator.wikimedia.org/P51327 and previous config saved to /var/cache/conftool/dbconfig/20230824-132623-ladsgroup.json	[production]
13:25	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2168:3317 (T343718)', diff saved to https://phabricator.wikimedia.org/P51326 and previous config saved to /var/cache/conftool/dbconfig/20230824-132504-ladsgroup.json	[production]
13:23	<fabfur>	disabling puppet and pybal on lvs1017 for reboot (T344587)	[production]
13:19	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2130', diff saved to https://phabricator.wikimedia.org/P51325 and previous config saved to /var/cache/conftool/dbconfig/20230824-131952-ladsgroup.json	[production]
13:11	<jnuche@deploy1002>	Finished deploy [releng/jenkins-deploy@c579111] (releasing): (no justification provided) (duration: 00m 21s)	[production]
13:11	<jnuche@deploy1002>	Started deploy [releng/jenkins-deploy@c579111] (releasing): (no justification provided)	[production]
13:11	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1132 (T344589)', diff saved to https://phabricator.wikimedia.org/P51324 and previous config saved to /var/cache/conftool/dbconfig/20230824-131117-ladsgroup.json	[production]
13:08	<bblack>	cp3074: restart varnish frontend (changing malloc storage from https://gerrit.wikimedia.org/r/c/operations/puppet/+/952207/ )	[production]
13:05	<jnuche@deploy1002>	Finished deploy [releng/jenkins-deploy@c579111] (releasing): (no justification provided) (duration: 01m 27s)	[production]
13:05	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1132 (T344589)', diff saved to https://phabricator.wikimedia.org/P51323 and previous config saved to /var/cache/conftool/dbconfig/20230824-130519-ladsgroup.json	[production]
13:05	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1132.eqiad.wmnet with reason: Maintenance	[production]
13:05	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1132.eqiad.wmnet with reason: Maintenance	[production]
13:04	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1128 (T344589)', diff saved to https://phabricator.wikimedia.org/P51322 and previous config saved to /var/cache/conftool/dbconfig/20230824-130455-ladsgroup.json	[production]
13:04	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2130 (T344589)', diff saved to https://phabricator.wikimedia.org/P51321 and previous config saved to /var/cache/conftool/dbconfig/20230824-130446-ladsgroup.json	[production]
13:04	<fabfur>	puppet and pybal reenabled on lvs1018 (T344587)	[production]
13:04	<jnuche@deploy1002>	Started deploy [releng/jenkins-deploy@c579111] (releasing): (no justification provided)	[production]
13:04	<jiji@cumin1001>	conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=codfw	[production]
13:03	<marostegui>	failover m1-master to dbproxy1022	[production]
13:02	<jiji@cumin1001>	conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=eqiad	[production]
12:59	<sukhe>	running homer "asw1-b27-esams" commit "add doh300[34]"	[production]
12:58	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] DONE helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
12:58	<jayme@deploy1002>	helmfile [aux-k8s-eqiad] START helmfile.d/aus-k8s-eqiad-services/jaeger: apply	[production]
12:57	<fabfur@cumin1001>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs1018.eqiad.wmnet	[production]
12:56	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2130 (T344589)', diff saved to https://phabricator.wikimedia.org/P51320 and previous config saved to /var/cache/conftool/dbconfig/20230824-125607-ladsgroup.json	[production]
12:56	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2130.codfw.wmnet with reason: Maintenance	[production]