production SAL

2024-11-05
17:41	<cdanis@deploy2002>	helmfile [eqiad] START helmfile.d/services/chart-renderer: apply	[production]
17:39	<cdanis@deploy2002>	helmfile [staging] DONE helmfile.d/services/chart-renderer: apply	[production]
17:39	<cdanis@deploy2002>	helmfile [staging] START helmfile.d/services/chart-renderer: apply	[production]
17:36	<akosiaris@deploy2002>	helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply	[production]
17:36	<akosiaris@deploy2002>	helmfile [codfw] START helmfile.d/services/rest-gateway: apply	[production]
17:34	<akosiaris@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply	[production]
17:34	<akosiaris@deploy2002>	helmfile [eqiad] START helmfile.d/services/rest-gateway: apply	[production]
17:33	<akosiaris@deploy2002>	helmfile [staging] DONE helmfile.d/services/rest-gateway: apply	[production]
17:33	<akosiaris@deploy2002>	helmfile [staging] START helmfile.d/services/rest-gateway: apply	[production]
17:32	<cdanis@deploy2002>	helmfile [staging] DONE helmfile.d/services/chart-renderer: apply	[production]
17:32	<cdanis@deploy2002>	helmfile [staging] START helmfile.d/services/chart-renderer: apply	[production]
17:28	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1028', diff saved to https://phabricator.wikimedia.org/P70945 and previous config saved to /var/cache/conftool/dbconfig/20241105-172837-ladsgroup.json	[production]
17:13	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1028 (T376905)', diff saved to https://phabricator.wikimedia.org/P70943 and previous config saved to /var/cache/conftool/dbconfig/20241105-171330-ladsgroup.json	[production]
17:06	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling es1028 (T376905)', diff saved to https://phabricator.wikimedia.org/P70942 and previous config saved to /var/cache/conftool/dbconfig/20241105-170636-ladsgroup.json	[production]
17:06	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1028.eqiad.wmnet with reason: Maintenance	[production]
17:06	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1028.eqiad.wmnet with reason: Maintenance	[production]
17:06	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1031 (T376905)', diff saved to https://phabricator.wikimedia.org/P70941 and previous config saved to /var/cache/conftool/dbconfig/20241105-170609-ladsgroup.json	[production]
16:51	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1031', diff saved to https://phabricator.wikimedia.org/P70940 and previous config saved to /var/cache/conftool/dbconfig/20241105-165103-ladsgroup.json	[production]
16:37	<lucaswerkmeister-wmde@deploy2002>	Finished scap sync-world: Backport for [[gerrit:1087507|Fixup paths to moved resources (T379080)]] (duration: 08m 02s)	[production]
16:35	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1031', diff saved to https://phabricator.wikimedia.org/P70939 and previous config saved to /var/cache/conftool/dbconfig/20241105-163556-ladsgroup.json	[production]
16:34	<cdanis@cumin1002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
16:32	<lucaswerkmeister-wmde@deploy2002>	lucaswerkmeister-wmde: Continuing with sync	[production]
16:32	<lucaswerkmeister-wmde@deploy2002>	lucaswerkmeister-wmde: Backport for [[gerrit:1087507|Fixup paths to moved resources (T379080)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
16:32	<cdanis@cumin1002>	START - Cookbook sre.dns.netbox	[production]
16:29	<lucaswerkmeister-wmde@deploy2002>	Started scap sync-world: Backport for [[gerrit:1087507|Fixup paths to moved resources (T379080)]]	[production]
16:20	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1031 (T376905)', diff saved to https://phabricator.wikimedia.org/P70938 and previous config saved to /var/cache/conftool/dbconfig/20241105-162048-ladsgroup.json	[production]
16:14	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling es1031 (T376905)', diff saved to https://phabricator.wikimedia.org/P70937 and previous config saved to /var/cache/conftool/dbconfig/20241105-161455-ladsgroup.json	[production]
16:14	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1031.eqiad.wmnet with reason: Maintenance	[production]
16:14	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on es1031.eqiad.wmnet with reason: Maintenance	[production]
16:13	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1033 (T376905)', diff saved to https://phabricator.wikimedia.org/P70936 and previous config saved to /var/cache/conftool/dbconfig/20241105-161340-ladsgroup.json	[production]
16:01	<arnaudb@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host pc1017.eqiad.wmnet with OS bookworm	[production]
16:00	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1014.eqiad.wmnet	[production]
15:58	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1033', diff saved to https://phabricator.wikimedia.org/P70935 and previous config saved to /var/cache/conftool/dbconfig/20241105-155833-ladsgroup.json	[production]
15:54	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1014.eqiad.wmnet	[production]
15:54	<jmm@cumin2002>	END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1014.eqiad.wmnet	[production]
15:54	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1014.eqiad.wmnet	[production]
15:53	<jmm@cumin2002>	END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1042.eqiad.wmnet to cluster eqiad and group B	[production]
15:51	<jmm@cumin2002>	START - Cookbook sre.ganeti.addnode for new host ganeti1042.eqiad.wmnet to cluster eqiad and group B	[production]
15:51	<jmm@cumin2002>	END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1041.eqiad.wmnet to cluster eqiad and group B	[production]
15:50	<jmm@cumin2002>	START - Cookbook sre.ganeti.addnode for new host ganeti1041.eqiad.wmnet to cluster eqiad and group B	[production]
15:48	<moritzm>	remove ganeti1013 from active ganeti nodes T378921	[production]
15:47	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1013.eqiad.wmnet	[production]
15:43	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1033', diff saved to https://phabricator.wikimedia.org/P70934 and previous config saved to /var/cache/conftool/dbconfig/20241105-154326-ladsgroup.json	[production]
15:40	<arnaudb@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage	[production]
15:37	<arnaudb@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on pc1017.eqiad.wmnet with reason: host reimage	[production]
15:32	<hashar>	Switched PCC workers to Java 17 via https://horizon.wikimedia.org/project/prefixpuppet/?tab=prefix_puppet__puppet-pcc-worker # T359795	[production]
15:28	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance es1033 (T376905)', diff saved to https://phabricator.wikimedia.org/P70933 and previous config saved to /var/cache/conftool/dbconfig/20241105-152819-ladsgroup.json	[production]
15:27	<hashar>	Switched deployment-deploy04.deployment-prep.eqiad1.wikimedia.cloud to Java 17 # T359795	[production]
15:21	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling es1033 (T376905)', diff saved to https://phabricator.wikimedia.org/P70932 and previous config saved to /var/cache/conftool/dbconfig/20241105-152139-ladsgroup.json	[production]
15:21	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on es1033.eqiad.wmnet with reason: Maintenance	[production]