production SAL

501-550 of 10000 results (85ms)

2024-06-20 §
12:11	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1048.eqiad.wmnet with reason: host reimage	[production]
12:08	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1047.eqiad.wmnet with reason: host reimage	[production]
12:06	<taavi@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1048.eqiad.wmnet with reason: host reimage	[production]
12:04	<taavi@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1047.eqiad.wmnet with reason: host reimage	[production]
11:52	<cgoubert@cumin1002>	conftool action : set/pooled=inactive; selector: name=mw2282.codfw.wmnet,cluster=kubernetes,service=kubesvc	[production]
11:48	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox	[production]
11:48	<taavi@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudvirt1048.eqiad.wmnet with OS bookworm	[production]
11:47	<taavi@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudvirt1047.eqiad.wmnet with OS bookworm	[production]
11:41	<ayounsi@cumin1002>	START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox	[production]
11:38	<XioNoX>	merge netbox-extra CR1038869 - Fix lots of CI errors	[production]
11:33	<jgiannelos@deploy1002>	Finished deploy [restbase/deploy@f867c66]: (no justification provided) (duration: 30m 12s)	[production]
11:27	<akosiaris@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/mathoid: apply	[production]
11:26	<akosiaris@deploy1002>	helmfile [eqiad] START helmfile.d/services/mathoid: apply	[production]
11:25	<akosiaris@deploy1002>	helmfile [codfw] DONE helmfile.d/services/mathoid: apply	[production]
11:25	<akosiaris@deploy1002>	helmfile [codfw] START helmfile.d/services/mathoid: apply	[production]
11:21	<akosiaris>	upgrade mathoid to 2024-06-18-233457-production T349118	[production]
11:20	<akosiaris@deploy1002>	helmfile [staging] DONE helmfile.d/services/mathoid: sync	[production]
11:20	<akosiaris@deploy1002>	helmfile [staging] START helmfile.d/services/mathoid: sync	[production]
11:03	<jgiannelos@deploy1002>	Started deploy [restbase/deploy@f867c66]: (no justification provided)	[production]
10:57	<dreamyjazz@deploy1002>	Finished scap: Backport for [[gerrit:1047942\|[testwiki] Fix assignment of 'checkuser-temporary-account' right (T367170)]] (duration: 15m 03s)	[production]
10:48	<dreamyjazz@deploy1002>	dreamyjazz: Continuing with sync	[production]
10:44	<dreamyjazz@deploy1002>	dreamyjazz: Backport for [[gerrit:1047942\|[testwiki] Fix assignment of 'checkuser-temporary-account' right (T367170)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
10:42	<dreamyjazz@deploy1002>	Started scap: Backport for [[gerrit:1047942\|[testwiki] Fix assignment of 'checkuser-temporary-account' right (T367170)]]	[production]
10:41	<Amir1>	running extensions/Echo/maintenance/removeOrphanedEvents.php --force on all wikis (T308084)	[production]
10:37	<dreamyjazz@deploy1002>	Finished scap: Backport for [[gerrit:1047931\|[testwiki] Assign 'checkuser-temporary-account' to the sysop group (T367170)]] (duration: 13m 49s)	[production]
10:33	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1046.eqiad.wmnet with OS bookworm	[production]
10:33	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1045.eqiad.wmnet with OS bookworm	[production]
10:31	<cgoubert@cumin1002>	conftool action : set/pooled=yes; selector: name=mw2321.codfw.wmnet,cluster=kubernetes,service=kubesvc	[production]
10:31	<claime>	repooling and uncordoning mw2321.codfw.wmnet - T367862	[production]
10:31	<kamila@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2002.codfw.wmnet with OS bullseye	[production]
10:30	<cgoubert@cumin1002>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2321.codfw.wmnet	[production]
10:30	<cgoubert@cumin1002>	START - Cookbook sre.hosts.remove-downtime for mw2321.codfw.wmnet	[production]
10:28	<dreamyjazz@deploy1002>	dreamyjazz: Continuing with sync	[production]
10:25	<dreamyjazz@deploy1002>	dreamyjazz: Backport for [[gerrit:1047931\|[testwiki] Assign 'checkuser-temporary-account' to the sysop group (T367170)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
10:24	<kamila@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2002.codfw.wmnet with OS bullseye	[production]
10:23	<kamila@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-ctrl2002.codfw.wmnet with OS bullseye	[production]
10:23	<dreamyjazz@deploy1002>	Started scap: Backport for [[gerrit:1047931\|[testwiki] Assign 'checkuser-temporary-account' to the sysop group (T367170)]]	[production]
10:20	<cgoubert@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2321.codfw.wmnet with reason: Test scap with host unavailable	[production]
10:20	<jiji@deploy1002>	helmfile [eqiad] DONE helmfile.d/admin 'apply'.	[production]
10:20	<cgoubert@cumin1002>	START - Cookbook sre.hosts.downtime for 1:00:00 on mw2321.codfw.wmnet with reason: Test scap with host unavailable	[production]
10:19	<jiji@deploy1002>	helmfile [eqiad] START helmfile.d/admin 'apply'.	[production]
10:18	<jayme@deploy1002>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
10:18	<jiji@deploy1002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
10:17	<jayme@deploy1002>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
10:16	<jiji@deploy1002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
10:16	<jayme@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
10:15	<cgoubert@cumin1002>	conftool action : set/pooled=inactive; selector: name=mw2321.codfw.wmnet,cluster=kubernetes,service=kubesvc	[production]
10:14	<claime>	Draining and depooling mw2321.codfw.wmnet to test 1047031 - T367862	[production]
10:14	<jayme@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
10:07	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1046.eqiad.wmnet with reason: host reimage	[production]