production SAL

1151-1200 of 10000 results (75ms)

2024-06-20 §
10:31	<kamila@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2002.codfw.wmnet with OS bullseye	[production]
10:30	<cgoubert@cumin1002>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw2321.codfw.wmnet	[production]
10:30	<cgoubert@cumin1002>	START - Cookbook sre.hosts.remove-downtime for mw2321.codfw.wmnet	[production]
10:28	<dreamyjazz@deploy1002>	dreamyjazz: Continuing with sync	[production]
10:25	<dreamyjazz@deploy1002>	dreamyjazz: Backport for [[gerrit:1047931\|[testwiki] Assign 'checkuser-temporary-account' to the sysop group (T367170)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
10:24	<kamila@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2002.codfw.wmnet with OS bullseye	[production]
10:23	<kamila@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-ctrl2002.codfw.wmnet with OS bullseye	[production]
10:23	<dreamyjazz@deploy1002>	Started scap: Backport for [[gerrit:1047931\|[testwiki] Assign 'checkuser-temporary-account' to the sysop group (T367170)]]	[production]
10:20	<cgoubert@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on mw2321.codfw.wmnet with reason: Test scap with host unavailable	[production]
10:20	<jiji@deploy1002>	helmfile [eqiad] DONE helmfile.d/admin 'apply'.	[production]
10:20	<cgoubert@cumin1002>	START - Cookbook sre.hosts.downtime for 1:00:00 on mw2321.codfw.wmnet with reason: Test scap with host unavailable	[production]
10:19	<jiji@deploy1002>	helmfile [eqiad] START helmfile.d/admin 'apply'.	[production]
10:18	<jayme@deploy1002>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
10:18	<jiji@deploy1002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
10:17	<jayme@deploy1002>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
10:16	<jiji@deploy1002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
10:16	<jayme@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
10:15	<cgoubert@cumin1002>	conftool action : set/pooled=inactive; selector: name=mw2321.codfw.wmnet,cluster=kubernetes,service=kubesvc	[production]
10:14	<claime>	Draining and depooling mw2321.codfw.wmnet to test 1047031 - T367862	[production]
10:14	<jayme@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
10:07	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1046.eqiad.wmnet with reason: host reimage	[production]
10:04	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1045.eqiad.wmnet with reason: host reimage	[production]
10:04	<claime>	Running puppet on A:wikikube-worker	[production]
10:02	<taavi@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1046.eqiad.wmnet with reason: host reimage	[production]
10:01	<taavi@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1045.eqiad.wmnet with reason: host reimage	[production]
10:00	<hnowlan@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/shellbox-video: apply	[production]
10:00	<hnowlan@deploy1002>	helmfile [eqiad] START helmfile.d/services/shellbox-video: apply	[production]
09:51	<hnowlan@deploy1002>	helmfile [codfw] DONE helmfile.d/services/shellbox-video: apply	[production]
09:51	<hnowlan@deploy1002>	helmfile [codfw] START helmfile.d/services/shellbox-video: apply	[production]
09:50	<hnowlan@deploy1002>	helmfile [staging] DONE helmfile.d/services/shellbox-video: apply	[production]
09:49	<hnowlan@deploy1002>	helmfile [staging] START helmfile.d/services/shellbox-video: apply	[production]
09:47	<jayme@deploy1002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
09:45	<taavi@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudvirt1046.eqiad.wmnet with OS bookworm	[production]
09:45	<taavi@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudvirt1045.eqiad.wmnet with OS bookworm	[production]
09:45	<jayme@deploy1002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
09:16	<zabe>	zabe@mwmaint1002:~$ mwscript createAndPromote.php sysop_plwiki AramilFeraxa REDACTED --bureaucrat --sysop # T361041	[production]
08:57	<kamila@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2001.codfw.wmnet with OS bullseye	[production]
08:51	<kamila@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-ctrl2001.codfw.wmnet with OS bullseye	[production]
08:51	<cmooney@cumin1002>	END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin1002.eqiad.wmnet with reason: Release v0.6.6 - cmooney@cumin1002	[production]
08:50	<kamila@cumin1002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2001.codfw.wmnet with OS bullseye	[production]
08:49	<cmooney@cumin1002>	START - Cookbook sre.deploy.python-code homer to cumin2002.codfw.wmnet,cumin1002.eqiad.wmnet with reason: Release v0.6.6 - cmooney@cumin1002	[production]
08:36	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1044.eqiad.wmnet with OS bookworm	[production]
08:33	<kamila@cumin1002>	START - Cookbook sre.hosts.reimage for host wikikube-ctrl2001.codfw.wmnet with OS bullseye	[production]
08:23	<kamila@cumin1002>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-ctrl2001.codfw.wmnet with OS bullseye	[production]
08:16	<jnuche@deploy1002>	rebuilt and synchronized wikiversions files: group2 wikis to 1.43.0-wmf.10 refs T361404	[production]
08:10	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host irc1001.wikimedia.org	[production]
08:08	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host irc1001.wikimedia.org	[production]
08:08	<moritzm>	reboot of irc1001 to nudge clients to re-connect to the new bullseye host T331702	[production]
08:06	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1044.eqiad.wmnet with reason: host reimage	[production]
08:03	<taavi@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1044.eqiad.wmnet with reason: host reimage	[production]