2023-02-20
16:28 <nfraison@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1001.eqiad.wmnet with OS bullseye [production]
16:25 <volans@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:05:00 on cumin2002.codfw.wmnet with reason: test spicerack v6.2.1 [production]
16:25 <volans@cumin2002> START - Cookbook sre.hosts.downtime for 0:05:00 on cumin2002.codfw.wmnet with reason: test spicerack v6.2.1 [production]
16:20 <volans> uploaded spicerack_6.2.1 to apt.wikimedia.org bullseye-wikimedia [production]
16:18 <elukey@deploy1002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
16:09 <nfraison@cumin1001> START - Cookbook sre.hosts.reimage for host an-presto1001.eqiad.wmnet with OS bullseye [production]
16:08 <elukey@deploy1002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
15:57 <akosiaris@deploy1002> helmfile [staging] DONE helmfile.d/services/mathoid: sync [production]
15:57 <akosiaris@deploy1002> helmfile [staging] START helmfile.d/services/mathoid: sync [production]
15:54 <nfraison@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1001.eqiad.wmnet with OS bullseye [production]
15:53 <elukey@deploy1002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
15:43 <elukey@deploy1002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
15:34 <elukey@deploy1002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
15:26 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ml-staging2001.codfw.wmnet [production]
15:24 <elukey@deploy1002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
15:18 <elukey@deploy1002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
15:16 <elukey@cumin1001> START - Cookbook sre.hosts.reboot-single for host ml-staging2001.codfw.wmnet [production]
15:13 <TheresNoTime> closing UTC afternoon backport window [production]
15:13 <samtar@deploy1002> Finished scap: Backport for [[gerrit:890447|PageAssessments.i18n.alias.php: Fix spelling mistake (T328224)]] (duration: 22m 03s) [production]
15:08 <elukey@deploy1002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
15:04 <samtar@deploy1002> samtar: Backport for [[gerrit:890447|PageAssessments.i18n.alias.php: Fix spelling mistake (T328224)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]
15:03 <TheresNoTime> UTC afternoon backport window overrunning [production]
14:58 <nfraison@cumin1001> START - Cookbook sre.hosts.reimage for host an-presto1001.eqiad.wmnet with OS bullseye [production]
14:51 <nfraison@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1001.eqiad.wmnet with OS bullseye [production]
14:51 <samtar@deploy1002> Started scap: Backport for [[gerrit:890447|PageAssessments.i18n.alias.php: Fix spelling mistake (T328224)]] [production]
14:38 <lucaswerkmeister-wmde@deploy1002> Finished scap: Backport for [[gerrit:885422|Enable WIP Wikibase REST API routes on beta wikidata (T326313)]] (duration: 08m 12s) [production]
14:31 <lucaswerkmeister-wmde@deploy1002> lucaswerkmeister-wmde and ollieshotton: Backport for [[gerrit:885422|Enable WIP Wikibase REST API routes on beta wikidata (T326313)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]
14:30 <lucaswerkmeister-wmde@deploy1002> Started scap: Backport for [[gerrit:885422|Enable WIP Wikibase REST API routes on beta wikidata (T326313)]] [production]
14:25 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc-gp2003.codfw.wmnet with OS bullseye [production]
14:20 <samtar@deploy1002> Finished scap: Backport for [[gerrit:890387|Remove unused $wgLexemeEnableNewAlpha (T307866)]] (duration: 07m 44s) [production]
14:14 <samtar@deploy1002> lucaswerkmeister-wmde and samtar: Backport for [[gerrit:890387|Remove unused $wgLexemeEnableNewAlpha (T307866)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
14:12 <samtar@deploy1002> Started scap: Backport for [[gerrit:890387|Remove unused $wgLexemeEnableNewAlpha (T307866)]] [production]
14:10 <samtar@deploy1002> Finished scap: Backport for [[gerrit:890187|zhwiki(books|quote): Enable block feature for AbuseFilter (T330026)]] (duration: 09m 00s) [production]
14:08 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc-gp2003.codfw.wmnet with reason: host reimage [production]
14:05 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc-gp2003.codfw.wmnet with reason: host reimage [production]
14:03 <samtar@deploy1002> samtar and stang: Backport for [[gerrit:890187|zhwiki(books|quote): Enable block feature for AbuseFilter (T330026)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
14:01 <samtar@deploy1002> Started scap: Backport for [[gerrit:890187|zhwiki(books|quote): Enable block feature for AbuseFilter (T330026)]] [production]
13:55 <nfraison@cumin1001> START - Cookbook sre.hosts.reimage for host an-presto1001.eqiad.wmnet with OS bullseye [production]
13:51 <nfraison@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-presto1001.eqiad.wmnet with OS bullseye [production]
13:50 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host mc-gp2003.codfw.wmnet with OS bullseye [production]
13:12 <nfraison@cumin1001> START - Cookbook sre.hosts.reimage for host an-presto1001.eqiad.wmnet with OS bullseye [production]
13:06 <jbond> switch netbox to active/passive (had issues with active/active config) [production]
12:50 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc-gp2002.codfw.wmnet with OS bullseye [production]
12:49 <jbond> switch netbox to active/active [production]
12:33 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc-gp2002.codfw.wmnet with reason: host reimage [production]
12:30 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc-gp2002.codfw.wmnet with reason: host reimage [production]
12:19 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on 32 hosts with reason: In setup [production]
12:19 <cgoubert@cumin1001> START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on 32 hosts with reason: In setup [production]
12:15 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host mc-gp2002.codfw.wmnet with OS bullseye [production]
12:12 <moritzm> installing Java 8 security updates on Bullseye [production]