151-200 of 10000 results (71ms)
2023-06-20 §
12:46 <aokoth@cumin1001> START - Cookbook sre.hosts.decommission for hosts otrs1001.eqiad.wmnet [production]
12:37 <vgutierrez> depooling cp3050 - T339898 [production]
12:32 <klausman@deploy1002> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
12:32 <klausman@deploy1002> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
12:26 <klausman@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
12:25 <klausman@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
11:27 <jnuche@deploy1002> deploy aborted: (no justification provided) (duration: 01m 32s) [production]
11:26 <jnuche@deploy1002> Started deploy [releng/jenkins-deploy@0c82f2d] (releasing): (no justification provided) [production]
11:15 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:14 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
11:13 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:13 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
11:10 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:10 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply [production]
10:57 <isaranto@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
10:30 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:931306|Stop setting wgLegacyEncdoing (T128150 T128151)]] (duration: 08m 06s) [production]
10:23 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:931306|Stop setting wgLegacyEncdoing (T128150 T128151)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet [production]
10:22 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:931306|Stop setting wgLegacyEncdoing (T128150 T128151)]] [production]
10:16 <Lucas_WMDE> deployed patches for T339111 [production]
09:35 <isaranto@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
09:23 <ariel@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dumpsdata1003.eqiad.wmnet with OS bullseye [production]
09:20 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
09:02 <jmm@cumin2002> END (PASS) - Cookbook sre.elasticsearch.restart-nginx (exit_code=0) rolling restart_daemons on A:elastic-eqiad [production]
08:39 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1119.eqiad.wmnet with OS bookworm [production]
08:37 <ariel@cumin1001> START - Cookbook sre.hosts.reimage for host dumpsdata1003.eqiad.wmnet with OS bullseye [production]
08:37 <jmm@cumin2002> START - Cookbook sre.elasticsearch.restart-nginx rolling restart_daemons on A:elastic-eqiad [production]
08:06 <ariel@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dumpsdata1003.eqiad.wmnet with OS bullseye [production]
07:40 <jmm@cumin2002> END (PASS) - Cookbook sre.elasticsearch.restart-nginx (exit_code=0) rolling restart_daemons on A:elastic-codfw [production]
07:29 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1119.eqiad.wmnet with reason: host reimage [production]
07:26 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1119.eqiad.wmnet with reason: host reimage [production]
07:20 <ariel@cumin1001> START - Cookbook sre.hosts.reimage for host dumpsdata1003.eqiad.wmnet with OS bullseye [production]
07:18 <jmm@cumin2002> START - Cookbook sre.elasticsearch.restart-nginx rolling restart_daemons on A:elastic-codfw [production]
07:18 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db1119.eqiad.wmnet with OS bookworm [production]
07:14 <kartik@deploy1002> Finished scap: Backport for [[gerrit:931260|Enable Content and Section Translation for a 3rd group of 10 languages previously lacking MT (T337834)]] (duration: 10m 25s) [production]
07:07 <moritzm> installing openssl securit updates on buster [production]
07:05 <kartik@deploy1002> kartik: Backport for [[gerrit:931260|Enable Content and Section Translation for a 3rd group of 10 languages previously lacking MT (T337834)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
07:04 <kartik@deploy1002> Started scap: Backport for [[gerrit:931260|Enable Content and Section Translation for a 3rd group of 10 languages previously lacking MT (T337834)]] [production]
06:34 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1119.eqiad.wmnet with OS bookworm [production]
06:29 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db1119.eqiad.wmnet with OS bookworm [production]
05:33 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 14860 [production]
05:33 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 14860 [production]
00:14 <zabe> Deployed patch for T330968 [production]
2023-06-19 §
16:41 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:931078|Revert "Temporarily bring back legacy encoding in four wikis"]] (duration: 15m 19s) [production]
16:27 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:931078|Revert "Temporarily bring back legacy encoding in four wikis"]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
16:26 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:931078|Revert "Temporarily bring back legacy encoding in four wikis"]] [production]
16:22 <aikochou@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
16:16 <aikochou@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
16:09 <aikochou@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
15:50 <elukey@cumin1001> END (ERROR) - Cookbook sre.cassandra.roll-restart (exit_code=97) for nodes matching A:ml-cache-codfw: Applying internode-encryption: all - elukey@cumin1001 [production]
15:47 <elukey@cumin1001> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Applying internode-encryption: all - elukey@cumin1001 [production]