2023-06-20
09:23 <ariel@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dumpsdata1003.eqiad.wmnet with OS bullseye [production]
09:20 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
09:02 <jmm@cumin2002> END (PASS) - Cookbook sre.elasticsearch.restart-nginx (exit_code=0) rolling restart_daemons on A:elastic-eqiad [production]
08:39 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1119.eqiad.wmnet with OS bookworm [production]
08:37 <ariel@cumin1001> START - Cookbook sre.hosts.reimage for host dumpsdata1003.eqiad.wmnet with OS bullseye [production]
08:37 <jmm@cumin2002> START - Cookbook sre.elasticsearch.restart-nginx rolling restart_daemons on A:elastic-eqiad [production]
08:06 <ariel@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host dumpsdata1003.eqiad.wmnet with OS bullseye [production]
07:40 <jmm@cumin2002> END (PASS) - Cookbook sre.elasticsearch.restart-nginx (exit_code=0) rolling restart_daemons on A:elastic-codfw [production]
07:29 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1119.eqiad.wmnet with reason: host reimage [production]
07:26 <marostegui@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on db1119.eqiad.wmnet with reason: host reimage [production]
07:20 <ariel@cumin1001> START - Cookbook sre.hosts.reimage for host dumpsdata1003.eqiad.wmnet with OS bullseye [production]
07:18 <jmm@cumin2002> START - Cookbook sre.elasticsearch.restart-nginx rolling restart_daemons on A:elastic-codfw [production]
07:18 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db1119.eqiad.wmnet with OS bookworm [production]
07:14 <kartik@deploy1002> Finished scap: Backport for [[gerrit:931260|Enable Content and Section Translation for a 3rd group of 10 languages previously lacking MT (T337834)]] (duration: 10m 25s) [production]
07:07 <moritzm> installing openssl security updates on buster [production]
07:05 <kartik@deploy1002> kartik: Backport for [[gerrit:931260|Enable Content and Section Translation for a 3rd group of 10 languages previously lacking MT (T337834)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet [production]
07:04 <kartik@deploy1002> Started scap: Backport for [[gerrit:931260|Enable Content and Section Translation for a 3rd group of 10 languages previously lacking MT (T337834)]] [production]
06:34 <marostegui@cumin1001> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db1119.eqiad.wmnet with OS bookworm [production]
06:29 <marostegui@cumin1001> START - Cookbook sre.hosts.reimage for host db1119.eqiad.wmnet with OS bookworm [production]
05:33 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'email' for AS: 14860 [production]
05:33 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'email' for AS: 14860 [production]
00:14 <zabe> Deployed patch for T330968 [production]
2023-06-19
16:41 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:931078|Revert "Temporarily bring back legacy encoding in four wikis"]] (duration: 15m 19s) [production]
16:27 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:931078|Revert "Temporarily bring back legacy encoding in four wikis"]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
16:26 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:931078|Revert "Temporarily bring back legacy encoding in four wikis"]] [production]
16:22 <aikochou@deploy1002> helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
16:16 <aikochou@deploy1002> helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revertrisk' for release 'main' . [production]
16:09 <aikochou@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
15:50 <elukey@cumin1001> END (ERROR) - Cookbook sre.cassandra.roll-restart (exit_code=97) for nodes matching A:ml-cache-codfw: Applying internode-encryption: all - elukey@cumin1001 [production]
15:47 <elukey@cumin1001> START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Applying internode-encryption: all - elukey@cumin1001 [production]
15:22 <brett> Rolling reboot of codfw cache_text nodes to apply Linux update for CVE-2023-1872 - T335835 [production]
15:07 <moritzm> installing libxpm security updates [production]
15:06 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host krb1001.eqiad.wmnet [production]
15:00 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host krb1001.eqiad.wmnet [production]
14:48 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
14:47 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
14:47 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
14:47 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
14:46 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
14:46 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
14:45 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
14:45 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
14:39 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:931073|file: Make pre-gen rendering of multi-page files (pdf, ...) serial (T337649)]] (duration: 20m 07s) [production]
14:27 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
14:27 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
14:26 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
14:26 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
14:24 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/rest-gateway: apply [production]
14:23 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/rest-gateway: apply [production]
14:20 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:931073|file: Make pre-gen rendering of multi-page files (pdf, ...) serial (T337649)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]