4201-4250 of 10000 results (106ms)
2023-05-02 ยง
12:31 <eevans@cumin1001> START - Cookbook sre.discovery.service-route check 2 services: maintenance [production]
12:30 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:29 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
12:28 <elukey@deploy1002> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
12:24 <moritzm> installing LInux 5.10.178 on bullseye hosts [production]
12:20 <sukhe> run authdns-update to depool codfwL T334049 [production]
12:17 <Amir1> stop slave on eqiad masters of s1, x1, s8 (T334049) [production]
12:10 <jmm@puppetmaster1001> conftool action : set/pooled=no; selector: name=ldap-replica2005.wikimedia.org [production]
12:05 <Amir1> stop slave again on db1130 (eqiad master of s5) (T334049) [production]
12:03 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host backup1011 [production]
12:03 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host backup1011 [production]
11:57 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host poolcounter2003.codfw.wmnet [production]
11:53 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host poolcounter2003.codfw.wmnet [production]
11:52 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host backup1011 [production]
11:52 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host backup1011 [production]
11:52 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host backup1010 [production]
11:52 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host backup1010 [production]
11:52 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host puppetmaster1006 [production]
11:51 <Amir1> stop slave on db1130 (eqiad master of s5) (T334049) [production]
11:51 <elukey@deploy1002> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
11:50 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host puppetmaster1006 [production]
11:50 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host sretest1003 [production]
11:50 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host poolcounter2004.codfw.wmnet [production]
11:49 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on 41 hosts with reason: Row c switch maint T334049 [production]
11:49 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on 41 hosts with reason: Row c switch maint T334049 [production]
11:49 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host sretest1003 [production]
11:46 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host poolcounter2004.codfw.wmnet [production]
11:32 <jclark@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
11:30 <jclark@cumin1001> START - Cookbook sre.dns.netbox [production]
10:52 <akosiaris@cumin1001> END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all active/active services in codfw: codfw row C switches upgrade - T334049 [production]
10:47 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:914280|Set externallinks migration to read new in testwiki (T335343)]] (duration: 13m 27s) [production]
10:35 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:914280|Set externallinks migration to read new in testwiki (T335343)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet [production]
10:33 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:914280|Set externallinks migration to read new in testwiki (T335343)]] [production]
10:00 <ladsgroup@deploy1002> Finished scap: Backport for [[gerrit:912837|Remove 1024px and 1920px from pre-gen thumbsizes (T211661)]] (duration: 08m 40s) [production]
09:59 <akosiaris@cumin1001> START - Cookbook sre.discovery.datacenter depool all active/active services in codfw: codfw row C switches upgrade - T334049 [production]
09:53 <ladsgroup@deploy1002> ladsgroup: Backport for [[gerrit:912837|Remove 1024px and 1920px from pre-gen thumbsizes (T211661)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet [production]
09:51 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:912837|Remove 1024px and 1920px from pre-gen thumbsizes (T211661)]] [production]
09:21 <eoghan@cumin1001> END (ERROR) - Cookbook sre.gitlab.failover (exit_code=97) Failover of gitlab from gitlab2002.wikimedia.org to gitlab1004.wikimedia.org [production]
09:13 <ladsgroup@deploy1002> scap failed: CalledProcessError Command 'sudo -u mwbuilder /usr/local/bin/update-mediawiki-tools-release' returned non-zero exit status 1. (duration: 00m 05s) [production]
09:12 <ladsgroup@deploy1002> Started scap: Backport for [[gerrit:912837|Remove 1024px and 1920px from pre-gen thumbsizes (T211661)]] [production]
09:10 <eoghan@cumin1001> START - Cookbook sre.gitlab.failover Failover of gitlab from gitlab2002.wikimedia.org to gitlab1004.wikimedia.org [production]
08:51 <elukey@deploy1002> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
08:44 <vgutierrez> testing haproxy 2.6.12-1~bpo10+1+wmf1 in cp1077 and cp1085 - T334448 [production]
08:40 <elukey@deploy1002> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
08:28 <moritzm> updated netboot image for Bullseye 11.7 T335575 [production]
08:27 <XioNoX> stage Junos 21 on asw-c-codfw - T334049 [production]
08:07 <godog> upgrade grafana to 9.3.13 [production]
07:49 <tgr_> UTC morning deploys done [production]
07:48 <tgr@deploy1002> Finished scap: Backport for [[gerrit:910815|OAuth: Do not require approval for read-only grants on public wikis (T67750)]] (duration: 07m 39s) [production]
07:42 <tgr@deploy1002> tgr: Backport for [[gerrit:910815|OAuth: Do not require approval for read-only grants on public wikis (T67750)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]