4301-4350 of 10000 results (91ms)
2023-05-02 ยง
14:07 <akosiaris@deploy1002> helmfile [staging] START helmfile.d/services/machinetranslation: apply [production]
13:51 <sukhe> run authdns-update to repool codfw [production]
13:47 <cgoubert@cumin1001> conftool action : set/pooled=yes; selector: dc=codfw,cluster=parsoid [production]
13:47 <cgoubert@cumin1001> conftool action : set/pooled=yes; selector: dc=codfw,cluster=appserver [production]
13:47 <cgoubert@cumin1001> conftool action : set/pooled=yes; selector: dc=codfw,cluster=api_appserver [production]
13:45 <akosiaris@deploy1002> helmfile [staging] DONE helmfile.d/services/machinetranslation: apply [production]
13:37 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:914267|Enable Kartographer Nearby on mobile (T333137)]], [[gerrit:914286|Fix clearing wrong container when closing fullscreen map (T335648)]], [[gerrit:914287|Fix clearing wrong container when closing fullscreen map (T335648)]] (duration: 14m 54s) [production]
13:25 <akosiaris@deploy1002> helmfile [staging] START helmfile.d/services/machinetranslation: apply [production]
13:24 <jmm@puppetmaster1001> conftool action : set/pooled=yes; selector: name=ldap-replica2005.wikimedia.org [production]
13:24 <urbanecm@deploy1002> wmde-fisch and urbanecm: Backport for [[gerrit:914267|Enable Kartographer Nearby on mobile (T333137)]], [[gerrit:914286|Fix clearing wrong container when closing fullscreen map (T335648)]], [[gerrit:914287|Fix clearing wrong container when closing fullscreen map (T335648)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet [production]
13:22 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:914267|Enable Kartographer Nearby on mobile (T333137)]], [[gerrit:914286|Fix clearing wrong container when closing fullscreen map (T335648)]], [[gerrit:914287|Fix clearing wrong container when closing fullscreen map (T335648)]] [production]
13:16 <urbanecm@deploy1002> Sync cancelled. [production]
13:16 <urbanecm@deploy1002> urbanecm and wmde-fisch: Backport for [[gerrit:914267|Enable Kartographer Nearby on mobile (T333137)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
13:07 <urbanecm@deploy1002> Started scap: Backport for [[gerrit:914267|Enable Kartographer Nearby on mobile (T333137)]] [production]
13:05 <XioNoX> rebooting asw-c-codfw for software upgrade - T334049 [production]
13:03 <ayounsi@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 185 hosts with reason: codfw row C upgrade [production]
13:01 <ayounsi@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on 185 hosts with reason: codfw row C upgrade [production]
12:54 <ayounsi@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on 186 hosts with reason: codfw row C upgrade [production]
12:54 <ayounsi@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on 186 hosts with reason: codfw row C upgrade [production]
12:31 <eevans@cumin1001> END (PASS) - Cookbook sre.discovery.service-route (exit_code=0) check 2 services: maintenance [production]
12:31 <eevans@cumin1001> START - Cookbook sre.discovery.service-route check 2 services: maintenance [production]
12:30 <cmooney@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
12:29 <cmooney@cumin1001> START - Cookbook sre.dns.netbox [production]
12:28 <elukey@deploy1002> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
12:24 <moritzm> installing LInux 5.10.178 on bullseye hosts [production]
12:20 <sukhe> run authdns-update to depool codfwL T334049 [production]
12:17 <Amir1> stop slave on eqiad masters of s1, x1, s8 (T334049) [production]
12:10 <jmm@puppetmaster1001> conftool action : set/pooled=no; selector: name=ldap-replica2005.wikimedia.org [production]
12:05 <Amir1> stop slave again on db1130 (eqiad master of s5) (T334049) [production]
12:03 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host backup1011 [production]
12:03 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host backup1011 [production]
11:57 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host poolcounter2003.codfw.wmnet [production]
11:53 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host poolcounter2003.codfw.wmnet [production]
11:52 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host backup1011 [production]
11:52 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host backup1011 [production]
11:52 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host backup1010 [production]
11:52 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host backup1010 [production]
11:52 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host puppetmaster1006 [production]
11:51 <Amir1> stop slave on db1130 (eqiad master of s5) (T334049) [production]
11:51 <elukey@deploy1002> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
11:50 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host puppetmaster1006 [production]
11:50 <jclark@cumin1001> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host sretest1003 [production]
11:50 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host poolcounter2004.codfw.wmnet [production]
11:49 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on 41 hosts with reason: Row c switch maint T334049 [production]
11:49 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 6:00:00 on 41 hosts with reason: Row c switch maint T334049 [production]
11:49 <jclark@cumin1001> START - Cookbook sre.network.configure-switch-interfaces for host sretest1003 [production]
11:46 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host poolcounter2004.codfw.wmnet [production]
11:32 <jclark@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
11:30 <jclark@cumin1001> START - Cookbook sre.dns.netbox [production]
10:52 <akosiaris@cumin1001> END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all active/active services in codfw: codfw row C switches upgrade - T334049 [production]