2023-03-28
13:47 <akosiaris> depool swift in eqiad for row B upgrade [production]
13:47 <akosiaris@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=swift-ro,name=eqiad [production]
13:47 <akosiaris@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=swift,name=eqiad [production]
13:46 <akosiaris@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: sync [production]
13:46 <ayounsi@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on 249 hosts with reason: eqiad row B upgrade [production]
13:45 <akosiaris@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: sync [production]
13:45 <akosiaris@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: sync [production]
13:44 <akosiaris@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: sync [production]
13:42 <akosiaris@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=swift,name=eqiad [production]
13:41 <akosiaris@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=swift-ro,name=eqiad [production]
13:36 <akosiaris@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
13:34 <hnowlan@puppetmaster1001> conftool action : set/pooled=true; selector: dnsdisc=thumbor,name=eqiad [production]
13:33 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=thumbor1002.eqiad.wmnet [production]
13:33 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=thumbor1001.eqiad.wmnet [production]
13:30 <akosiaris@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
13:17 <akosiaris@cumin1001> END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all active/active services in eqiad: eqiad row B switches upgrade - T330165 [production]
12:59 <XioNoX> depool eqiad for network maintenance - T330165 [production]
12:58 <akosiaris@cumin1001> START - Cookbook sre.discovery.datacenter depool all active/active services in eqiad: eqiad row B switches upgrade - T330165 [production]
12:57 <elukey@deploy2002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
12:56 <elukey@deploy2002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
12:56 <elukey@deploy2002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. [production]
12:56 <elukey@deploy2002> helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. [production]
12:44 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 108 [production]
12:44 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 108 [production]
12:43 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 108 [production]
12:43 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 108 [production]
12:38 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 108 [production]
12:38 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 108 [production]
12:36 <eoghan@cumin1001> END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host aphlict1002.eqiad.wmnet with OS bullseye [production]
12:34 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 112 [production]
12:34 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 112 [production]
12:24 <eoghan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aphlict1002.eqiad.wmnet with reason: host reimage [production]
12:21 <eoghan@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aphlict1002.eqiad.wmnet with reason: host reimage [production]
12:20 <elukey@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
12:20 <elukey@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
12:16 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 45295 [production]
12:15 <ayounsi@cumin1001> START - Cookbook sre.network.peering with action 'configure' for AS: 45295 [production]
12:09 <eoghan@cumin1001> START - Cookbook sre.ganeti.reimage for host aphlict1002.eqiad.wmnet with OS bullseye [production]
11:57 <elukey@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main1002.eqiad.wmnet with reason: stop kafka and dist-upgrade [production]
11:57 <elukey@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on kafka-main1002.eqiad.wmnet with reason: stop kafka and dist-upgrade [production]
11:56 <elukey> dist-upgrade kafka-main1002 to debian bullseye - T332013 [production]
11:51 <ladsgroup@deploy2002> Finished scap: Backport for [[gerrit:903549|api: Mark query as read-only to avoid regex on SQL (T332942)]] (duration: 18m 42s) [production]
11:47 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
11:37 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
11:34 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
11:34 <ladsgroup@deploy2002> ladsgroup: Backport for [[gerrit:903549|api: Mark query as read-only to avoid regex on SQL (T332942)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet [production]
11:32 <ladsgroup@deploy2002> Started scap: Backport for [[gerrit:903549|api: Mark query as read-only to avoid regex on SQL (T332942)]] [production]
11:24 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
11:23 <hnowlan@deploy2002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
11:22 <hnowlan@deploy2002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]