2023-03-28
14:48 <hnowlan@puppetmaster1001> conftool action : set/weight=10; selector: service=thumbor,name=kubernetes201[0123].codfw.wmnet [production]
14:46 <hnowlan@puppetmaster1001> conftool action : set/weight=8; selector: service=thumbor,name=kubernetes201[0123].codfw.wmnet [production]
14:40 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=thumbor100[12].eqiad.wmnet [production]
14:38 <hnowlan@puppetmaster1001> conftool action : set/weight=6; selector: service=thumbor,name=kubernetes201[0123].codfw.wmnet [production]
14:32 <akosiaris@cumin1001> START - Cookbook sre.discovery.datacenter pool all active/active services in eqiad: eqiad row B switches upgrade done - T330165 [production]
14:31 <sukhe> run authdns-update to revert eqiad depool [production]
14:25 <filippo@cumin1001> conftool action : set/pooled=no; selector: name=thanos-fe1002.eqiad.wmnet,service=thanos-web [production]
14:25 <filippo@cumin1001> conftool action : set/pooled=no; selector: name=THANOS-FE-OLD-FQDN,service=thanos-web [production]
14:05 <XioNoX> reboot eqiad row B for upgrade - T330165 [production]
13:58 <godog> depool thanos-fe1002 - T330165 [production]
13:54 <Emperor> depool ms-fe1010 before switch work T330165 [production]
13:53 <hnowlan@puppetmaster1001> conftool action : set/weight=5; selector: service=thumbor,name=kubernetes201[0123].codfw.wmnet [production]
13:49 <ayounsi@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 249 hosts with reason: eqiad row B upgrade [production]
13:48 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes:weight=4; selector: service=thumbor,name=kubernetes201[0123].codfw.wmnet [production]
13:47 <akosiaris> depool swift in eqiad for row B upgrade [production]
13:47 <akosiaris@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=swift-ro,name=eqiad [production]
13:47 <akosiaris@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=swift,name=eqiad [production]
13:46 <akosiaris@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: sync [production]
13:46 <ayounsi@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on 249 hosts with reason: eqiad row B upgrade [production]
13:45 <akosiaris@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: sync [production]
13:45 <akosiaris@deploy2002> helmfile [codfw] DONE helmfile.d/services/thumbor: sync [production]
13:44 <akosiaris@deploy2002> helmfile [codfw] START helmfile.d/services/thumbor: sync [production]
13:42 <akosiaris@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=swift,name=eqiad [production]
13:41 <akosiaris@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=swift-ro,name=eqiad [production]
13:36 <akosiaris@deploy2002> helmfile [eqiad] DONE helmfile.d/services/thumbor: apply [production]
13:34 <hnowlan@puppetmaster1001> conftool action : set/pooled=true; selector: dnsdisc=thumbor,name=eqiad [production]
13:33 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=thumbor1002.eqiad.wmnet [production]
13:33 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=thumbor1001.eqiad.wmnet [production]
13:30 <akosiaris@deploy2002> helmfile [eqiad] START helmfile.d/services/thumbor: apply [production]
13:17 <akosiaris@cumin1001> END (PASS) - Cookbook sre.discovery.datacenter (exit_code=0) depool all active/active services in eqiad: eqiad row B switches upgrade - T330165 [production]
12:59 <XioNoX> depool eqiad for network maintenance - T330165 [production]
12:58 <akosiaris@cumin1001> START - Cookbook sre.discovery.datacenter depool all active/active services in eqiad: eqiad row B switches upgrade - T330165 [production]
12:57 <elukey@deploy2002> helmfile [ml-serve-eqiad] DONE helmfile.d/admin 'sync'. [production]
12:56 <elukey@deploy2002> helmfile [ml-serve-eqiad] START helmfile.d/admin 'sync'. [production]
12:56 <elukey@deploy2002> helmfile [ml-serve-codfw] DONE helmfile.d/admin 'sync'. [production]
12:56 <elukey@deploy2002> helmfile [ml-serve-codfw] START helmfile.d/admin 'sync'. [production]
12:44 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 108 [production]
12:44 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 108 [production]
12:43 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 108 [production]
12:43 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 108 [production]
12:38 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 108 [production]
12:38 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 108 [production]
12:36 <eoghan@cumin1001> END (PASS) - Cookbook sre.ganeti.reimage (exit_code=0) for host aphlict1002.eqiad.wmnet with OS bullseye [production]
12:34 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.debug (exit_code=0) for Netbox circuit ID 112 [production]
12:34 <ayounsi@cumin1001> START - Cookbook sre.network.debug for Netbox circuit ID 112 [production]
12:24 <eoghan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aphlict1002.eqiad.wmnet with reason: host reimage [production]
12:21 <eoghan@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on aphlict1002.eqiad.wmnet with reason: host reimage [production]
12:20 <elukey@deploy2002> helmfile [ml-staging-codfw] DONE helmfile.d/admin 'sync'. [production]
12:20 <elukey@deploy2002> helmfile [ml-staging-codfw] START helmfile.d/admin 'sync'. [production]
12:16 <ayounsi@cumin1001> END (PASS) - Cookbook sre.network.peering (exit_code=0) with action 'configure' for AS: 45295 [production]