2022-08-04
15:25 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 3:30:00 on phab2001.codfw.wmnet with reason: PDU swap [production]
15:25 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on cp[2037-2038].codfw.wmnet with reason: shutdown for PDU upgrade [production]
15:24 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 4:00:00 on cp[2037-2038].codfw.wmnet with reason: shutdown for PDU upgrade [production]
15:24 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=varnish-fe [production]
15:23 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=ats-be [production]
15:23 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp203[78]\.codfw\.wmnet,service=ats-tls [production]
15:21 <XioNoX> un-drain codfw-ulsfo link - T310310 [production]
15:21 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db[2116,2127,2167-2168].codfw.wmnet,es2022.codfw.wmnet with reason: Maintenance (T310145) [production]
15:20 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 10:00:00 on db[2116,2127,2167-2168].codfw.wmnet,es2022.codfw.wmnet with reason: Maintenance (T310145) [production]
15:19 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depool C6 for PDU maint (T310145)', diff saved to https://phabricator.wikimedia.org/P32285 and previous config saved to /var/cache/conftool/dbconfig/20220804-151958-ladsgroup.json [production]
15:16 <btullis@cumin1001> END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) for AQS aqs cluster: Roll restart of all AQS's nodejs daemons. [production]
15:16 <hnowlan@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on restbase[2016,2020,2025].codfw.wmnet with reason: PDU maintenance [production]
15:16 <hnowlan@cumin1001> START - Cookbook sre.hosts.downtime for 3:00:00 on restbase[2016,2020,2025].codfw.wmnet with reason: PDU maintenance [production]
15:13 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 10:00:00 on db[2114,2126,2166].codfw.wmnet with reason: Maintenance (T310145) [production]
15:13 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 10:00:00 on db[2114,2126,2166].codfw.wmnet with reason: Maintenance (T310145) [production]
15:13 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp203[12]\.codfw\.wmnet,service=varnish-fe [production]
15:13 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp203[12]\.codfw\.wmnet,service=ats-be [production]
15:13 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp203[12]\.codfw\.wmnet,service=ats-tls [production]
15:12 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ms-be[2058,2064].codfw.wmnet [production]
15:12 <mvernon@cumin1001> START - Cookbook sre.hosts.remove-downtime for ms-be[2058,2064].codfw.wmnet [production]
15:11 <ladsgroup@cumin1001> dbctl commit (dc=all): 'Depool hosts for PDU maint (T310145)', diff saved to https://phabricator.wikimedia.org/P32284 and previous config saved to /var/cache/conftool/dbconfig/20220804-151121-ladsgroup.json [production]
15:09 <godog> poweroff logstash2002 - T310145 [production]
15:07 <_joe_> powering down mc203{0,1} [production]
15:07 <root@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on logstash2002.codfw.wmnet with reason: pdu [production]
15:06 <root@cumin1001> START - Cookbook sre.hosts.downtime for 4:00:00 on logstash2002.codfw.wmnet with reason: pdu [production]
15:05 <btullis@cumin1001> START - Cookbook sre.aqs.roll-restart for AQS aqs cluster: Roll restart of all AQS's nodejs daemons. [production]
14:58 <jelto> power off mc20[30-31] [production]
14:56 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on mc[2030-2031].codfw.wmnet with reason: PDU swap [production]
14:56 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 4:00:00 on mc[2030-2031].codfw.wmnet with reason: PDU swap [production]
14:56 <XioNoX> draining codfw-ulsfo link - T310310 [production]
14:36 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: name=maps2009.codfw.wmnet [production]
14:35 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=maps2007.codfw.wmnet [production]
14:35 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=restbase2025.codfw.wmnet [production]
14:35 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=restbase2020.codfw.wmnet [production]
14:35 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: name=restbase2016.codfw.wmnet [production]
14:32 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on wdqs2011.codfw.wmnet with reason: T310145 [production]
14:31 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wdqs2011.codfw.wmnet with reason: T310145 [production]
14:25 <jelto> power off gitlab-runner2003 [production]
14:25 <jelto@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:30:00 on gitlab-runner2003.codfw.wmnet with reason: PDU swap [production]
14:25 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on wdqs2001.codfw.wmnet with reason: T310145 [production]
14:24 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wdqs2001.codfw.wmnet with reason: T310145 [production]
14:24 <jelto@cumin1001> START - Cookbook sre.hosts.downtime for 4:30:00 on gitlab-runner2003.codfw.wmnet with reason: PDU swap [production]
14:23 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2032.codfw.wmnet with reason: T310145 [production]
14:22 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2032.codfw.wmnet with reason: T310145 [production]
14:22 <root@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on logstash2035.codfw.wmnet with reason: pdu [production]
14:22 <godog> poweroff logstash2035 - T310145 [production]
14:22 <root@cumin1001> START - Cookbook sre.hosts.downtime for 4:00:00 on logstash2035.codfw.wmnet with reason: pdu [production]
14:21 <Emperor> shutdown ms-be20[58,64].codfw.wmnet for PDU swap T310145 [production]
14:20 <mwdebug-deploy@deploy1002> helmfile [codfw] DONE helmfile.d/services/mwdebug: apply [production]
14:19 <mwdebug-deploy@deploy1002> helmfile [codfw] START helmfile.d/services/mwdebug: apply [production]