2022-08-04
ยง
|
17:16 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on cp[2035-2036].codfw.wmnet with reason: shutdown for PDU upgrade |
[production] |
17:15 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.downtime for 4:00:00 on cp[2035-2036].codfw.wmnet with reason: shutdown for PDU upgrade |
[production] |
17:15 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 9 hosts with reason: PDU work |
[production] |
17:15 |
<mvernon@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on 9 hosts with reason: PDU work |
[production] |
17:15 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=varnish-fe |
[production] |
17:15 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=ats-be |
[production] |
17:15 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=ats-tls |
[production] |
17:13 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet |
[production] |
17:13 |
<mvernon@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet |
[production] |
17:12 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=varnish-fe |
[production] |
17:12 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=ats-be |
[production] |
17:12 |
<sukhe@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=ats-tls |
[production] |
17:12 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2050.codfw.wmnet with reason: T310146 |
[production] |
17:12 |
<bking@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2050.codfw.wmnet with reason: T310146 |
[production] |
17:11 |
<ebysans@deploy1002> |
Finished deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288] (duration: 00m 04s) |
[production] |
17:11 |
<ebysans@deploy1002> |
Started deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288] |
[production] |
17:11 |
<ebysans@deploy1002> |
Finished deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288] (duration: 00m 07s) |
[production] |
17:10 |
<ebysans@deploy1002> |
Started deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288] |
[production] |
17:10 |
<ebysans@deploy1002> |
Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 00m 15s) |
[production] |
17:09 |
<ebysans@deploy1002> |
Started deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] |
[production] |
17:07 |
<sukhe@cumin2002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on lvs2010.codfw.wmnet with reason: shutdown for PDU upgrade |
[production] |
17:07 |
<sukhe@cumin2002> |
START - Cookbook sre.hosts.downtime for 1:00:00 on lvs2010.codfw.wmnet with reason: shutdown for PDU upgrade |
[production] |
16:55 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=maps2008.codfw.wmnet |
[production] |
16:51 |
<ebysans@deploy1002> |
Finished deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288] (duration: 07m 14s) |
[production] |
16:45 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=restbase2016.codfw.wmnet |
[production] |
16:45 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=restbase202[05].codfw.wmnet |
[production] |
16:45 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=no; selector: name=restbase202[05].codfw.wmnet |
[production] |
16:45 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: name=maps2007.codfw.wmnet |
[production] |
16:43 |
<ebysans@deploy1002> |
Started deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288] |
[production] |
16:43 |
<ebysans@deploy1002> |
Finished deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288] (duration: 00m 07s) |
[production] |
16:43 |
<ebysans@deploy1002> |
Started deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288] |
[production] |
16:37 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for 18 hosts |
[production] |
16:37 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.remove-downtime for 18 hosts |
[production] |
16:35 |
<bking@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2059.codfw.wmnet with reason: T310145 |
[production] |
16:35 |
<bking@cumin1001> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2059.codfw.wmnet with reason: T310145 |
[production] |
16:34 |
<jayme@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main2003.codfw.wmnet with reason: PDU swap |
[production] |
16:34 |
<ebysans@deploy1002> |
Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 00m 20s) |
[production] |
16:34 |
<jayme@cumin1001> |
START - Cookbook sre.hosts.downtime for 1:00:00 on kafka-main2003.codfw.wmnet with reason: PDU swap |
[production] |
16:34 |
<ebysans@deploy1002> |
Started deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] |
[production] |
16:32 |
<ebysans@deploy1002> |
Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 29m 59s) |
[production] |
16:30 |
<ladsgroup@cumin1001> |
dbctl commit (dc=all): 'Depool D3 for PDU maint', diff saved to https://phabricator.wikimedia.org/P32286 and previous config saved to /var/cache/conftool/dbconfig/20220804-163037-ladsgroup.json |
[production] |
16:28 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:28 |
<ladsgroup@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: Config: [[gerrit:820376|Start reading from new templatelinks columns in commons (T306673)]] (duration: 03m 00s) |
[production] |
16:27 |
<mwdebug-deploy@deploy1002> |
helmfile [codfw] START helmfile.d/services/mwdebug: apply |
[production] |
16:27 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/mwdebug: apply |
[production] |
16:26 |
<mwdebug-deploy@deploy1002> |
helmfile [eqiad] START helmfile.d/services/mwdebug: apply |
[production] |
16:17 |
<brett> |
deploying authdns - geodns: Map out African countries by DC latency (T311472) |
[production] |
16:12 |
<cwhite> |
poweroff logstash2028 - T310145 |
[production] |
16:06 |
<Emperor> |
shutdown ms-be20[39,49,54].codfw.wmnet,thanos-be2003 for PDU swap T310145 |
[production] |
16:03 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet with reason: PDU work |
[production] |