2022-08-04
17:55 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs2009.codfw.wmnet with reason: shutdown for PDU upgrade [production]
17:55 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs2009.codfw.wmnet with reason: shutdown for PDU upgrade [production]
17:43 <mutante> maps2008 - downtime and shutdown for D3 maintenance [production]
17:42 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on maps2008.codfw.wmnet with reason: codfw reboots [production]
17:42 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on maps2008.codfw.wmnet with reason: codfw reboots [production]
17:42 <mutante> thumbor2006 - downtime and shutdown for D3 maintenance [production]
17:42 <dzahn@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on thumbor2006.codfw.wmnet with reason: codfw reboots [production]
17:41 <dzahn@cumin1001> START - Cookbook sre.hosts.downtime for 1:00:00 on thumbor2006.codfw.wmnet with reason: codfw reboots [production]
17:39 <mutante> mw2386 - systemctl reset-failed [production]
17:31 <mutante> phab2001 - systemctl restart ssh-phab, attempting to clear Icinga pybal alerts, related to reboots [production]
17:30 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on dns2001.wikimedia.org with reason: shutdown for PDU upgrade [production]
17:30 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 3:00:00 on dns2001.wikimedia.org with reason: shutdown for PDU upgrade [production]
17:29 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on dns2001.wikimedia.org with reason: shutdown for PDU upgrade [production]
17:29 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 1:00:00 on dns2001.wikimedia.org with reason: shutdown for PDU upgrade [production]
17:28 <Amir1> dbmaint at s4@eqiad (T312863) [production]
17:26 <bd808@deploy1002> helmfile [eqiad] DONE helmfile.d/services/developer-portal: apply [production]
17:26 <bd808@deploy1002> helmfile [eqiad] START helmfile.d/services/developer-portal: apply [production]
17:24 <bd808@deploy1002> helmfile [codfw] DONE helmfile.d/services/developer-portal: apply [production]
17:23 <bd808@deploy1002> helmfile [codfw] START helmfile.d/services/developer-portal: apply [production]
17:23 <bd808@deploy1002> helmfile [staging] DONE helmfile.d/services/developer-portal: apply [production]
17:23 <bd808@deploy1002> helmfile [staging] START helmfile.d/services/developer-portal: apply [production]
17:20 <mutante> [an-launcher1002:~] $ sudo systemctl reset-failed [production]
17:20 <mvernon@cumin1001> conftool action : set/pooled=no; selector: name=ms-fe2012.codfw.wmnet [production]
17:18 <ladsgroup@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance [production]
17:18 <ladsgroup@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance [production]
17:18 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp2038.codfw.wmnet,service=varnish-fe [production]
17:18 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp2038.codfw.wmnet,service=ats-be [production]
17:18 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp2038.codfw.wmnet,service=ats-tls [production]
17:16 <Emperor> shutdown of moss-fe2002.codfw.wmnet,ms-be20[37,38,43,61,65,69].codfw.wmnet,ms-fe2012.codfw.wmnet,thanos-fe2003.codfw.wmnet for power work T310146 [production]
17:16 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on cp[2035-2036].codfw.wmnet with reason: shutdown for PDU upgrade [production]
17:15 <sukhe@cumin2002> START - Cookbook sre.hosts.downtime for 4:00:00 on cp[2035-2036].codfw.wmnet with reason: shutdown for PDU upgrade [production]
17:15 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on 9 hosts with reason: PDU work [production]
17:15 <mvernon@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on 9 hosts with reason: PDU work [production]
17:15 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=varnish-fe [production]
17:15 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=ats-be [production]
17:15 <sukhe@puppetmaster1001> conftool action : set/pooled=no; selector: name=cp203[56]\.codfw\.wmnet,service=ats-tls [production]
17:13 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet [production]
17:13 <mvernon@cumin1001> START - Cookbook sre.hosts.remove-downtime for ms-be[2036,2049,2054].codfw.wmnet,thanos-be2003.codfw.wmnet [production]
17:12 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=varnish-fe [production]
17:12 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=ats-be [production]
17:12 <sukhe@puppetmaster1001> conftool action : set/pooled=yes; selector: name=cp2037.codfw.wmnet,service=ats-tls [production]
17:12 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on elastic2050.codfw.wmnet with reason: T310146 [production]
17:12 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on elastic2050.codfw.wmnet with reason: T310146 [production]
17:11 <ebysans@deploy1002> Finished deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288] (duration: 00m 04s) [production]
17:11 <ebysans@deploy1002> Started deploy [analytics/refinery@2553288] (hadoop-test): Regular analytics weekly train TEST [analytics/refinery@2553288] [production]
17:11 <ebysans@deploy1002> Finished deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288] (duration: 00m 07s) [production]
17:10 <ebysans@deploy1002> Started deploy [analytics/refinery@2553288] (thin): Regular analytics weekly train THIN [analytics/refinery@2553288] [production]
17:10 <ebysans@deploy1002> Finished deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] (duration: 00m 15s) [production]
17:09 <ebysans@deploy1002> Started deploy [analytics/refinery@2553288]: Regular analytics weekly train [analytics/refinery@2553288] [production]
17:07 <sukhe@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on lvs2010.codfw.wmnet with reason: shutdown for PDU upgrade [production]