2951-3000 of 10000 results (35ms)
2021-02-04 ยง
13:29 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host mwdebug1003.eqiad.wmnet [production]
13:10 <jbond42> upload cas_6.2.7 to downgrade cas T273867 [production]
13:04 <ariel@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on snapshot1010.eqiad.wmnet with reason: REIMAGE [production]
13:02 <ariel@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on snapshot1010.eqiad.wmnet with reason: REIMAGE [production]
12:27 <moritzm> installing libdatetime-timezone-perl updates on Buster [production]
12:17 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 17 hosts with reason: reboot [production]
12:17 <jmm@cumin2001> START - Cookbook sre.hosts.downtime for 4:00:00 on 17 hosts with reason: reboot [production]
12:17 <moritzm> rebooting mw[1264-1268,1276-1277,1337-1338,1404-1409,1411,1413].eqiad.wmnet for kernel update [production]
12:08 <godog> bounce rsyslog on centrallog1001 [production]
11:47 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: dc=eqiad,cluster=maps,service=kartotherian,name=maps1009.eqiad.wmnet [production]
11:47 <hnowlan@puppetmaster1001> conftool action : set/pooled=no; selector: dc=eqiad,cluster=maps,service=kartotherian-ssl,name=maps1009.eqiad.wmnet [production]
11:30 <elukey@cumin1001> END (PASS) - Cookbook sre.aqs.roll-restart (exit_code=0) [production]
11:26 <elukey@cumin1001> START - Cookbook sre.aqs.roll-restart [production]
11:07 <elukey@puppetmaster1001> conftool action : set/pooled=true; selector: dnsdisc=eventstreams-internal [production]
10:35 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 93 hosts with reason: reboot [production]
10:35 <moritzm> rebooting mw[2261-2262,2268-2271,2273-2277,2283-2288,2290-2335,2337-2339,2350-2376].codfw.wmnet [production]
10:34 <jmm@cumin2001> START - Cookbook sre.hosts.downtime for 4:00:00 on 93 hosts with reason: reboot [production]
10:23 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 100%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14204 and previous config saved to /var/cache/conftool/dbconfig/20210204-102312-root.json [production]
10:15 <elukey> restart pybal on lvs1015 (low-traffic active) to pick up new changes for eventstreams-internal (new VIP) - T269160 [production]
10:13 <elukey> restart pybal on lvs2009 (low-traffic active) to pick up new changes for eventstreams-internal (new VIP) - T269160 [production]
10:08 <elukey> restart pybal on lvs1016 (low-traffic standby) to pick up new changes for eventstreams-internal (new VIP) - T269160 [production]
10:08 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 75%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14203 and previous config saved to /var/cache/conftool/dbconfig/20210204-100808-root.json [production]
10:05 <elukey> restart pybal on lvs2010 (low-traffic standby) to pick up new changes for eventstreams-internal (new VIP) - T269160 [production]
09:58 <marostegui@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) [production]
09:53 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 60%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14202 and previous config saved to /var/cache/conftool/dbconfig/20210204-095305-root.json [production]
09:49 <marostegui@cumin1001> START - Cookbook sre.hosts.decommission [production]
09:45 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on 37 hosts with reason: reboot [production]
09:44 <jmm@cumin2001> START - Cookbook sre.hosts.downtime for 4:00:00 on 37 hosts with reason: reboot [production]
09:41 <moritzm> rebooting mw[2215-2219,2221-2243,2246-2249,2251-2253,2255,2258 for kernel update [production]
09:38 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 50%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14201 and previous config saved to /var/cache/conftool/dbconfig/20210204-093801-root.json [production]
09:37 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host flowspec1001.eqiad.wmnet [production]
09:33 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host flowspec1001.eqiad.wmnet [production]
09:24 <XioNoX> re-enable ping offload in esams - T273278 [production]
09:24 <marostegui@cumin1001> dbctl commit (dc=all): 'Remove db1078 from dbctl T273597', diff saved to https://phabricator.wikimedia.org/P14199 and previous config saved to /var/cache/conftool/dbconfig/20210204-092414-marostegui.json [production]
09:23 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ping3001.esams.wmnet [production]
09:22 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 30%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14198 and previous config saved to /var/cache/conftool/dbconfig/20210204-092257-root.json [production]
09:20 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ping3001.esams.wmnet [production]
09:17 <XioNoX> disable ping offload in esams (eqiad re-enabled) - T273278 [production]
09:15 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ping1001.eqiad.wmnet [production]
09:15 <godog> roll restart lvs low-traffic in codfw/eqiad for swift healthcheck updates [production]
09:11 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ping1001.eqiad.wmnet [production]
09:10 <XioNoX> disable ping offload in eqiad (codfw-re-enabled) - T273278 [production]
09:07 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 25%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14197 and previous config saved to /var/cache/conftool/dbconfig/20210204-090754-root.json [production]
09:06 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ping2001.codfw.wmnet [production]
09:04 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host ping2001.codfw.wmnet [production]
09:02 <XioNoX> disable ping offload in codfw - T273278 [production]
08:52 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 20%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14196 and previous config saved to /var/cache/conftool/dbconfig/20210204-085250-root.json [production]
08:37 <marostegui@cumin1001> dbctl commit (dc=all): 'db1173 (re)pooling @ 15%: Slowly pooling db1173 for the first time in s6', diff saved to https://phabricator.wikimedia.org/P14195 and previous config saved to /var/cache/conftool/dbconfig/20210204-083747-root.json [production]
08:33 <jmm@cumin2001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host sretest1001.eqiad.wmnet [production]
08:29 <jmm@cumin2001> START - Cookbook sre.hosts.reboot-single for host sretest1001.eqiad.wmnet [production]