2021-01-15
ยง
|
16:31 |
<wm-bot> |
<bd808> Updated to matterbridge 1.21.0 |
[tools.bridgebot] |
16:17 |
<bstorm> |
canceled downtime for maintain-dbusers on labstore1004 T272127 |
[production] |
15:43 |
<wm-bot> |
<bd808> Restarting bot. IRC connections are missing NickServ auth. |
[tools.bridgebot] |
15:30 |
<elukey> |
restart archiva to apply hot-fix for T272082 |
[production] |
15:17 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-replica1002.wikimedia.org |
[production] |
15:14 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ldap-replica1002.wikimedia.org |
[production] |
15:11 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-replica1001.wikimedia.org |
[production] |
15:07 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ldap-replica1001.wikimedia.org |
[production] |
15:05 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host moscovium.eqiad.wmnet |
[production] |
15:01 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host moscovium.eqiad.wmnet |
[production] |
14:42 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-replica2003.wikimedia.org |
[production] |
14:39 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ldap-replica2003.wikimedia.org |
[production] |
14:29 |
<jmm@cumin2001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ldap-replica2004.wikimedia.org |
[production] |
14:25 |
<jmm@cumin2001> |
START - Cookbook sre.hosts.reboot-single for host ldap-replica2004.wikimedia.org |
[production] |
13:41 |
<arturo> |
icinga downtime labstore1004 maintain-dbuser alert until 2021-01-19 (T272125) |
[admin] |
11:30 |
<jynus> |
rolling restart of eqiad source backup dbs |
[production] |
11:19 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-conf1002.eqiad.wmnet |
[production] |
11:15 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host an-conf1002.eqiad.wmnet |
[production] |
11:11 |
<XioNoX> |
update cloud-in4 firewall rules |
[production] |
11:06 |
<jiji@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2036.codfw.wmnet |
[production] |
10:59 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host mc2036.codfw.wmnet |
[production] |
10:58 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-conf1003.eqiad.wmnet |
[production] |
10:56 |
<jiji@cumin1001> |
END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host mc2036.codfw.wmnet |
[production] |
10:55 |
<jiji@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host mc2036.codfw.wmnet |
[production] |
10:53 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host an-conf1003.eqiad.wmnet |
[production] |
10:53 |
<vgutierrez> |
re-enable puppet on acme-chief clients |
[production] |
10:53 |
<jynus> |
rolling restart of dbprov2* hosts |
[production] |
10:52 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief1001.eqiad.wmnet |
[production] |
10:52 |
<_joe_> |
rebuilding the docker images coredns,nutcracker,prometheus-statsd-exporter,service-checker,wmfdebug to use wikimedia-buster as a base |
[production] |
10:51 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-conf1001.eqiad.wmnet |
[production] |
10:48 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host acmechief1001.eqiad.wmnet |
[production] |
10:46 |
<vgutierrez> |
disable puppet on acme-chief clients |
[production] |
10:45 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host an-conf1001.eqiad.wmnet |
[production] |
10:43 |
<effie> |
reboot mc2036 - T269596 |
[production] |
10:40 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test1001.eqiad.wmnet |
[production] |
10:27 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host acmechief-test1001.eqiad.wmnet |
[production] |
10:26 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host acmechief-test2001.codfw.wmnet |
[production] |
10:21 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host acmechief-test2001.codfw.wmnet |
[production] |
10:10 |
<elukey@cumin1001> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-test-client1001.eqiad.wmnet |
[production] |
10:07 |
<elukey@cumin1001> |
START - Cookbook sre.hosts.reboot-single for host an-test-client1001.eqiad.wmnet |
[production] |
10:02 |
<vgutierrez@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) |
[production] |
09:58 |
<reedy@deploy1001> |
Synchronized php-1.36.0-wmf.26/extensions/GrowthExperiments/includes/NewcomerTasks/TaskSuggester/CacheDecorator.php: T272103 (duration: 00m 57s) |
[production] |
09:49 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.reboot-single |
[production] |
09:47 |
<arturo> |
labstore1004 maintain-dbusers affected by T272127 and T272125 |
[admin] |
09:36 |
<vgutierrez> |
rolling restart acme-chief servers to catch up on kernel upgrades |
[production] |
09:24 |
<jynus> |
rolling restart of dbprov1* hosts |
[production] |
09:22 |
<arturo> |
restart maintain-dbusers.service in labstore1004 |
[admin] |
09:21 |
<elukey> |
roll restart druid brokers on druid public - stuck after datasource drop |
[analytics] |
09:18 |
<godog> |
swift codfw-prod: more weight to ms-be20[58-61] - T269337 |
[production] |
09:07 |
<moritzm> |
installing bast5002 T257324 |
[production] |