|
2026-05-18
ยง
|
| 17:46 |
<dzahn@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab1005.eqiad.wmnet with reason: T426563 |
[production] |
| 17:46 |
<herron> |
rebooting alert2002 |
[production] |
| 17:45 |
<dzahn@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab2003.codfw.wmnet with reason: T426563 |
[production] |
| 17:45 |
<herron@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert2002.wikimedia.org |
[production] |
| 17:45 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host alert2002.wikimedia.org |
[production] |
| 17:44 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host grafana1002.eqiad.wmnet |
[production] |
| 17:44 |
<herron@cumin1003> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert2002.wikimedia.org |
[production] |
| 17:44 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host alert2002.wikimedia.org |
[production] |
| 17:44 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host centrallog2002.codfw.wmnet |
[production] |
| 17:43 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1062.eqiad.wmnet with OS bookworm |
[production] |
| 17:40 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host grafana1002.eqiad.wmnet |
[production] |
| 17:38 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host graphite1005.eqiad.wmnet |
[production] |
| 17:37 |
<mutante> |
stewards* - rebooting |
[production] |
| 17:36 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host grafana2001.codfw.wmnet |
[production] |
| 17:32 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1061.eqiad.wmnet with reason: host reimage |
[production] |
| 17:32 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host grafana2001.codfw.wmnet |
[production] |
| 17:31 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host graphite1005.eqiad.wmnet |
[production] |
| 17:30 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host graphite2004.codfw.wmnet |
[production] |
| 17:28 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1062.eqiad.wmnet with reason: host reimage |
[production] |
| 17:25 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mwlog1003.eqiad.wmnet |
[production] |
| 17:23 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1061.eqiad.wmnet with reason: host reimage |
[production] |
| 17:23 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.downtime for 2:00:00 on mc1062.eqiad.wmnet with reason: host reimage |
[production] |
| 17:23 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host graphite2004.codfw.wmnet |
[production] |
| 17:22 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp1001.eqiad.wmnet |
[production] |
| 17:21 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf1003.eqiad.wmnet |
[production] |
| 17:18 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host mwlog1003.eqiad.wmnet |
[production] |
| 17:16 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host arclamp1001.eqiad.wmnet |
[production] |
| 17:16 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host webperf1003.eqiad.wmnet |
[production] |
| 17:16 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp2001.codfw.wmnet |
[production] |
| 17:15 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mwlog2003.codfw.wmnet |
[production] |
| 17:14 |
<dzahn@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on doc2003.codfw.wmnet with reason: T426563 |
[production] |
| 17:14 |
<mutante> |
doc.wikimedia.org - rebooting backends |
[production] |
| 17:13 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf2003.codfw.wmnet |
[production] |
| 17:13 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host o11ytest2001.codfw.wmnet |
[production] |
| 17:13 |
<dzahn@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on doc1004.eqiad.wmnet with reason: T426563 |
[production] |
| 17:13 |
<topranks> |
restarted gnmic on netflow3004 as series missing for cr2-esams |
[production] |
| 17:12 |
<herron@cumin1003> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host o11ytest1001.eqiad.wmnet |
[production] |
| 17:11 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reimage for host mc1061.eqiad.wmnet with OS bookworm |
[production] |
| 17:11 |
<jiji@cumin1003> |
START - Cookbook sre.hosts.reimage for host mc1062.eqiad.wmnet with OS bookworm |
[production] |
| 17:11 |
<mutante> |
etherpad - rebooting backends |
[production] |
| 17:10 |
<dzahn@cumin2002> |
DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on etherpad1004.eqiad.wmnet with reason: T426563 |
[production] |
| 17:10 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host arclamp2001.codfw.wmnet |
[production] |
| 17:10 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host webperf2003.codfw.wmnet |
[production] |
| 17:08 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host mwlog2003.codfw.wmnet |
[production] |
| 17:07 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host o11ytest2001.codfw.wmnet |
[production] |
| 17:07 |
<herron@cumin1003> |
START - Cookbook sre.hosts.reboot-single for host o11ytest1001.eqiad.wmnet |
[production] |
| 17:05 |
<herron@cumin1003> |
START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling reboot on A:kafka-logging-eqiad |
[production] |
| 17:04 |
<mutante> |
contint2002, phab2002 - rebooting |
[production] |
| 16:49 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on P{wikikube-worker[1328-1384].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) |
[production] |
| 16:49 |
<jiji@cumin1003> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1373-1374].eqiad.wmnet |
[production] |