2026-05-18 §
17:46 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab1005.eqiad.wmnet with reason: T426563 [production]
17:46 <herron> rebooting alert2002 [production]
17:45 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on phab2003.codfw.wmnet with reason: T426563 [production]
17:45 <herron@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert2002.wikimedia.org [production]
17:45 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host alert2002.wikimedia.org [production]
17:44 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host grafana1002.eqiad.wmnet [production]
17:44 <herron@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert2002.wikimedia.org [production]
17:44 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host alert2002.wikimedia.org [production]
17:44 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host centrallog2002.codfw.wmnet [production]
17:43 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc1062.eqiad.wmnet with OS bookworm [production]
17:40 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host grafana1002.eqiad.wmnet [production]
17:38 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host graphite1005.eqiad.wmnet [production]
17:37 <mutante> stewards* - rebooting [production]
17:36 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host grafana2001.codfw.wmnet [production]
17:32 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1061.eqiad.wmnet with reason: host reimage [production]
17:32 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host grafana2001.codfw.wmnet [production]
17:31 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host graphite1005.eqiad.wmnet [production]
17:30 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host graphite2004.codfw.wmnet [production]
17:28 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc1062.eqiad.wmnet with reason: host reimage [production]
17:25 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mwlog1003.eqiad.wmnet [production]
17:23 <jiji@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1061.eqiad.wmnet with reason: host reimage [production]
17:23 <jiji@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on mc1062.eqiad.wmnet with reason: host reimage [production]
17:23 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host graphite2004.codfw.wmnet [production]
17:22 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp1001.eqiad.wmnet [production]
17:21 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf1003.eqiad.wmnet [production]
17:18 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host mwlog1003.eqiad.wmnet [production]
17:16 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host arclamp1001.eqiad.wmnet [production]
17:16 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host webperf1003.eqiad.wmnet [production]
17:16 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host arclamp2001.codfw.wmnet [production]
17:15 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mwlog2003.codfw.wmnet [production]
17:14 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on doc2003.codfw.wmnet with reason: T426563 [production]
17:14 <mutante> doc.wikimedia.org - rebooting backends [production]
17:13 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host webperf2003.codfw.wmnet [production]
17:13 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host o11ytest2001.codfw.wmnet [production]
17:13 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on doc1004.eqiad.wmnet with reason: T426563 [production]
17:13 <topranks> restarted gnmic on netflow3004 as series missing for cr2-esams [production]
17:12 <herron@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host o11ytest1001.eqiad.wmnet [production]
17:11 <jiji@cumin1003> START - Cookbook sre.hosts.reimage for host mc1061.eqiad.wmnet with OS bookworm [production]
17:11 <jiji@cumin1003> START - Cookbook sre.hosts.reimage for host mc1062.eqiad.wmnet with OS bookworm [production]
17:11 <mutante> etherpad - rebooting backends [production]
17:10 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:15:00 on etherpad1004.eqiad.wmnet with reason: T426563 [production]
17:10 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host arclamp2001.codfw.wmnet [production]
17:10 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host webperf2003.codfw.wmnet [production]
17:08 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host mwlog2003.codfw.wmnet [production]
17:07 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host o11ytest2001.codfw.wmnet [production]
17:07 <herron@cumin1003> START - Cookbook sre.hosts.reboot-single for host o11ytest1001.eqiad.wmnet [production]
17:05 <herron@cumin1003> START - Cookbook sre.kafka.roll-restart-reboot-brokers rolling reboot on A:kafka-logging-eqiad [production]
17:04 <mutante> contint2002, phab2002 - rebooting [production]
16:49 <jiji@cumin1003> END (PASS) - Cookbook sre.k8s.reboot-nodes (exit_code=0) rolling reboot on P{wikikube-worker[1328-1384].eqiad.wmnet} and (A:wikikube-master-eqiad or A:wikikube-worker-eqiad) [production]
16:49 <jiji@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[1373-1374].eqiad.wmnet [production]