2026-03-20 §
23:30 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs2013.codfw.wmnet [production]
23:30 <brett@cumin2002> START - Cookbook sre.hosts.remove-downtime for lvs2013.codfw.wmnet [production]
22:34 <brett@cumin2002> END (ERROR) - Cookbook sre.hosts.reboot-single (exit_code=97) for host lvs2013.codfw.wmnet [production]
22:34 <brett> Started pybal on lvs2013 [production]
22:27 <brett@cumin2002> START - Cookbook sre.hosts.reboot-single for host lvs2013.codfw.wmnet [production]
21:57 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp5023.eqsin.wmnet [reason: trixie reimaging] [production]
21:57 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5023.eqsin.wmnet with OS trixie [production]
21:55 <hashar> Upgrading CI Jenkins T420477 [production]
21:25 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5023.eqsin.wmnet with reason: host reimage [production]
21:21 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5023.eqsin.wmnet with reason: host reimage [production]
21:04 <brett@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs2013.codfw.wmnet with reason: debugging ipip [production]
20:46 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp5023.eqsin.wmnet with OS trixie [production]
20:45 <mutante> contint1003/2003 apt remove --purge apache2* ; apt remove --purge php* | T418521 [production]
20:43 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs2013.codfw.wmnet [production]
20:40 <brett@cumin2002> START - Cookbook sre.hosts.reboot-single for host lvs2013.codfw.wmnet [production]
20:38 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp5023.eqsin.wmnet with OS trixie [production]
20:24 <sukhe@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on doh3006.wikimedia.org with reason: depooled host [production]
20:24 <sukhe@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on doh3005.wikimedia.org with reason: depooled host [production]
20:23 <sukhe@cumin1003> DONE (ERROR) - Cookbook sre.hosts.downtime (exit_code=97) for 1 day, 0:00:00 on doh3005.wikimedia.org with reason: depooled host [production]
19:50 <brett@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs2013.codfw.wmnet with reason: debugging ipip [production]
19:33 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host lvs2013.codfw.wmnet [production]
19:30 <cdobbins@cumin2002> START - Cookbook sre.hosts.reboot-single for host lvs2013.codfw.wmnet [production]
19:21 <brett@cumin2002> END (PASS) - Cookbook sre.cdn.roll-restart-reboot-tcp-proxy (exit_code=0) rolling reboot on A:tcpproxy and A:tcpproxy [production]
19:16 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp5023.eqsin.wmnet with OS trixie [production]
19:16 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp5023.eqsin.wmnet [reason: trixie reimaging] [production]
19:16 <cdobbins@cumin2002> conftool action : set/pooled=yes; selector: name=cp5021.eqsin.wmnet [reason: trixie reimaging] [production]
19:14 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5021.eqsin.wmnet with OS trixie [production]
18:52 <cdobbins@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs2013.codfw.wmnet with reason: reboot [production]
18:43 <cdobbins@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage [production]
18:39 <cdobbins@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on cp5021.eqsin.wmnet with reason: host reimage [production]
18:28 <cdobbins@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on lvs2013.codfw.wmnet with reason: reboot [production]
18:16 <brett@cumin2002> START - Cookbook sre.cdn.roll-restart-reboot-tcp-proxy rolling reboot on A:tcpproxy and A:tcpproxy [production]
18:14 <jhathaway@cumin1003> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on db1253.eqiad.wmnet with reason: T420041 [production]
17:59 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS trixie [production]
17:54 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cp5021.eqsin.wmnet with OS trixie [production]
17:51 <cdobbins@cumin2002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host lvs2014.codfw.wmnet [production]
17:40 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4 days, 0:00:00 on contint1003.wikimedia.org with reason: jenkins on java21 [production]
17:39 <cdobbins@cumin2002> START - Cookbook sre.hosts.reboot-single for host lvs2014.codfw.wmnet [production]
16:54 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-worker1172.eqiad.wmnet with OS bullseye [production]
16:54 <btullis@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1172.eqiad.wmnet with OS bullseye [production]
16:46 <btullis@cumin1003> START - Cookbook sre.hosts.reimage for host an-worker1172.eqiad.wmnet with OS bullseye [production]
16:33 <cdobbins@cumin2002> START - Cookbook sre.hosts.reimage for host cp5021.eqsin.wmnet with OS trixie [production]
16:32 <cdobbins@cumin2002> conftool action : set/pooled=no; selector: name=cp5021.eqsin.wmnet [reason: trixie reimaging] [production]
16:09 <btullis@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1172.eqiad.wmnet with OS bullseye [production]
16:08 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc-wf2002.codfw.wmnet [production]
16:02 <jiji@cumin1003> START - Cookbook sre.hosts.reboot-single for host mc-wf2002.codfw.wmnet [production]
15:51 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc-wf2001.codfw.wmnet [production]
15:45 <dpogorzelski@deploy2002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'edit-check' for release 'main' . [production]
15:45 <jiji@cumin1003> START - Cookbook sre.hosts.reboot-single for host mc-wf2001.codfw.wmnet [production]
15:43 <jiji@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host mc2041.codfw.wmnet [production]