2021-11-23
|
17:34 |
<sukhe@cumin1001> |
START - Cookbook sre.ganeti.reboot-vm for VM durum2001.codfw.wmnet |
[production] |
17:33 |
<sukhe@cumin1001> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM doh2002.wikimedia.org |
[production] |
17:31 |
<cmjohnson1> |
upgrading msw's in row D eqiad T259758 |
[production] |
17:28 |
<sukhe@cumin1001> |
START - Cookbook sre.ganeti.reboot-vm for VM doh2002.wikimedia.org |
[production] |
17:26 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2012.codfw.wmnet with OS stretch |
[production] |
17:16 |
<sukhe@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on durum2002.codfw.wmnet with reason: apply new KVM machine settings |
[production] |
17:16 |
<sukhe@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:30:00 on durum2002.codfw.wmnet with reason: apply new KVM machine settings |
[production] |
17:16 |
<sukhe@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on durum2001.codfw.wmnet with reason: apply new KVM machine settings |
[production] |
17:16 |
<sukhe@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:30:00 on durum2001.codfw.wmnet with reason: apply new KVM machine settings |
[production] |
17:15 |
<sukhe@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on doh2002.wikimedia.org with reason: apply new KVM machine settings |
[production] |
17:15 |
<sukhe@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:30:00 on doh2002.wikimedia.org with reason: apply new KVM machine settings |
[production] |
17:15 |
<sukhe@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on doh2001.wikimedia.org with reason: apply new KVM machine settings |
[production] |
17:15 |
<sukhe@cumin1001> |
START - Cookbook sre.hosts.downtime for 0:30:00 on doh2001.wikimedia.org with reason: apply new KVM machine settings |
[production] |
17:15 |
<sukhe@cumin1001> |
END (ERROR) - Cookbook sre.ganeti.reboot-vm (exit_code=97) for VM doh2001.wikimedia.org |
[production] |
17:14 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM mwdebug2002.codfw.wmnet |
[production] |
17:14 |
<sukhe@cumin1001> |
START - Cookbook sre.ganeti.reboot-vm for VM doh2001.wikimedia.org |
[production] |
17:14 |
<sukhe@cumin1001> |
END (ERROR) - Cookbook sre.ganeti.reboot-vm (exit_code=97) for VM doh2001.wikimedia.org |
[production] |
17:11 |
<sukhe@cumin1001> |
START - Cookbook sre.ganeti.reboot-vm for VM doh2001.wikimedia.org |
[production] |
17:10 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.reboot-vm for VM mwdebug2002.codfw.wmnet |
[production] |
17:05 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM mwdebug2001.codfw.wmnet |
[production] |
17:02 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.reboot-vm for VM mwdebug2001.codfw.wmnet |
[production] |
16:59 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM miscweb2002.codfw.wmnet |
[production] |
16:57 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.reboot-vm for VM miscweb2002.codfw.wmnet |
[production] |
16:55 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM doc2001.codfw.wmnet |
[production] |
16:53 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.reboot-vm for VM doc2001.codfw.wmnet |
[production] |
16:51 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-fe2012.codfw.wmnet with OS stretch |
[production] |
16:49 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM pybal-test2001.codfw.wmnet |
[production] |
16:47 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.reboot-vm for VM pybal-test2001.codfw.wmnet |
[production] |
16:41 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM pybal-test2003.codfw.wmnet |
[production] |
16:39 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.reboot-vm for VM pybal-test2003.codfw.wmnet |
[production] |
16:28 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2011.codfw.wmnet with OS stretch |
[production] |
16:13 |
<cmjohnson1> |
updating mgmt switches in row C, racks C2-C8 eqiad T259758 |
[production] |
15:54 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-fe2011.codfw.wmnet with OS stretch |
[production] |
15:46 |
<oblivian@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'apple-search' for release 'main' . |
[production] |
15:46 |
<pt1979@cumin2002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2010.codfw.wmnet with OS stretch |
[production] |
15:41 |
<oblivian@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'apple-search' for release 'main' . |
[production] |
15:31 |
<oblivian@deploy1002> |
helmfile [staging] Ran 'sync' command on namespace 'apple-search' for release 'main' . |
[production] |
15:27 |
<Emperor> |
rolling restart of thanos frontends T294380 |
[production] |
15:01 |
<pt1979@cumin2002> |
START - Cookbook sre.hosts.reimage for host ms-fe2010.codfw.wmnet with OS stretch |
[production] |
14:40 |
<btullis@cumin1001> |
END (PASS) - Cookbook sre.kafka.roll-restart-mirror-maker (exit_code=0) restart MirrorMaker for Kafka A:kafka-mirror-maker-test-eqiad cluster: Roll restart of jvm daemons. |
[production] |
14:34 |
<jbond@cumin1001> |
conftool action : set/pooled=false; selector: name=codfw,dnsdisc=puppetboard |
[production] |
14:30 |
<btullis@cumin1001> |
START - Cookbook sre.kafka.roll-restart-mirror-maker restart MirrorMaker for Kafka A:kafka-mirror-maker-test-eqiad cluster: Roll restart of jvm daemons. |
[production] |
14:09 |
<kharlan@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
14:09 |
<kharlan@deploy1002> |
helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'internal' . |
[production] |
14:03 |
<kharlan@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'internal' . |
[production] |
14:03 |
<kharlan@deploy1002> |
helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . |
[production] |
14:00 |
<marostegui> |
Failover m5 from db1128 to db1132 - T288720 |
[production] |
14:00 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host prometheus2006.codfw.wmnet with OS bullseye |
[production] |
13:50 |
<godog> |
powercycle (again) ms-be2058 |
[production] |
13:48 |
<godog> |
add 80G to prometheus global in eqiad |
[production] |