2021-11-23
17:31 <cmjohnson1> upgrading msw's in row D eqiad T259758 [production]
17:28 <sukhe@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM doh2002.wikimedia.org [production]
17:26 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2012.codfw.wmnet with OS stretch [production]
17:16 <sukhe@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on durum2002.codfw.wmnet with reason: apply new KVM machine settings [production]
17:16 <sukhe@cumin1001> START - Cookbook sre.hosts.downtime for 0:30:00 on durum2002.codfw.wmnet with reason: apply new KVM machine settings [production]
17:16 <sukhe@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on durum2001.codfw.wmnet with reason: apply new KVM machine settings [production]
17:16 <sukhe@cumin1001> START - Cookbook sre.hosts.downtime for 0:30:00 on durum2001.codfw.wmnet with reason: apply new KVM machine settings [production]
17:15 <sukhe@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on doh2002.wikimedia.org with reason: apply new KVM machine settings [production]
17:15 <sukhe@cumin1001> START - Cookbook sre.hosts.downtime for 0:30:00 on doh2002.wikimedia.org with reason: apply new KVM machine settings [production]
17:15 <sukhe@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on doh2001.wikimedia.org with reason: apply new KVM machine settings [production]
17:15 <sukhe@cumin1001> START - Cookbook sre.hosts.downtime for 0:30:00 on doh2001.wikimedia.org with reason: apply new KVM machine settings [production]
17:15 <sukhe@cumin1001> END (ERROR) - Cookbook sre.ganeti.reboot-vm (exit_code=97) for VM doh2001.wikimedia.org [production]
17:14 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM mwdebug2002.codfw.wmnet [production]
17:14 <sukhe@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM doh2001.wikimedia.org [production]
17:14 <sukhe@cumin1001> END (ERROR) - Cookbook sre.ganeti.reboot-vm (exit_code=97) for VM doh2001.wikimedia.org [production]
17:11 <sukhe@cumin1001> START - Cookbook sre.ganeti.reboot-vm for VM doh2001.wikimedia.org [production]
17:10 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM mwdebug2002.codfw.wmnet [production]
17:05 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM mwdebug2001.codfw.wmnet [production]
17:02 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM mwdebug2001.codfw.wmnet [production]
16:59 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM miscweb2002.codfw.wmnet [production]
16:57 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM miscweb2002.codfw.wmnet [production]
16:55 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM doc2001.codfw.wmnet [production]
16:53 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM doc2001.codfw.wmnet [production]
16:51 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host ms-fe2012.codfw.wmnet with OS stretch [production]
16:49 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM pybal-test2001.codfw.wmnet [production]
16:47 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM pybal-test2001.codfw.wmnet [production]
16:41 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM pybal-test2003.codfw.wmnet [production]
16:39 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM pybal-test2003.codfw.wmnet [production]
16:28 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2011.codfw.wmnet with OS stretch [production]
16:13 <cmjohnson1> updating mgmt switches in row C, racks C2-C8 eqiad T259758 [production]
15:54 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host ms-fe2011.codfw.wmnet with OS stretch [production]
15:46 <oblivian@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'apple-search' for release 'main' . [production]
15:46 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-fe2010.codfw.wmnet with OS stretch [production]
15:41 <oblivian@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'apple-search' for release 'main' . [production]
15:31 <oblivian@deploy1002> helmfile [staging] Ran 'sync' command on namespace 'apple-search' for release 'main' . [production]
15:27 <Emperor> rolling restart of thanos frontends T294380 [production]
15:01 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host ms-fe2010.codfw.wmnet with OS stretch [production]
14:40 <btullis@cumin1001> END (PASS) - Cookbook sre.kafka.roll-restart-mirror-maker (exit_code=0) restart MirrorMaker for Kafka A:kafka-mirror-maker-test-eqiad cluster: Roll restart of jvm daemons. [production]
14:34 <jbond@cumin1001> conftool action : set/pooled=false; selector: name=codfw,dnsdisc=puppetboard [production]
14:30 <btullis@cumin1001> START - Cookbook sre.kafka.roll-restart-mirror-maker restart MirrorMaker for Kafka A:kafka-mirror-maker-test-eqiad cluster: Roll restart of jvm daemons. [production]
14:09 <kharlan@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . [production]
14:09 <kharlan@deploy1002> helmfile [codfw] Ran 'sync' command on namespace 'linkrecommendation' for release 'internal' . [production]
14:03 <kharlan@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'internal' . [production]
14:03 <kharlan@deploy1002> helmfile [eqiad] Ran 'sync' command on namespace 'linkrecommendation' for release 'external' . [production]
14:00 <marostegui> Failover m5 from db1128 to db1132 - T288720 [production]
14:00 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host prometheus2006.codfw.wmnet with OS bullseye [production]
13:50 <godog> powercycle (again) ms-be2058 [production]
13:48 <godog> add 80G to prometheus global in eqiad [production]
13:31 <filippo@cumin1001> START - Cookbook sre.hosts.reimage for host prometheus2006.codfw.wmnet with OS bullseye [production]
13:29 <filippo@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host prometheus2005.codfw.wmnet with OS bullseye [production]