2022-05-09
11:11 <mvernon@cumin1001> START - Cookbook sre.hosts.reboot-single for host ms-fe1010.eqiad.wmnet [production]
11:10 <_joe_> removing stale files from config-master on puppetmaster1001; this could cause some flapping confd alerts [production]
11:07 <mvernon@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host ms-fe1010.eqiad.wmnet [production]
11:07 <mvernon@cumin1001> START - Cookbook sre.hosts.reboot-single for host ms-fe1010.eqiad.wmnet [production]
11:05 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM ncredir6001.drmrs.wmnet [production]
10:59 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM prometheus6001.drmrs.wmnet [production]
10:55 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM prometheus6001.drmrs.wmnet [production]
10:52 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM bast6001.wikimedia.org [production]
10:48 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM bast6001.wikimedia.org [production]
10:46 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM netflow6001.drmrs.wmnet [production]
10:42 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM netflow6001.drmrs.wmnet [production]
10:39 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM install6001.wikimedia.org [production]
10:34 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM install6001.wikimedia.org [production]
10:30 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host ms-be2052.codfw.wmnet with OS bullseye [production]
10:07 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ncredir3002.esams.wmnet [production]
10:02 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM ncredir3002.esams.wmnet [production]
09:55 <mvernon@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ms-be2052.codfw.wmnet with reason: host reimage [production]
09:52 <mvernon@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on ms-be2052.codfw.wmnet with reason: host reimage [production]
09:42 <elukey@deploy1002> Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 05s) [production]
09:42 <elukey@deploy1002> Started deploy [ores/deploy@98a1b2e]: (no justification provided) [production]
09:41 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ncredir3001.esams.wmnet [production]
09:38 <elukey@deploy1002> Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 32s) [production]
09:38 <elukey@deploy1002> Started deploy [ores/deploy@98a1b2e]: (no justification provided) [production]
09:37 <elukey@deploy1002> Finished deploy [ores/deploy@98a1b2e]: (no justification provided) (duration: 00m 08s) [production]
09:36 <elukey@deploy1002> Started deploy [ores/deploy@98a1b2e]: (no justification provided) [production]
09:35 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM ncredir3001.esams.wmnet [production]
09:30 <marostegui@cumin1001> dbctl commit (dc=all): 'Increase traffic on db1172 to test 10.6 T307546', diff saved to https://phabricator.wikimedia.org/P27768 and previous config saved to /var/cache/conftool/dbconfig/20220509-093032-marostegui.json [production]
09:29 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM ping3002.esams.wmnet [production]
09:25 <mvernon@cumin1001> START - Cookbook sre.hosts.reimage for host ms-be2052.codfw.wmnet with OS bullseye [production]
09:24 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM ping3002.esams.wmnet [production]
09:13 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM prometheus3001.esams.wmnet [production]
09:09 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM prometheus3001.esams.wmnet [production]
08:53 <jelto> mw241[2-9]: scap pull [production]
08:51 <hashar> Gerrit is back and operational [production]
08:47 <hashar> Restarting Gerrit for plugin update [production]
08:45 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM prometheus5001.eqsin.wmnet [production]
08:43 <hashar@deploy1002> Finished deploy [gerrit/gerrit@94c5028]: Update Zuul plugin - T307621 (duration: 00m 07s) [production]
08:43 <hashar@deploy1002> Started deploy [gerrit/gerrit@94c5028]: Update Zuul plugin - T307621 [production]
08:42 <hashar> Restarting Gerrit on replica gerrit2001.wikimedia.org to update the Zuul plugin # T307621 [production]
08:41 <hashar@deploy1002> Finished deploy [gerrit/gerrit@94c5028]: Update Zuul plugin - T307621 (duration: 00m 09s) [production]
08:41 <hashar@deploy1002> Started deploy [gerrit/gerrit@94c5028]: Update Zuul plugin - T307621 [production]
08:41 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM prometheus5001.eqsin.wmnet [production]
08:41 <ladsgroup@cumin1001> conftool action : set/pooled=no; selector: name=elastic2033.codfw.wmnet [production]
08:40 <ladsgroup@cumin1001> conftool action : set/pooled=no; selector: name=ores2002.codfw.wmnet [production]
08:40 <ladsgroup@cumin1001> conftool action : set/pooled=no; selector: name=mw2412.codfw.wmnet [production]
08:30 <dcausse> restarting blazegraph on wdqs1004 (BlazegraphFreeAllocatorsDecreasingRapidly) [production]
08:26 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.reboot-vm (exit_code=0) for VM prometheus4001.ulsfo.wmnet [production]
08:22 <Amir1> restarting confd on puppetmaster100[12] [production]
08:21 <jmm@cumin2002> START - Cookbook sre.ganeti.reboot-vm for VM prometheus4001.ulsfo.wmnet [production]
08:09 <godog> temp stop tegola-swift-container delete - T307184 [production]