2019-10-25
ยง
|
14:41 |
<bblack> |
cr[23]-esams: re-route ns2 IP to ganeti3003 |
[production] |
14:36 |
<gehel@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
14:32 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@690f9ae]: deploy netbox scripts (netbox2001) -T223292 (duration: 00m 44s) |
[production] |
14:31 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@690f9ae]: deploy netbox scripts (netbox2001) -T223292 |
[production] |
14:30 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@690f9ae]: deploy netbox scripts (netbox2001) T223292 (duration: 00m 05s) |
[production] |
14:30 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@690f9ae]: deploy netbox scripts (netbox2001) T223292 |
[production] |
14:28 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@690f9ae]: deploy netbox scripts T223292 (duration: 01m 02s) |
[production] |
14:27 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@690f9ae]: deploy netbox scripts T223292 |
[production] |
14:17 |
<gehel@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
14:15 |
<gehel@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
14:10 |
<gehel@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |
14:10 |
<gehel@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
14:09 |
<bblack> |
reboot ganeti3003 |
[production] |
13:57 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:57 |
<ema> |
pool cp4032 with ATS backend T227432 |
[production] |
13:55 |
<bblack@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:48 |
<effie> |
depool mw1334 and pool back |
[production] |
13:30 |
<filippo@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
13:30 |
<ema@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
13:30 |
<filippo@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
13:28 |
<ema@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
13:07 |
<ema@cumin1001> |
conftool action : set/weight=100; selector: name=cp4032.ulsfo.wmnet,service=ats-be |
[production] |
13:05 |
<ema> |
depool cp4032 and reimage as text_ats T227432 |
[production] |
12:34 |
<jynus> |
introducing new freshnesh check for bacula T234900 |
[production] |
12:11 |
<ema> |
pool cp4031 with ATS backend T227432 |
[production] |
10:20 |
<ema@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
10:18 |
<ema@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
10:01 |
<godog> |
swift eqiad-prod: add weight to ms-be105[1-6] - T232367 |
[production] |
09:59 |
<ema@puppetmaster1001> |
conftool action : set/weight=100; selector: name=cp4031.ulsfo.wmnet,service=ats-be |
[production] |
09:56 |
<ema> |
depool cp4031 and reimage as text_ats T227432 |
[production] |
09:39 |
<ema> |
pool cp4030 with ATS backend T227432 |
[production] |
09:22 |
<ema@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
09:21 |
<XioNoX> |
powering off mr1-esams again |
[production] |
09:20 |
<ema@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
09:06 |
<XioNoX> |
going to power down mr1-esams (esams mgmt is going to go down) for 30min the time to move power cables |
[production] |
09:02 |
<jynus> |
disabling persistent journald on db1074 |
[production] |
09:01 |
<ema@cumin1001> |
conftool action : set/weight=100; selector: name=cp4030.ulsfo.wmnet,service=ats-be |
[production] |
08:58 |
<ema> |
depool cp4030 and reimage as text_ats T227432 |
[production] |
08:48 |
<vgutierrez> |
switch from nginx to ats-tls on cp3050 - T231627 |
[production] |
08:45 |
<godog> |
stop prometheus on bast300[24] and done last round of rsync data - T236329 |
[production] |
08:37 |
<ema> |
lvs1015: restart pybal to add labweb-ssl T210411 |
[production] |
08:36 |
<ema> |
test |
[production] |
08:34 |
<ema@cumin1001> |
conftool action : set/pooled=yes; selector: service=labweb-ssl |
[production] |
08:32 |
<ema> |
lvs1016: restart pybal to add labweb-ssl T210411 |
[production] |
08:02 |
<vgutierrez> |
rolling restart of ats-tls to introduce a SSL handshake timeout of 60 secs - T236458 |
[production] |
07:35 |
<akosiaris> |
reboot webperf1002 for disk resize T235455 |
[production] |
07:29 |
<akosiaris> |
reboot webperf2002 for disk resize T235455 |
[production] |
05:58 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
05:56 |
<vgutierrez@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
05:35 |
<vgutierrez> |
reimage lvs3007 to let it get the proper partman configuration - T236294 |
[production] |