2019-10-25
§
|
23:41 |
<bstorm_> |
Deployed custom webhook controllers for registry and ingress checking to toolsbeta-test kubernetes cluster T215531 T215678 T234231 |
[toolsbeta] |
22:54 |
<mutante> |
moscovium rm /dev/shm/envoy_shared_memory_0 to revive envoy which failed to run after changing ports and reinstalling it (T180641) |
[production] |
22:42 |
<mutante> |
moscovium - manually deleting envoy listener on 1443 and letting puppet recreate config because it's not removed if you change the port (T180641) |
[production] |
21:55 |
<mutante> |
running puppet on ulsfo cp-ats servers to pick up config change for RT backend |
[production] |
21:06 |
<wm-bot> |
<lucaswerkmeister> deployed temporary experiment (clear session on OAuth callback error) |
[tools.wd-image-positions] |
20:59 |
<paladox> |
created cloud/instance-puppet-dev gerrit repo per andrewbogott |
[releng] |
20:42 |
<twentyafterfour@deploy1001> |
Finished deploy [design/style-guide@c69242e]: deploying design/style-guide for demonstration purposes (duration: 00m 06s) |
[production] |
20:41 |
<twentyafterfour@deploy1001> |
Started deploy [design/style-guide@c69242e]: deploying design/style-guide for demonstration purposes |
[production] |
20:04 |
<twentyafterfour@deploy1001> |
Finished deploy [design/style-guide@c69242e]: test deploy design/style-guide (duration: 00m 10s) |
[production] |
20:04 |
<twentyafterfour@deploy1001> |
Started deploy [design/style-guide@c69242e]: test deploy design/style-guide |
[production] |
18:30 |
<brennen> |
Updating docker-pkg files on contint1001 for T236333 |
[releng] |
17:49 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
17:47 |
<bblack@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
17:26 |
<bblack> |
lvs3005 - reimaging to fix partman issue, high-traffic1 (text) to lvs3007 for the duration |
[production] |
16:47 |
<bd808> |
Reverted local hack that disabled /api/ route (T236423) |
[tools.deadlinks] |
16:43 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) |
[production] |
16:40 |
<bblack@cumin1001> |
START - Cookbook sre.hosts.downtime |
[production] |
16:19 |
<bblack> |
lvs3006 - reimaging to fix partman issue, high-traffic2 (upload/maps) to lvs3007 for the duration |
[production] |
16:19 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@0f4c92d]: deploy netbox scripts update (netbox1001) T223292 (duration: 13m 31s) |
[production] |
16:15 |
<bstorm_> |
rebooting toolsbeta-test-k8s-worker-1 and -2 |
[toolsbeta] |
16:05 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@0f4c92d]: deploy netbox scripts update (netbox1001) T223292 |
[production] |
16:04 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@0f4c92d]: deploy netbox scripts update (netbox2001) T223292 (duration: 00m 43s) |
[production] |
16:04 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@0f4c92d]: deploy netbox scripts update (netbox2001) T223292 |
[production] |
15:40 |
<hauskater> |
Dropped dupe outreach.wikipedia from CVNBot7, left on CVNBot19 |
[cvn] |
15:35 |
<robh> |
ps1-oe14-esams ip info set, rebooting (wont affect servers) via T184066 |
[production] |
15:22 |
<tgr> |
made wikispore-prod temporarily its own puppetmaster to work around T236455 |
[wikispore] |
15:10 |
<wm-bot> |
<lucaswerkmeister> deployed 7a4982705c (improve headings) |
[tools.wd-image-positions] |
15:03 |
<gehel@cumin1001> |
END (ERROR) - Cookbook sre.wdqs.data-reload (exit_code=97) |
[production] |
15:02 |
<wm-bot> |
<jeanfred> Deploy latest from Git master: 82de2957 (T224212, T223930) |
[tools.integraality] |
15:01 |
<bblack@cumin1001> |
END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) |
[production] |
15:00 |
<bblack@cumin1001> |
START - Cookbook sre.hosts.decommission |
[production] |
14:51 |
<wm-bot> |
<lucaswerkmeister> deployed 2d28f55912 (optimize cropper image loading) |
[tools.wd-image-positions] |
14:41 |
<bblack> |
cr[23]-esams: re-route ns2 IP to ganeti3003 |
[production] |
14:36 |
<gehel@cumin1001> |
START - Cookbook sre.wdqs.data-reload |
[production] |
14:32 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@690f9ae]: deploy netbox scripts (netbox2001) -T223292 (duration: 00m 44s) |
[production] |
14:31 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@690f9ae]: deploy netbox scripts (netbox2001) -T223292 |
[production] |
14:30 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@690f9ae]: deploy netbox scripts (netbox2001) T223292 (duration: 00m 05s) |
[production] |
14:30 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@690f9ae]: deploy netbox scripts (netbox2001) T223292 |
[production] |
14:28 |
<crusnov@deploy1001> |
Finished deploy [netbox/deploy@690f9ae]: deploy netbox scripts T223292 (duration: 01m 02s) |
[production] |
14:27 |
<crusnov@deploy1001> |
Started deploy [netbox/deploy@690f9ae]: deploy netbox scripts T223292 |
[production] |
14:24 |
<jeh> |
`systemctl reset-failed` on labweb1001 and labweb1002 to cleanup hhvm service |
[openstack] |
14:17 |
<gehel@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-reload (exit_code=99) |
[production] |