2023-06-20
|
17:44 |
<otto@deploy1002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. |
[production] |
17:44 |
<otto@deploy1002> |
helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. |
[production] |
17:13 |
<brett@cumin2002> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-text_esams |
[production] |
16:59 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:55 |
<brett@cumin2002> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp3053.esams.wmnet,cp3055.esams.wmnet,cp3057.esams.wmnet,cp3059.esams.wmnet,cp3061.esams.wmnet,cp3063.esams.wmnet,cp3065.esams.wmnet} and A:cp |
[production] |
16:52 |
<bking@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
16:52 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:52 |
<bking@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
16:52 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:49 |
<brett@cumin2002> |
END (ERROR) - Cookbook sre.cdn.roll-reboot (exit_code=97) rolling reboot on P{cp3053.esams.wmnet,cp3055.esams.wmnet,cp3057.esams.wmnet,cp3059.esams.wmnet,cp3061.esams.wmnet,cp3063.esams.wmnet,cp3065.esams.wmnet} and A:cp |
[production] |
16:49 |
<brett@cumin2002> |
START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp3053.esams.wmnet,cp3055.esams.wmnet,cp3057.esams.wmnet,cp3059.esams.wmnet,cp3061.esams.wmnet,cp3063.esams.wmnet,cp3065.esams.wmnet} and A:cp |
[production] |
16:44 |
<bking@cumin1001> |
END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) |
[production] |
16:44 |
<bking@cumin1001> |
START - Cookbook sre.wdqs.data-transfer |
[production] |
16:28 |
<otto@deploy1002> |
Synchronized wmf-config/ext-EventStreamConfig.php: wgEventStreams - remove unused rc stream names for page_change related streams - T336817 (duration: 07m 35s) |
[production] |
16:21 |
<sukhe> |
sudo cumin 'A:cp' 'enable-puppet "merging CR 931626"' |
[production] |
16:17 |
<otto@deploy1002> |
Synchronized wmf-config/InitialiseSettings.php: wgEventBusStreamNamesMap - Remove page_change stream name override - T336817 (duration: 07m 42s) |
[production] |
16:14 |
<sukhe> |
sudo cumin 'A:cp' 'disable-puppet "merging CR 931626"' |
[production] |
16:09 |
<aokoth@cumin1001> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on vrts2001.codfw.wmnet with reason: Setup Incomplete |
[production] |
16:09 |
<aokoth@cumin1001> |
START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on vrts2001.codfw.wmnet with reason: Setup Incomplete |
[production] |
15:25 |
<moritzm> |
installing unbound security updates |
[production] |
15:14 |
<klausman@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/changeprop: apply |
[production] |
15:13 |
<klausman@deploy1002> |
helmfile [eqiad] START helmfile.d/services/changeprop: apply |
[production] |
15:13 |
<klausman@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/changeprop: apply |
[production] |
15:13 |
<klausman@deploy1002> |
helmfile [codfw] START helmfile.d/services/changeprop: apply |
[production] |
14:55 |
<bking@deploy1002> |
helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
14:55 |
<bking@deploy1002> |
helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply |
[production] |
14:42 |
<mvernon@cumin1001> |
END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe |
[production] |
14:36 |
<mvernon@cumin1001> |
START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe |
[production] |
14:36 |
<arturo> |
homer run for CR eqiad/codfw to allow bacula traffic in from cloud-hosts (T338132, T339894) |
[production] |
14:27 |
<hnowlan@deploy1002> |
helmfile [codfw] DONE helmfile.d/services/api-gateway: apply |
[production] |
14:26 |
<hnowlan@deploy1002> |
helmfile [codfw] START helmfile.d/services/api-gateway: apply |
[production] |
14:26 |
<hnowlan@deploy1002> |
helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply |
[production] |
14:26 |
<hnowlan@deploy1002> |
helmfile [eqiad] START helmfile.d/services/api-gateway: apply |
[production] |
14:24 |
<hnowlan@deploy1002> |
helmfile [staging] DONE helmfile.d/services/api-gateway: apply |
[production] |
14:24 |
<hnowlan@deploy1002> |
helmfile [staging] START helmfile.d/services/api-gateway: apply |
[production] |
14:18 |
<jclark@cumin1001> |
END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host parse1002.eqiad.wmnet |
[production] |
14:16 |
<hnowlan@puppetmaster1001> |
conftool action : set/weight=10; selector: service=thumbor,name=kubernetes20[12][0-9].codfw.wmnet |
[production] |
14:15 |
<hnowlan@puppetmaster1001> |
conftool action : set/weight=10; selector: service=thumbor,name=kubernetes10[12][0-9].eqiad.wmnet |
[production] |
14:15 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes202[0-9].codfw.wmnet |
[production] |
14:15 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes201[0-9].codfw.wmnet |
[production] |
14:15 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes102[0-9].eqiad.wmnet |
[production] |
14:15 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes101[0-9].eqiad.wmnet |
[production] |
14:14 |
<hnowlan@puppetmaster1001> |
conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes1*.eqiad.wmnet |
[production] |
14:11 |
<vgutierrez@cumin1001> |
END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp4044.ulsfo.wmnet,cp4051.ulsfo.wmnet} and A:cp |
[production] |
14:07 |
<vgutierrez@cumin1001> |
START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp4044.ulsfo.wmnet,cp4051.ulsfo.wmnet} and A:cp |
[production] |
14:06 |
<vgutierrez> |
test HAProxy 2.6.14 on cp4044 and cp4051 |
[production] |
14:03 |
<vgutierrez> |
fetch HAProxy 2.6.14 on thirdparty/haproxy26 for bullseye (apt.wm.o) |
[production] |
13:22 |
<vgutierrez> |
repooling cp3050 - T339898 |
[production] |
13:22 |
<isaranto@deploy1002> |
helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . |
[production] |
13:18 |
<moritzm> |
installing python2.7 security updates |
[production] |