651-700 of 10000 results (73ms)
2023-06-20 ยง
16:52 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) [production]
16:52 <bking@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
16:52 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) [production]
16:52 <bking@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
16:49 <brett@cumin2002> END (ERROR) - Cookbook sre.cdn.roll-reboot (exit_code=97) rolling reboot on P{cp3053.esams.wmnet,cp3055.esams.wmnet,cp3057.esams.wmnet,cp3059.esams.wmnet,cp3061.esams.wmnet,cp3063.esams.wmnet,cp3065.esams.wmnet} and A:cp [production]
16:49 <brett@cumin2002> START - Cookbook sre.cdn.roll-reboot rolling reboot on P{cp3053.esams.wmnet,cp3055.esams.wmnet,cp3057.esams.wmnet,cp3059.esams.wmnet,cp3061.esams.wmnet,cp3063.esams.wmnet,cp3065.esams.wmnet} and A:cp [production]
16:44 <bking@cumin1001> END (FAIL) - Cookbook sre.wdqs.data-transfer (exit_code=99) [production]
16:44 <bking@cumin1001> START - Cookbook sre.wdqs.data-transfer [production]
16:28 <otto@deploy1002> Synchronized wmf-config/ext-EventStreamConfig.php: wgEventStreams - remove unused rc stream names for page_change related streams - T336817 (duration: 07m 35s) [production]
16:21 <sukhe> sudo cumin 'A:cp' 'enable-puppet "merging CR 931626"' [production]
16:17 <otto@deploy1002> Synchronized wmf-config/InitialiseSettings.php: wgEventBusStreamNamesMap - Remove page_change stream name override - T336817 (duration: 07m 42s) [production]
16:14 <sukhe> sudo cumin 'A:cp' 'disable-puppet "merging CR 931626"' [production]
16:09 <aokoth@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 30 days, 0:00:00 on vrts2001.codfw.wmnet with reason: Setup Incomplete [production]
16:09 <aokoth@cumin1001> START - Cookbook sre.hosts.downtime for 30 days, 0:00:00 on vrts2001.codfw.wmnet with reason: Setup Incomplete [production]
15:25 <moritzm> installing unbound security updates [production]
15:14 <klausman@deploy1002> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
15:13 <klausman@deploy1002> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
15:13 <klausman@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
15:13 <klausman@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
14:55 <bking@deploy1002> helmfile [staging] DONE helmfile.d/services/mw-page-content-change-enrich: apply [production]
14:55 <bking@deploy1002> helmfile [staging] START helmfile.d/services/mw-page-content-change-enrich: apply [production]
14:42 <mvernon@cumin1001> END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies (exit_code=0) rolling restart_daemons on A:thanos-fe [production]
14:36 <mvernon@cumin1001> START - Cookbook sre.swift.roll-restart-reboot-swift-thanos-proxies rolling restart_daemons on A:thanos-fe [production]
14:36 <arturo> homer run for CR eqiad/codfw to allow bacula traffic in from cloud-hosts (T338132, T339894) [production]
14:27 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/services/api-gateway: apply [production]
14:26 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/services/api-gateway: apply [production]
14:26 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/services/api-gateway: apply [production]
14:26 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/services/api-gateway: apply [production]
14:24 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/api-gateway: apply [production]
14:24 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/api-gateway: apply [production]
14:18 <jclark@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host parse1002.eqiad.wmnet [production]
14:16 <hnowlan@puppetmaster1001> conftool action : set/weight=10; selector: service=thumbor,name=kubernetes20[12][0-9].codfw.wmnet [production]
14:15 <hnowlan@puppetmaster1001> conftool action : set/weight=10; selector: service=thumbor,name=kubernetes10[12][0-9].eqiad.wmnet [production]
14:15 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes202[0-9].codfw.wmnet [production]
14:15 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes201[0-9].codfw.wmnet [production]
14:15 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes102[0-9].eqiad.wmnet [production]
14:15 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes101[0-9].eqiad.wmnet [production]
14:14 <hnowlan@puppetmaster1001> conftool action : set/pooled=yes; selector: service=thumbor,name=kubernetes1*.eqiad.wmnet [production]
14:11 <vgutierrez@cumin1001> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on P{cp4044.ulsfo.wmnet,cp4051.ulsfo.wmnet} and A:cp [production]
14:07 <vgutierrez@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on P{cp4044.ulsfo.wmnet,cp4051.ulsfo.wmnet} and A:cp [production]
14:06 <vgutierrez> test HAProxy 2.6.14 on cp4044 and cp4051 [production]
14:03 <vgutierrez> fetch HAProxy 2.6.14 on thirdparty/haproxy26 for bullseye (apt.wm.o) [production]
13:22 <vgutierrez> repooling cp3050 - T339898 [production]
13:22 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . [production]
13:18 <moritzm> installing python2.7 security updates [production]
13:15 <aokoth@cumin1001> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts otrs1001.eqiad.wmnet [production]
13:15 <aokoth@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
13:15 <aokoth@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: otrs1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - aokoth@cumin1001" [production]
13:14 <urbanecm@deploy1002> Finished scap: Backport for [[gerrit:930189|Enable Extension:Translate on pt.wikisource.org (T339139)]] (duration: 09m 11s) [production]
13:13 <aokoth@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: otrs1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - aokoth@cumin1001" [production]