2023-08-23
17:47 <denisse> make alert2001 the active host [production]
17:31 <denisse> failing over alert1001 to alert2001 [production]
17:24 <brett@cumin2002> START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-text_codfw and A:cp [production]
17:24 <brett@cumin2002> START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-upload_codfw and A:cp [production]
17:23 <pt1979@cumin2002> START - Cookbook sre.hosts.reimage for host kubernetes2053.codfw.wmnet with OS bullseye [production]
17:23 <brett@cumin2002> END (ERROR) - Cookbook sre.cdn.roll-reboot (exit_code=97) rolling reboot on A:cp-upload_eqiad and A:cp [production]
17:23 <brett@cumin2002> END (ERROR) - Cookbook sre.cdn.roll-reboot (exit_code=97) rolling reboot on A:cp-text_eqiad and A:cp [production]
17:22 <brett@cumin2002> START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-text_eqiad and A:cp [production]
17:22 <brett@cumin2002> START - Cookbook sre.cdn.roll-reboot rolling reboot on A:cp-upload_eqiad and A:cp [production]
17:20 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:20 <pt1979@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add DNS entries for kubernetes2040-kubernetes2052 - pt1979@cumin2002" [production]
17:19 <pt1979@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add DNS entries for kubernetes2040-kubernetes2052 - pt1979@cumin2002" [production]
17:19 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/geo-analytics: apply [production]
17:19 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/geo-analytics: apply [production]
17:17 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
17:10 <pt1979@cumin2002> END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['kubernetes2053'] [production]
17:07 <denisse@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host alert2001.wikimedia.org [production]
17:07 <denisse@cumin1001> START - Cookbook sre.hosts.reboot-single for host alert2001.wikimedia.org [production]
17:06 <denisse> reboot alert2001 for a kernel upgrade [production]
17:05 <herron> set icinga downtime on wikitech-static [production]
17:03 <bking@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on wdqs1009.eqiad.wmnet with reason: jnl export [production]
17:03 <bking@cumin1001> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on wdqs1009.eqiad.wmnet with reason: jnl export [production]
17:00 <pt1979@cumin2002> START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['kubernetes2053'] [production]
16:56 <pt1979@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kubernetes2053.mgmt.codfw.wmnet with reboot policy FORCED [production]
16:45 <pt1979@cumin2002> START - Cookbook sre.hosts.provision for host kubernetes2053.mgmt.codfw.wmnet with reboot policy FORCED [production]
16:45 <pt1979@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:43 <pt1979@cumin2002> START - Cookbook sre.dns.netbox [production]
16:43 <pt1979@cumin2002> END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) [production]
16:43 <pt1979@cumin2002> END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add DNS entries for kubernetes2053 - pt1979@cumin2002" [production]
16:37 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/geo-analytics: apply [production]
16:35 <bblack> cp3067-81 - rolling restart of varnish frontends (one at a time, 30 minute sleep between, will run for ~7.5h), for experimental cache memory settings from https://gerrit.wikimedia.org/r/c/operations/puppet/+/951949 [production]
16:27 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/geo-analytics: apply [production]
16:25 <hnowlan@deploy1002> helmfile [eqiad] DONE helmfile.d/admin 'apply'. [production]
16:24 <hnowlan@deploy1002> helmfile [eqiad] START helmfile.d/admin 'apply'. [production]
16:17 <jiji@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=codfw [production]
16:17 <effie> depool maps/kartotherian codfw [production]
16:10 <hnowlan@deploy1002> helmfile [codfw] DONE helmfile.d/admin 'apply'. [production]
16:09 <hnowlan@deploy1002> helmfile [codfw] START helmfile.d/admin 'apply'. [production]
16:09 <fabfur@cumin1001> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on A:cp-upload_eqiad and A:cp [production]
16:08 <hnowlan@deploy1002> helmfile [staging-codfw] DONE helmfile.d/admin 'apply'. [production]
16:07 <fabfur@cumin1001> END (PASS) - Cookbook sre.cdn.roll-reboot (exit_code=0) rolling reboot on A:cp-text_eqiad and A:cp [production]
16:07 <jclark@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:07 <hnowlan@deploy1002> helmfile [staging-codfw] START helmfile.d/admin 'apply'. [production]
16:06 <jclark@cumin1001> START - Cookbook sre.dns.netbox [production]
16:06 <hnowlan@deploy1002> helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'. [production]
16:05 <hnowlan@deploy1002> helmfile [staging-eqiad] START helmfile.d/admin 'apply'. [production]
15:57 <bblack> cp3066 - varnish-frontend-restart for new memory params experiment [production]
15:55 <effie> pooled codfw kartotherian/maps [production]
15:54 <jiji@cumin1001> conftool action : set/pooled=true; selector: dnsdisc=kartotherian,name=codfw [production]
15:44 <jiji@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=codfw [production]