1351-1400 of 10000 results (78ms)
2023-01-24 ยง
17:44 <bblack> cp5032: upgrading packages (varnish, trafficserver [production]
17:40 <eevans@cumin1001> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host restbase2020.codfw.wmnet [production]
17:37 <brett@cumin1001> START - Cookbook sre.hosts.reimage for host cp5017.eqsin.wmnet with OS bullseye [production]
17:36 <brett@cumin1001> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host cp5017.eqsin.wmnet with OS bullseye [production]
17:28 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase2020.codfw.wmnet [production]
17:21 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2016.codfw.wmnet [production]
17:19 <thcipriani> restarting ci jenkins for updates [production]
17:13 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase2016.codfw.wmnet [production]
17:13 <eevans@cumin1001> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host restbase2015.codfw.wmnet [production]
17:10 <brett@cumin1001> START - Cookbook sre.hosts.reimage for host cp5017.eqsin.wmnet with OS bullseye [production]
17:04 <eevans@cumin1001> START - Cookbook sre.hosts.reboot-single for host restbase2015.codfw.wmnet [production]
17:04 <urandom> rebooting restbase cassandra nodes, row c -- T325132 [production]
16:29 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
16:29 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
16:28 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc2042.codfw.wmnet with OS bullseye [production]
16:23 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
16:23 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
16:23 <jiji@deploy1002> helmfile [codfw] DONE helmfile.d/services/tegola-vector-tiles: apply [production]
16:23 <jiji@deploy1002> helmfile [codfw] START helmfile.d/services/tegola-vector-tiles: apply [production]
16:22 <jiji@deploy1002> helmfile [codfw] DONE helmfile.d/services/tegola-vector-tiles: apply [production]
16:22 <jiji@deploy1002> helmfile [codfw] START helmfile.d/services/tegola-vector-tiles: apply [production]
16:12 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc2042.codfw.wmnet with reason: host reimage [production]
16:10 <hnowlan@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
16:10 <hnowlan@deploy1002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
16:09 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on mc2042.codfw.wmnet with reason: host reimage [production]
15:54 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
15:53 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host mc2042.codfw.wmnet with OS bullseye [production]
15:43 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
15:31 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
15:30 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: sync on main [production]
15:26 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
15:17 <jgiannelos@deploy1002> Finished deploy [kartotherian/deploy@5c58f8f] (codfw): Disable traffic mirroring from codfw to eqiad (duration: 01m 40s) [production]
15:15 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@5c58f8f] (codfw): Disable traffic mirroring from codfw to eqiad [production]
15:12 <jgiannelos@deploy1002> Finished deploy [kartotherian/deploy@15e6aa7] (codfw): Revert "codfw: Disable traffic mirroring" (duration: 00m 33s) [production]
15:11 <jgiannelos@deploy1002> Started deploy [kartotherian/deploy@15e6aa7] (codfw): Revert "codfw: Disable traffic mirroring" [production]
14:58 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
14:58 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
14:57 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: sync on main [production]
14:55 <jclark@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1001" [production]
14:52 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
14:52 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: sync on main [production]
14:51 <btullis@deploy1002> helmfile [staging] DONE helmfile.d/services/datahub: sync on main [production]
14:41 <btullis@deploy1002> helmfile [staging] START helmfile.d/services/datahub: apply on main [production]
14:41 <jclark@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on druid1010.eqiad.wmnet with reason: host reimage [production]
14:39 <jiji@cumin1001> conftool action : set/pooled=false; selector: dnsdisc=kartotherian,name=eqiad [production]
14:38 <jclark@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on druid1010.eqiad.wmnet with reason: host reimage [production]
14:36 <volans@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:36 <volans@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Force update after switch upgrade - volans@cumin1001" [production]
14:35 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-reverted' for release 'main' . [production]
14:35 <isaranto@deploy1002> helmfile [ml-staging-codfw] Ran 'sync' command on namespace 'revscoring-editquality-goodfaith' for release 'main' . [production]