2023-06-29
16:38 <jiji@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kubestagemaster2002.codfw.wmnet with reason: host reimage [production]
16:35 <jiji@cumin1001> START - Cookbook sre.hosts.downtime for 2:00:00 on kubestagemaster2002.codfw.wmnet with reason: host reimage [production]
16:22 <klausman@deploy1002> helmfile [eqiad] DONE helmfile.d/services/changeprop: apply [production]
16:21 <klausman@deploy1002> helmfile [eqiad] START helmfile.d/services/changeprop: apply [production]
16:18 <mutante> releases1003 - re-enabling puppet after recent webserver debugging [production]
16:18 <jiji@cumin1001> START - Cookbook sre.hosts.reimage for host kubestagemaster2002.codfw.wmnet with OS bullseye [production]
16:17 <jiji@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM kubestagemaster2002.codfw.wmnet - jiji@cumin1001" [production]
16:16 <jiji@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM kubestagemaster2002.codfw.wmnet - jiji@cumin1001" [production]
16:16 <jiji@cumin1001> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) kubestagemaster2002.codfw.wmnet on all recursors [production]
16:16 <jiji@cumin1001> START - Cookbook sre.dns.wipe-cache kubestagemaster2002.codfw.wmnet on all recursors [production]
16:16 <jiji@cumin1001> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:16 <jiji@cumin1001> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM kubestagemaster2002.codfw.wmnet - jiji@cumin1001" [production]
16:12 <fabfur@cumin1001> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-text_drmrs and A:cp [production]
16:11 <fabfur@cumin1001> END (PASS) - Cookbook sre.cdn.roll-upgrade-haproxy (exit_code=0) rolling upgrade of HAProxy on A:cp-upload_drmrs and A:cp [production]
16:10 <sukhe> systemctl restart bird.service on doh2002 [production]
16:04 <klausman@deploy1002> helmfile [codfw] DONE helmfile.d/services/changeprop: apply [production]
16:04 <klausman@deploy1002> helmfile [codfw] START helmfile.d/services/changeprop: apply [production]
16:04 <klausman@deploy1002> helmfile [staging] DONE helmfile.d/services/changeprop: apply [production]
16:03 <klausman@deploy1002> helmfile [staging] START helmfile.d/services/changeprop: apply [production]
16:03 <jiji@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM kubestagemaster2002.codfw.wmnet - jiji@cumin1001" [production]
15:59 <jiji@cumin1001> START - Cookbook sre.dns.netbox [production]
15:59 <jiji@cumin1001> START - Cookbook sre.ganeti.makevm for new host kubestagemaster2002.codfw.wmnet [production]
15:49 <fabfur@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-upload_drmrs and A:cp [production]
15:49 <fabfur@cumin1001> START - Cookbook sre.cdn.roll-upgrade-haproxy rolling upgrade of HAProxy on A:cp-text_drmrs and A:cp [production]
15:49 <elukey@deploy1002> helmfile [ml-serve-eqiad] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
15:48 <elukey@deploy1002> helmfile [ml-serve-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
15:47 <elukey@deploy1002> helmfile [ml-staging-codfw] 'sync' command on namespace 'ores-legacy' for release 'main' . [production]
15:35 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw1486.eqiad.wmnet [production]
15:35 <cgoubert@cumin1001> START - Cookbook sre.hosts.remove-downtime for mw1486.eqiad.wmnet [production]
15:34 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw1485.eqiad.wmnet [production]
15:34 <cgoubert@cumin1001> START - Cookbook sre.hosts.remove-downtime for mw1485.eqiad.wmnet [production]
15:34 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw1484.eqiad.wmnet [production]
15:34 <cgoubert@cumin1001> START - Cookbook sre.hosts.remove-downtime for mw1484.eqiad.wmnet [production]
15:34 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw1483.eqiad.wmnet [production]
15:34 <cgoubert@cumin1001> START - Cookbook sre.hosts.remove-downtime for mw1483.eqiad.wmnet [production]
15:34 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for mw1482.eqiad.wmnet [production]
15:34 <cgoubert@cumin1001> START - Cookbook sre.hosts.remove-downtime for mw1482.eqiad.wmnet [production]
15:30 <claime> Pooled mw148[2-6].eqiad.wmnet as jobrunners - T329366 [production]
15:29 <cgoubert@cumin1001> conftool action : set/pooled=yes; selector: name=mw148[2-6].eqiad.wmnet,cluster=jobrunner [production]
15:27 <cgoubert@cumin1001> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - cgoubert@cumin1001" [production]
15:25 <cgoubert@cumin1001> conftool action : set/pooled=no; selector: name=mw148[2-6].eqiad.wmnet [production]
15:24 <cgoubert@cumin1001> conftool action : set/weight=10; selector: name=mw148[2-6].eqiad.wmnet [production]
15:23 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1484.eqiad.wmnet with OS buster [production]
15:21 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1485.eqiad.wmnet with OS buster [production]
15:19 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1483.eqiad.wmnet with OS buster [production]
15:16 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mw1482.eqiad.wmnet with OS buster [production]
15:16 <moritzm> installing Java 8 security updates on sessionstore/codfw [production]
15:06 <Daimona> Creating new DB tables for the CampaignEvents extension in x1.testwiki, x1.test2wiki, x1.officewiki, and x1.wikishared # T340000 [production]
14:54 <cgoubert@cumin1001> END (FAIL) - Cookbook sre.hosts.downtime (exit_code=99) for 2:00:00 on mw1486.eqiad.wmnet with reason: host reimage [production]
14:53 <cgoubert@cumin1001> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1484.eqiad.wmnet with reason: host reimage [production]