101-150 of 10000 results (114ms)
2026-06-09 ยง
18:43 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host dse-k8s-wdqs2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:42 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dse-k8s-wdqs2002.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:42 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host dse-k8s-wdqs2002.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:30 <dduvall@deploy1003> rebuilt and synchronized wikiversions files: group0 to 1.47.0-wmf.6 refs T423915 [production]
18:29 <jasmine@cumin2002> START - Cookbook sre.hosts.reimage for host kafka-main2008.codfw.wmnet with OS trixie [production]
18:26 <jasmine@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kafka-main2008.codfw.wmnet with OS trixie [production]
17:48 <mutante> https://releases.wikimedia.org | https://releases-jenkins.wikimedia.org - down for maintenance T418299 [production]
17:48 <cmooney@dns2005> END - running authdns-update [production]
17:47 <dzahn@cumin2002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on releases2003.codfw.wmnet with reason: reimage [production]
17:47 <cmooney@dns2005> START - running authdns-update [production]
17:46 <sukhe> sudo cumin 'A:hcaptcha-proxy' 'run-puppet-agent': rolling out CR 1299427 T428539 [production]
17:43 <jayme> kafka-main2008 is down due to hardware failure T428654 [production]
17:32 <blake@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc-wf1002.eqiad.wmnet with OS trixie [production]
17:14 <blake@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc-wf1002.eqiad.wmnet with reason: host reimage [production]
17:06 <blake@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on mc-wf1002.eqiad.wmnet with reason: host reimage [production]
17:05 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host kafka-main2008 [production]
17:05 <jasmine@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host kafka-main2008 [production]
17:04 <cjming@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/test-kitchen: apply [production]
17:04 <jasmine@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host kafka-main2008 [production]
17:04 <jasmine@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) kafka-main2008.codfw.wmnet 4.32.192.10.in-addr.arpa 4.0.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
17:04 <cjming@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/test-kitchen: apply [production]
17:04 <jasmine@cumin2002> START - Cookbook sre.dns.wipe-cache kafka-main2008.codfw.wmnet 4.32.192.10.in-addr.arpa 4.0.0.0.2.3.0.0.2.9.1.0.0.1.0.0.3.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
17:04 <jasmine@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:04 <jasmine@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host kafka-main2008 - jasmine@cumin2002" [production]
17:04 <brett@cumin2002> START - Cookbook sre.hosts.move-vlan for host cp5018 [production]
17:04 <jasmine@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host kafka-main2008 - jasmine@cumin2002" [production]
17:03 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host cp5018.eqsin.wmnet with OS trixie [production]
16:58 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
16:58 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
16:57 <jasmine@cumin2002> START - Cookbook sre.dns.netbox [production]
16:57 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/rest-gateway: apply [production]
16:57 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/rest-gateway: apply [production]
16:53 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-feature-counts-change-enrich: apply [production]
16:52 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-feature-counts-change-enrich: apply [production]
16:50 <blake@cumin1003> START - Cookbook sre.hosts.reimage for host mc-wf1002.eqiad.wmnet with OS trixie [production]
16:48 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
16:47 <blake@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc-wf1001.eqiad.wmnet with OS trixie [production]
16:47 <jiji@deploy1003> helmfile [aux-k8s-eqiad] DONE helmfile.d/aux-k8s-services/redioscope: apply [production]
16:47 <jiji@deploy1003> helmfile [aux-k8s-eqiad] START helmfile.d/aux-k8s-services/redioscope: apply [production]
16:47 <javiermonton@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mw-page-html-content-change-enrich: apply [production]
16:41 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/ratelimit: apply [production]
16:41 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/ratelimit: apply [production]
16:35 <jasmine@cumin2002> START - Cookbook sre.hosts.move-vlan for host kafka-main2008 [production]
16:34 <jasmine@cumin2002> START - Cookbook sre.hosts.reimage for host kafka-main2008.codfw.wmnet with OS trixie [production]
16:31 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
16:31 <trueg@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/services/wdqs: apply [production]
16:31 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
16:31 <jiji@deploy1003> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
16:30 <jiji@deploy1003> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
16:30 <blake@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc-wf1001.eqiad.wmnet with reason: host reimage [production]