1-50 of 10000 results (100ms)
2026-06-16 ยง
19:27 <jasmine@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
19:18 <topranks> restarting grpc server on eqiad SR-Linux switches to recover from problem of no free threads T429242 [production]
19:08 <jasmine@cumin2002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
19:08 <robh@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
19:02 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
19:00 <krinkle@deploy1003> Finished scap sync-world: Backport for [[gerrit:1302274|Disable ShortUrl on hiwiki, hiwikiversity, maiwiki, knwiki, knwikisource, tcywiki (T107188)]] (duration: 11m 18s) [production]
18:58 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
18:56 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
18:56 <jasmine@cumin2002> START - Cookbook sre.hosts.provision for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:55 <krinkle@deploy1003> krinkle: Continuing with deployment [production]
18:52 <jasmine@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:51 <krinkle@deploy1003> krinkle: Backport for [[gerrit:1302274|Disable ShortUrl on hiwiki, hiwikiversity, maiwiki, knwiki, knwikisource, tcywiki (T107188)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. [production]
18:48 <krinkle@deploy1003> Started scap sync-world: Backport for [[gerrit:1302274|Disable ShortUrl on hiwiki, hiwikiversity, maiwiki, knwiki, knwikisource, tcywiki (T107188)]] [production]
18:45 <jasmine@cumin2002> START - Cookbook sre.hosts.provision for host wikikube-ctrl2006.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
18:44 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on dns5004.wikimedia.org with reason: host reimage [production]
18:41 <eevans@deploy1003> helmfile [codfw] DONE helmfile.d/services/data-gateway: apply [production]
18:41 <eevans@deploy1003> helmfile [codfw] START helmfile.d/services/data-gateway: apply [production]
18:41 <brett@cumin2002> START - Cookbook sre.hosts.downtime for 2:00:00 on dns5004.wikimedia.org with reason: host reimage [production]
18:40 <eevans@deploy1003> helmfile [eqiad] DONE helmfile.d/services/data-gateway: apply [production]
18:39 <eevans@deploy1003> helmfile [eqiad] START helmfile.d/services/data-gateway: apply [production]
18:39 <eevans@deploy1003> helmfile [staging] DONE helmfile.d/services/data-gateway: apply [production]
18:39 <eevans@deploy1003> helmfile [staging] START helmfile.d/services/data-gateway: apply [production]
18:35 <trueg@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/services/wdqs: apply [production]
18:34 <robh@cumin2002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
18:33 <robh@cumin2002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
18:30 <robh@cumin2002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2006.codfw.wmnet with OS trixie [production]
18:23 <jhuneidi@deploy1003> rebuilt and synchronized wikiversions files: group0 to 1.47.0-wmf.7 refs T423916 [production]
18:12 <btullis@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
18:12 <brett@cumin2002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host dns5004 [production]
18:12 <brett@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host dns5004 [production]
18:08 <brett@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host dns5004 [production]
18:08 <brett@cumin2002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) dns5004.wikimedia.org 8.166.102.103.in-addr.arpa 8.0.0.0.6.6.1.0.2.0.1.0.3.0.1.0.1.0.0.0.0.0.5.e.2.f.d.0.1.0.0.2.ip6.arpa on all recursors [production]
18:08 <brett@cumin2002> START - Cookbook sre.dns.wipe-cache dns5004.wikimedia.org 8.166.102.103.in-addr.arpa 8.0.0.0.6.6.1.0.2.0.1.0.3.0.1.0.1.0.0.0.0.0.5.e.2.f.d.0.1.0.0.2.ip6.arpa on all recursors [production]
18:08 <brett@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
18:08 <brett@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host dns5004 - brett@cumin2002" [production]
18:08 <brett@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host dns5004 - brett@cumin2002" [production]
18:02 <btullis@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
18:00 <brett@cumin2002> START - Cookbook sre.dns.netbox [production]
18:00 <btullis@deploy1003> helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'. [production]
17:59 <btullis@deploy1003> helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'. [production]
17:53 <brett@puppetserver1001> conftool action : set/pooled=no; selector: name=dns5004.* [production]
17:47 <cmooney@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:47 <cmooney@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: change mgmt name for frproto1001 - cmooney@cumin1003" [production]
17:46 <brett@cumin2002> START - Cookbook sre.hosts.move-vlan for host dns5004 [production]
17:46 <brett@cumin2002> START - Cookbook sre.hosts.reimage for host dns5004.wikimedia.org with OS bookworm [production]
17:44 <cmooney@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: change mgmt name for frproto1001 - cmooney@cumin1003" [production]
17:43 <elukey@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host conf2007.codfw.wmnet with OS trixie [production]
17:43 <dreamyjazz@deploy1003> Finished scap sync-world: Backport for [[gerrit:1302912|Revert^2 "hCaptcha: Enable for UploadWizard on all wikis with it"]], [[gerrit:1302909|PublishCaptchaHandler: Only require CAPTCHA for UploadWizard (T429322)]], [[gerrit:1302908|PublishCaptchaHandler: Only require CAPTCHA for UploadWizard (T429322)]] (duration: 32m 19s) [production]
17:38 <cmooney@cumin1003> START - Cookbook sre.dns.netbox [production]
17:30 <dreamyjazz@deploy1003> dreamyjazz: Continuing with deployment [production]