251-300 of 10000 results (22ms)
2026-02-09 ยง
11:51 <jmm@cumin2002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=1) for host es1033.eqiad.wmnet [production]
11:46 <fceratto@deploy2002> helmfile [aux-k8s-eqiad] 'sync' command on namespace 'zarcillo' for release 'main' . [production]
11:38 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host es1033.eqiad.wmnet [production]
11:29 <jayme@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host kubestagemaster2003.codfw.wmnet [production]
11:29 <jayme@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host kubestagemaster2003.codfw.wmnet [production]
11:28 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kubestagemaster2003.codfw.wmnet [production]
11:23 <jayme@cumin1003> START - Cookbook sre.hosts.reboot-single for host kubestagemaster2003.codfw.wmnet [production]
11:23 <jayme@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host kubestage2001.codfw.wmnet [production]
11:23 <jayme@cumin1003> START - Cookbook sre.k8s.pool-depool-node pool for host kubestage2001.codfw.wmnet [production]
11:16 <jayme@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host kubestage2001.codfw.wmnet [production]
11:13 <jayme@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host kubestage2001.codfw.wmnet [production]
11:10 <jayme@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host kubestage2001.codfw.wmnet [production]
11:10 <jayme@cumin1003> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host kubestagemaster2003.codfw.wmnet [production]
11:10 <jayme@cumin1003> START - Cookbook sre.k8s.pool-depool-node depool for host kubestagemaster2003.codfw.wmnet [production]
11:08 <jayme@cumin1003> START - Cookbook sre.hosts.reboot-single for host kubestage2001.codfw.wmnet [production]
11:08 <jayme@cumin1003> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host kubestage2001.codfw.wmnet [production]
11:07 <jayme@cumin1003> START - Cookbook sre.hosts.reboot-single for host kubestage2001.codfw.wmnet [production]
11:07 <ayounsi@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aux-k8s-worker1007.eqiad.wmnet with OS bookworm [production]
10:49 <marostegui@cumin1003> END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool es1033: Will be depooled [production]
10:49 <marostegui@cumin1003> START - Cookbook sre.mysql.depool depool es1033: Will be depooled [production]
10:33 <ayounsi@cumin1003> START - Cookbook sre.hosts.reimage for host aux-k8s-worker1007.eqiad.wmnet with OS bookworm [production]
10:32 <ayounsi@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aux-k8s-worker1007.eqiad.wmnet with OS bookworm [production]
10:17 <ayounsi@cumin1003> START - Cookbook sre.hosts.reimage for host aux-k8s-worker1007.eqiad.wmnet with OS bookworm [production]
10:17 <ayounsi@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aux-k8s-worker1007.eqiad.wmnet with OS bookworm [production]
10:15 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on pki-proxy.pki.eqiad1.wikimedia.cloud [pki]
10:14 <dcaro@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on pki-proxy.pki.eqiad1.wikimedia.cloud [pki]
10:13 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on pki-intermediate.pki.eqiad1.wikimedia.cloud [pki]
10:11 <dcaro@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on pki-intermediate.pki.eqiad1.wikimedia.cloud [pki]
10:10 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on pki-root.pki.eqiad1.wikimedia.cloud [pki]
10:09 <dcaro@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on pki-root.pki.eqiad1.wikimedia.cloud [pki]
10:08 <dcaro@cloudcumin1001> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on pki-db.pki.eqiad1.wikimedia.cloud [pki]
10:07 <dcaro@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on pki-db.pki.eqiad1.wikimedia.cloud [pki]
10:00 <ayounsi@cumin1003> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host aux-k8s-worker1007 [production]
10:00 <ayounsi@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host aux-k8s-worker1007 [production]
10:00 <dcaro@cloudcumin1001> END (FAIL) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=99) on pki-pm.pki.eqiad1.wikimedia.cloud [pki]
10:00 <ayounsi@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host aux-k8s-worker1007 [production]
10:00 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) aux-k8s-worker1007.eqiad.wmnet 131.48.64.10.in-addr.arpa 1.3.1.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
10:00 <ayounsi@cumin1003> START - Cookbook sre.dns.wipe-cache aux-k8s-worker1007.eqiad.wmnet 131.48.64.10.in-addr.arpa 1.3.1.0.8.4.0.0.4.6.0.0.0.1.0.0.7.0.1.0.1.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
10:00 <ayounsi@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:00 <ayounsi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host aux-k8s-worker1007 - ayounsi@cumin1003" [production]
10:00 <ayounsi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host aux-k8s-worker1007 - ayounsi@cumin1003" [production]
09:57 <dcaro@cloudcumin1001> START - Cookbook wmcs.vps.refresh_puppet_certs on pki-pm.pki.eqiad1.wikimedia.cloud [pki]
09:55 <ayounsi@cumin1003> START - Cookbook sre.dns.netbox [production]
09:55 <ayounsi@cumin1003> START - Cookbook sre.hosts.move-vlan for host aux-k8s-worker1007 [production]
09:55 <ayounsi@cumin1003> START - Cookbook sre.hosts.reimage for host aux-k8s-worker1007.eqiad.wmnet with OS bookworm [production]
09:49 <phuedx> End of UTC morning backport window [production]
09:48 <phuedx@deploy2002> Finished scap sync-world: Backport for [[gerrit:1237851|metrics(ReviseTone): Use Experiment::send to send metrics (T416612)]], [[gerrit:1237852|metrics(ReviseTone): send consistent experiment exposure event (T416199)]] (duration: 27m 34s) [production]
09:46 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti-test2003.codfw.wmnet [production]
09:45 <dcaro> re-enabling nrpe2nodexp-ferm_active.service on cloudcumins after upgrade (getting stall promfile) [admin]
09:42 <phuedx@deploy2002> phuedx: Continuing with sync [production]