1-50 of 10000 results (22ms)
2025-12-23 ยง
12:43 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:43 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
12:43 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:42 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
12:42 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
12:41 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply [production]
12:27 <btullis@deploy2002> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/analytics-test: apply [production]
12:27 <btullis@deploy2002> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/analytics-test: apply [production]
11:57 <vriley@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1363.eqiad.wmnet with OS trixie [production]
11:16 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1361.eqiad.wmnet with OS trixie [production]
11:16 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
11:15 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
11:12 <cgoubert@deploy2002> helmfile [staging] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:12 <cgoubert@deploy2002> helmfile [staging] START helmfile.d/services/changeprop-jobqueue: apply [production]
11:11 <cgoubert@deploy2002> helmfile [codfw] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:09 <cgoubert@deploy2002> helmfile [codfw] START helmfile.d/services/changeprop-jobqueue: apply [production]
11:09 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/changeprop-jobqueue: apply [production]
11:08 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/changeprop-jobqueue: apply [production]
10:59 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1361.eqiad.wmnet with reason: host reimage [production]
10:56 <dhinus> hard-reboot paws-127c-uwce57bvcgrt-node-1 (reporting NotReady in kubectl) [paws]
10:53 <vriley@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1361.eqiad.wmnet with reason: host reimage [production]
10:42 <vriley@cumin1003> START - Cookbook sre.hosts.reimage for host wikikube-worker1361.eqiad.wmnet with OS trixie [production]
10:37 <vriley@cumin1003> START - Cookbook sre.hosts.reimage for host wikikube-worker1363.eqiad.wmnet with OS trixie [production]
09:37 <vriley@cumin1003> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-worker1363.eqiad.wmnet with OS trixie [production]
08:17 <vriley@cumin1003> START - Cookbook sre.hosts.reimage for host wikikube-worker1363.eqiad.wmnet with OS trixie [production]
08:12 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-worker1363.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:11 <slyngshede@cumin1003> DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Resquito out of all services on: 1 hosts [production]
08:07 <slyngshede@cumin1003> DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Resquito out of all services on: 1 hosts [production]
08:07 <slyngshede@cumin1003> DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Resquito out of all services on: 1 hosts [production]
08:07 <slyngshede@cumin1003> DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Resquito out of all services on: 1 hosts [production]
08:07 <slyngshede@cumin1003> DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Resquito out of all services on: 1 hosts [production]
08:05 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1362.eqiad.wmnet with OS trixie [production]
08:05 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
08:05 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - vriley@cumin1003" [production]
08:04 <slyngshede@cumin1003> DONE (PASS) - Cookbook sre.idm.logout (exit_code=0) Logging Resquito out of all services on: 2444 hosts [production]
08:03 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host wikikube-worker1363.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
08:03 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1363 [production]
08:02 <vriley@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1363 [production]
08:02 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:02 <vriley@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt wikikube-worker1363 - vriley@cumin1003" [production]
08:02 <vriley@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt wikikube-worker1363 - vriley@cumin1003" [production]
07:58 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]
07:51 <vriley@cumin1003> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host wikikube-worker1361.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:49 <vriley@cumin1003> START - Cookbook sre.hosts.provision for host wikikube-worker1361.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
07:49 <vriley@cumin1003> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1362.eqiad.wmnet with reason: host reimage [production]
07:46 <vriley@cumin1003> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker1361 [production]
07:44 <vriley@cumin1003> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker1361 [production]
07:44 <vriley@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
07:43 <vriley@cumin1003> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1362.eqiad.wmnet with reason: host reimage [production]
07:41 <vriley@cumin1003> START - Cookbook sre.dns.netbox [production]