1601-1650 of 10000 results (96ms)
2024-06-17 ยง
16:59 <swfrench@deploy1002> helmfile [codfw] DONE helmfile.d/services/data-gateway: sync [production]
16:59 <swfrench@deploy1002> helmfile [codfw] START helmfile.d/services/data-gateway: sync [production]
16:58 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker1021.eqiad.wmnet with OS bullseye [production]
16:58 <swfrench@deploy1002> helmfile [codfw] DONE helmfile.d/services/data-gateway: apply [production]
16:58 <claime> homer 'cr*eqiad*' commit 'T351074' [production]
16:58 <swfrench@deploy1002> helmfile [codfw] START helmfile.d/services/data-gateway: apply [production]
16:43 <swfrench@deploy1002> helmfile [staging] DONE helmfile.d/services/device-analytics: sync [production]
16:43 <swfrench@deploy1002> helmfile [staging] START helmfile.d/services/device-analytics: sync [production]
16:42 <mnz@deploy1002> Finished deploy [airflow-dags/research@5e1cd80]: (no justification provided) (duration: 00m 32s) [production]
16:42 <swfrench@deploy1002> helmfile [staging] DONE helmfile.d/services/device-analytics: apply [production]
16:42 <mnz@deploy1002> Started deploy [airflow-dags/research@5e1cd80]: (no justification provided) [production]
16:42 <swfrench@deploy1002> helmfile [staging] START helmfile.d/services/device-analytics: apply [production]
16:40 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1021.eqiad.wmnet with reason: host reimage [production]
16:37 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1019.eqiad.wmnet with reason: host reimage [production]
16:33 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker1020.eqiad.wmnet with reason: host reimage [production]
16:32 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1021.eqiad.wmnet with reason: host reimage [production]
16:31 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1019.eqiad.wmnet with reason: host reimage [production]
16:30 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker1020.eqiad.wmnet with reason: host reimage [production]
16:30 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:40:00 on cr2-eqdfw,cr2-eqdfw IPv6 with reason: JunOS upgrade and PSU swap on cr2-eqdfw [production]
16:29 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 0:40:00 on cr2-eqdfw,cr2-eqdfw IPv6 with reason: JunOS upgrade and PSU swap on cr2-eqdfw [production]
16:29 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:40:00 on cr[1-2]-codfw,cr2-drmrs,cr2-esams,cr2-magru with reason: JunOS upgrade and PSU swap on cr2-eqdfw [production]
16:29 <cmooney@cumin1002> START - Cookbook sre.hosts.downtime for 0:40:00 on cr[1-2]-codfw,cr2-drmrs,cr2-esams,cr2-magru with reason: JunOS upgrade and PSU swap on cr2-eqdfw [production]
16:29 <mvolz@deploy1002> helmfile [eqiad] DONE helmfile.d/services/citoid: apply [production]
16:28 <mvolz@deploy1002> helmfile [eqiad] START helmfile.d/services/citoid: apply [production]
16:27 <mvolz@deploy1002> helmfile [codfw] DONE helmfile.d/services/citoid: apply [production]
16:27 <mvolz@deploy1002> helmfile [codfw] START helmfile.d/services/citoid: apply [production]
16:26 <mvolz@deploy1002> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
16:25 <mvolz@deploy1002> helmfile [staging] START helmfile.d/services/citoid: apply [production]
16:25 <andrew@cumin1002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts cloudvirt-wdqs1003.eqiad.wmnet [production]
16:25 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:25 <andrew@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudvirt-wdqs1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - andrew@cumin1002" [production]
16:24 <andrew@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cloudvirt-wdqs1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - andrew@cumin1002" [production]
16:21 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
16:16 <andrew@cumin1002> START - Cookbook sre.hosts.decommission for hosts cloudvirt-wdqs1003.eqiad.wmnet [production]
16:16 <andrew@cumin1002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts cloudvirt-wdqs1002.eqiad.wmnet [production]
16:16 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:14 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
16:09 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker1019.eqiad.wmnet with OS bullseye [production]
16:09 <jdrewniak@deploy1002> Synchronized portals/wikipedia.org/assets: Wikimedia Portals Update: [[gerrit:1046698| Bumping portals to master (T128546)]] (duration: 14m 13s) [production]
16:09 <eevans@cumin1002> END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching restbase1028.eqiad.wmnet: Apply update to Java 11 - eevans@cumin1002 [production]
16:09 <cgoubert@cumin1002> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host wikikube-worker1019.eqiad.wmnet with OS bullseye [production]
16:08 <andrew@cumin1002> START - Cookbook sre.hosts.decommission for hosts cloudvirt-wdqs1002.eqiad.wmnet [production]
16:05 <andrew@cumin1002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts cloudvirt-wdqs1001.eqiad.wmnet [production]
16:05 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
16:03 <andrew@cumin1002> START - Cookbook sre.dns.netbox [production]
16:00 <eevans@cumin1002> START - Cookbook sre.cassandra.roll-restart for nodes matching restbase1028.eqiad.wmnet: Apply update to Java 11 - eevans@cumin1002 [production]
15:59 <andrew@cumin1002> START - Cookbook sre.hosts.decommission for hosts cloudvirt-wdqs1001.eqiad.wmnet [production]
15:57 <andrew@cumin1002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts cloudvirt-wdqs1001.eqiad.wmnet [production]
15:57 <andrew@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
15:56 <andrew@cumin1002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts cloudvirt-wdqs1002.eqiad.wmnet [production]