2024-12-19
§
|
08:27 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2061.codfw.wmnet with reason: host reimage |
[production] |
08:15 |
<kartik@deploy2002> |
Started scap sync-world: Backport for [[gerrit:1105341|Event logging: pass empty object to translation property (T364460)]] |
[production] |
08:09 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2062 |
[production] |
08:09 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.move-vlan for host wikikube-worker2062 |
[production] |
08:09 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2061 |
[production] |
08:09 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.move-vlan for host wikikube-worker2061 |
[production] |
08:08 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.reimage for host wikikube-worker2062.codfw.wmnet with OS bookworm |
[production] |
08:08 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.reimage for host wikikube-worker2061.codfw.wmnet with OS bookworm |
[production] |
08:07 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker[2061-2062].codfw.wmnet |
[production] |
08:03 |
<jelto@cumin1002> |
START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker[2061-2062].codfw.wmnet |
[production] |
08:01 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2190.codfw.wmnet |
[production] |
08:01 |
<jelto@cumin1002> |
START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2190.codfw.wmnet |
[production] |
08:00 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-worker2190.codfw.wmnet with OS bookworm |
[production] |
07:40 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2190.codfw.wmnet with reason: host reimage |
[production] |
07:37 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2190.codfw.wmnet with reason: host reimage |
[production] |
07:18 |
<jelto@cumin1002> |
END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2190 |
[production] |
07:18 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.move-vlan for host wikikube-worker2190 |
[production] |
07:18 |
<jelto@cumin1002> |
START - Cookbook sre.hosts.reimage for host wikikube-worker2190.codfw.wmnet with OS bookworm |
[production] |
02:28 |
<krinkle@deploy2002> |
Finished deploy [statsv/statsv@2ee86ea]: Add dogstatsd support (duration: 00m 18s) |
[production] |
02:28 |
<krinkle@deploy2002> |
Started deploy [statsv/statsv@2ee86ea]: Add dogstatsd support |
[production] |
01:48 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1043.eqiad.wmnet with OS bookworm |
[production] |
01:05 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.reimage for host es1043.eqiad.wmnet with OS bookworm |
[production] |
2024-12-18
§
|
23:41 |
<jhancock@cumin2002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1043.eqiad.wmnet with OS bookworm |
[production] |
22:58 |
<jhancock@cumin2002> |
START - Cookbook sre.hosts.reimage for host es1043.eqiad.wmnet with OS bookworm |
[production] |
21:31 |
<mfossati@deploy2002> |
Finished deploy [airflow-dags/platform_eng@a43cacf]: bump image suggestions, section topics, and SEAL (duration: 01m 43s) |
[production] |
21:30 |
<mfossati@deploy2002> |
Started deploy [airflow-dags/platform_eng@a43cacf]: bump image suggestions, section topics, and SEAL |
[production] |
20:44 |
<cjming@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply |
[production] |
20:44 |
<cjming@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply |
[production] |
20:36 |
<cjming@deploy2002> |
helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply |
[production] |
20:36 |
<cjming@deploy2002> |
helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply |
[production] |
20:29 |
<otto@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/eventgate-analytics-external: sync |
[production] |
20:28 |
<otto@deploy2002> |
helmfile [codfw] START helmfile.d/services/eventgate-analytics-external: sync |
[production] |
20:28 |
<otto@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: sync |
[production] |
20:27 |
<otto@deploy2002> |
helmfile [eqiad] START helmfile.d/services/eventgate-analytics-external: sync |
[production] |
20:27 |
<otto@deploy2002> |
helmfile [staging] DONE helmfile.d/services/eventgate-analytics-external: sync |
[production] |
20:27 |
<otto@deploy2002> |
helmfile [staging] START helmfile.d/services/eventgate-analytics-external: sync |
[production] |
20:26 |
<ottomata> |
restarting eventgate-analytics-external to clear schema cache - T382113 | https://phabricator.wikimedia.org/T382113#10414005 |
[production] |
19:28 |
<dancy@deploy2002> |
rebuilt and synchronized wikiversions files: group1 to 1.44.0-wmf.8 refs T375667 |
[production] |
18:55 |
<btullis@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" |
[production] |
18:40 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1069.eqiad.wmnet with reason: host reimage |
[production] |
18:37 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1069.eqiad.wmnet with reason: host reimage |
[production] |
18:25 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reimage for host an-worker1069.eqiad.wmnet with OS bullseye |
[production] |
18:25 |
<btullis@cumin1002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1069.eqiad.wmnet with OS bullseye |
[production] |
18:23 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1068.eqiad.wmnet |
[production] |
18:21 |
<btullis@cumin1002> |
START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1068.eqiad.wmnet |
[production] |
18:20 |
<btullis@cumin1002> |
START - Cookbook sre.hosts.reimage for host an-worker1069.eqiad.wmnet with OS bullseye |
[production] |
18:18 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1068.eqiad.wmnet with OS bullseye |
[production] |
18:18 |
<btullis@cumin1002> |
END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" |
[production] |
18:18 |
<eevans@cumin1002> |
END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore1*.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 |
[production] |
18:16 |
<btullis@cumin1002> |
START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" |
[production] |