2024-02-13
ยง
|
10:23 |
<kharlan@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/ipoid: apply |
[production] |
10:23 |
<kharlan@deploy2002> |
helmfile [codfw] START helmfile.d/services/ipoid: apply |
[production] |
10:22 |
<kharlan@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/ipoid: apply |
[production] |
10:22 |
<kharlan@deploy2002> |
helmfile [eqiad] START helmfile.d/services/ipoid: apply |
[production] |
10:22 |
<kharlan@deploy2002> |
helmfile [staging] DONE helmfile.d/services/ipoid: apply |
[production] |
10:22 |
<kharlan@deploy2002> |
helmfile [staging] START helmfile.d/services/ipoid: apply |
[production] |
10:11 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a ingress role in the toolsbeta cluster |
[toolsbeta] |
10:11 |
<taavi@cloudcumin1001> |
Added a new k8s ingress toolsbeta-test-k8s-ingress-8.toolsbeta.eqiad1.wikimedia.cloud to the cluster |
[toolsbeta] |
10:09 |
<brouberol@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage |
[production] |
10:06 |
<brouberol@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage |
[production] |
10:05 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host clouddb2002-dev.codfw.wmnet |
[production] |
10:04 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_node for a ingress role in the toolsbeta cluster |
[toolsbeta] |
10:03 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host toolsbeta-test-k8s-ingress-3 |
[toolsbeta] |
10:03 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_node for host toolsbeta-test-k8s-ingress-3 |
[toolsbeta] |
09:59 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a ingress role in the toolsbeta cluster |
[toolsbeta] |
09:59 |
<taavi@cloudcumin1001> |
Added a new k8s ingress toolsbeta-test-k8s-ingress-7.toolsbeta.eqiad1.wikimedia.cloud to the cluster |
[toolsbeta] |
09:58 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reboot-single for host clouddb2002-dev.codfw.wmnet |
[production] |
09:57 |
<brouberol@cumin1002> |
START - Cookbook sre.hosts.reimage for host apifeatureusage1001.eqiad.wmnet with OS bookworm |
[production] |
09:52 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_node for a ingress role in the toolsbeta cluster |
[toolsbeta] |
09:50 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the toolsbeta cluster |
[toolsbeta] |
09:50 |
<taavi@cloudcumin1001> |
Added a new k8s worker-nfs toolsbeta-test-k8s-worker-nfs-4.toolsbeta.eqiad1.wikimedia.cloud to the cluster |
[toolsbeta] |
09:41 |
<arturo> |
deleting all leaked instances by hand (11 VMs) |
[admin-monitoring] |
09:40 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the toolsbeta cluster |
[toolsbeta] |
09:40 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host toolsbeta-test-k8s-worker-8 |
[toolsbeta] |
09:39 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_node for host toolsbeta-test-k8s-worker-8 |
[toolsbeta] |
09:39 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host toolsbeta-test-k8s-worker-7 |
[toolsbeta] |
09:38 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_node for host toolsbeta-test-k8s-worker-7 |
[toolsbeta] |
09:36 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster |
[tools] |
09:36 |
<taavi@cloudcumin1001> |
Added a new k8s worker-nfs tools-k8s-worker-nfs-21.tools.eqiad1.wikimedia.cloud to the cluster |
[tools] |
09:26 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster |
[tools] |
09:26 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-64 |
[tools] |
09:25 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-64 |
[tools] |
09:23 |
<stran@deploy2002> |
helmfile [codfw] DONE helmfile.d/services/ipoid: apply |
[production] |
09:22 |
<stran@deploy2002> |
helmfile [codfw] START helmfile.d/services/ipoid: apply |
[production] |
09:22 |
<stran@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/ipoid: apply |
[production] |
09:22 |
<akosiaris> |
delete sessionstore pod to force rescheduling |
[production] |
09:21 |
<stran@deploy2002> |
helmfile [eqiad] START helmfile.d/services/ipoid: apply |
[production] |
09:20 |
<stran@deploy2002> |
helmfile [staging] DONE helmfile.d/services/ipoid: apply |
[production] |
09:20 |
<brouberol@cumin1002> |
END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host apifeatureusage1001.eqiad.wmnet with OS bookworm |
[production] |
09:20 |
<stran@deploy2002> |
helmfile [staging] START helmfile.d/services/ipoid: apply |
[production] |
09:18 |
<brouberol@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage |
[production] |
09:16 |
<brouberol@cumin1002> |
START - Cookbook sre.hosts.downtime for 2:00:00 on apifeatureusage1001.eqiad.wmnet with reason: host reimage |
[production] |
09:04 |
<brouberol@cumin1002> |
START - Cookbook sre.hosts.reimage for host apifeatureusage1001.eqiad.wmnet with OS bookworm |
[production] |
09:03 |
<brouberol> |
attempting a reimage of apifeatureusage1001 to bookworm - T346053 |
[analytics] |
08:28 |
<hashar@deploy2002> |
Finished scap: Backport for [[gerrit:1002813|Increase $wgMaxUploadSize to 5 GiB (previously was 4GiB). (T191804)]] (duration: 08m 57s) |
[production] |
08:27 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: grafana |
[production] |
08:21 |
<hashar@deploy2002> |
hashar and bawolff: Continuing with sync |
[production] |
08:21 |
<hashar@deploy2002> |
hashar and bawolff: Backport for [[gerrit:1002813|Increase $wgMaxUploadSize to 5 GiB (previously was 4GiB). (T191804)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) |
[production] |
08:19 |
<hashar@deploy2002> |
Started scap: Backport for [[gerrit:1002813|Increase $wgMaxUploadSize to 5 GiB (previously was 4GiB). (T191804)]] |
[production] |
08:18 |
<jmm@cumin2002> |
START - Cookbook sre.puppet.migrate-role for role: grafana |
[production] |