2024-02-14
ยง
|
10:38 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1166 (T352010)', diff saved to https://phabricator.wikimedia.org/P56745 and previous config saved to /var/cache/conftool/dbconfig/20240214-103810-ladsgroup.json |
[production] |
10:38 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1166.eqiad.wmnet with reason: Maintenance |
[production] |
10:37 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1166.eqiad.wmnet with reason: Maintenance |
[production] |
10:37 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance |
[production] |
10:37 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1150.eqiad.wmnet with reason: Maintenance |
[production] |
10:37 |
<slyngs> |
Deploying new PKI checks to alertmanager |
[production] |
10:33 |
<akosiaris@deploy2002> |
helmfile [eqiad] DONE helmfile.d/services/eventstreams: apply |
[production] |
10:33 |
<akosiaris@deploy2002> |
helmfile [eqiad] START helmfile.d/services/eventstreams: apply |
[production] |
10:31 |
<jmm@cumin2002> |
END (ERROR) - Cookbook sre.hosts.reimage (exit_code=93) for host puppetserver2003.codfw.wmnet with OS bookworm |
[production] |
10:28 |
<akosiaris@deploy2002> |
helmfile [staging] DONE helmfile.d/services/eventstreams: apply |
[production] |
10:28 |
<akosiaris@deploy2002> |
helmfile [staging] START helmfile.d/services/eventstreams: apply |
[production] |
10:19 |
<jmm@cumin2002> |
START - Cookbook sre.hosts.reimage for host puppetserver2003.codfw.wmnet with OS bookworm |
[production] |
10:18 |
<godog> |
powercycle titan1001 |
[production] |
10:15 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'P{O:wmcs::openstack::eqiad1::virt_ceph}' |
[admin] |
10:02 |
<ayounsi@cumin1002> |
START - Cookbook sre.hosts.reimage for host sretest2005.codfw.wmnet with OS bookworm |
[production] |
09:55 |
<moritzm> |
installing Linux 5.10.209 on Bullseye hosts |
[production] |
09:49 |
<moritzm> |
imported openssl11 1.1.1w-0+deb11u1+wmf1 to component/haproxy26 T352744 |
[production] |
09:42 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2024.codfw.wmnet |
[production] |
09:38 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2024.codfw.wmnet |
[production] |
09:33 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'P{O:wmcs::openstack::codfw1dev::virt_ceph}' |
[admin] |
09:31 |
<taavi> |
failover all dumps traffic to clouddumps1001 |
[admin] |
09:14 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on tools-k8s-worker-nfs-25.tools.eqiad1.wikimedia.cloud |
[tools] |
09:13 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.vps.refresh_puppet_certs on tools-k8s-worker-nfs-25.tools.eqiad1.wikimedia.cloud |
[tools] |
09:13 |
<wmbot~dcaro@urcuchillay> |
END (PASS) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=0) |
[tools] |
09:08 |
<jmm@cumin2002> |
END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2023.codfw.wmnet |
[production] |
09:07 |
<wmbot~dcaro@urcuchillay> |
START - Cookbook wmcs.openstack.cloudvirt.vm_console |
[tools] |
09:07 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster |
[tools] |
09:07 |
<taavi@cloudcumin1001> |
Added a new k8s worker-nfs tools-k8s-worker-nfs-30.tools.eqiad1.wikimedia.cloud to the cluster |
[tools] |
09:05 |
<jmm@cumin2002> |
START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti2023.codfw.wmnet |
[production] |
08:56 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster |
[tools] |
08:56 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-74 |
[tools] |
08:55 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-74 |
[tools] |
08:54 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster |
[tools] |
08:54 |
<taavi@cloudcumin1001> |
Added a new k8s worker-nfs tools-k8s-worker-nfs-29.tools.eqiad1.wikimedia.cloud to the cluster |
[tools] |
08:44 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster |
[tools] |
08:44 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-73 |
[tools] |
08:43 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-73 |
[tools] |
08:43 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster |
[tools] |
08:43 |
<taavi@cloudcumin1001> |
Added a new k8s worker-nfs tools-k8s-worker-nfs-28.tools.eqiad1.wikimedia.cloud to the cluster |
[tools] |
08:41 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Depooling db1221 (T352010)', diff saved to https://phabricator.wikimedia.org/P56744 and previous config saved to /var/cache/conftool/dbconfig/20240214-084146-ladsgroup.json |
[production] |
08:41 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on clouddb[1015,1019,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
08:41 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on clouddb[1015,1019,1021].eqiad.wmnet,db1155.eqiad.wmnet with reason: Maintenance |
[production] |
08:41 |
<ladsgroup@cumin1002> |
END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1221.eqiad.wmnet with reason: Maintenance |
[production] |
08:41 |
<ladsgroup@cumin1002> |
START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1221.eqiad.wmnet with reason: Maintenance |
[production] |
08:41 |
<ladsgroup@cumin1002> |
dbctl commit (dc=all): 'Repooling after maintenance db1199 (T352010)', diff saved to https://phabricator.wikimedia.org/P56743 and previous config saved to /var/cache/conftool/dbconfig/20240214-084104-ladsgroup.json |
[production] |
08:33 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster |
[tools] |
08:33 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-72 |
[tools] |
08:33 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'P{O:wmcs::openstack::codfw1dev::virt_ceph}' |
[admin] |
08:32 |
<taavi@cloudcumin1001> |
START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-72 |
[tools] |
08:32 |
<taavi@cloudcumin1001> |
END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster |
[tools] |