6301-6350 of 10000 results (72ms)
2024-07-22 ยง
18:16 <andrewbogott> deleting project after discussion on https://phabricator.wikimedia.org/T367542 [mwv-apt]
18:13 <aokoth@cumin1002> END (FAIL) - Cookbook sre.vrts.upgrade (exit_code=99) on VRTS host vrts2001.codfw.wmnet [production]
18:12 <aokoth@cumin1002> START - Cookbook sre.vrts.upgrade on VRTS host vrts2001.codfw.wmnet [production]
18:12 <aokoth@cumin1002> END (ERROR) - Cookbook sre.vrts.upgrade (exit_code=97) on VRTS host vrts2001.codfw.wmnet [production]
18:12 <aokoth@cumin1002> START - Cookbook sre.vrts.upgrade on VRTS host vrts2001.codfw.wmnet [production]
18:10 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.ceph.osd.bootstrap_and_add [admin]
18:09 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.ceph.osd.depool_and_destroy (exit_code=0) [admin]
17:42 <dcaro> moved the apt repo to service endpoint deb.svc.toolforge.org [tools]
17:42 <cmooney@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
17:42 <cmooney@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns entries for new cloudceph nodes - cmooney@cumin1002" [production]
17:41 <cmooney@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update dns entries for new cloudceph nodes - cmooney@cumin1002" [production]
17:39 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=0) for tools-k8s-worker-nfs-3 [tools]
17:39 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.ceph.osd.depool_and_destroy [admin]
17:39 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.ceph.osd.bootstrap_and_add (exit_code=0) [admin]
17:38 <andrew@cloudcumin1001> START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-3 [tools]
17:33 <cmooney@cumin1002> START - Cookbook sre.dns.netbox [production]
17:32 <cmooney@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudcephmon1004.eqiad.wmnet with OS bullseye [production]
17:11 <cmooney@cumin1002> START - Cookbook sre.hosts.reimage for host cloudcephmon1004.eqiad.wmnet with OS bullseye [production]
17:09 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work [production]
17:09 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on netbox2003.codfw.wmnet with reason: netbox upgrade prep work [production]
17:09 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work [production]
17:09 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work [production]
17:09 <ayounsi@cumin1002> END (ERROR) - Cookbook sre.hosts.downtime (exit_code=97) for 2:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work [production]
17:08 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on netbox1003.eqiad.wmnet with reason: netbox upgrade prep work [production]
17:07 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.rebuild_dbinstance (exit_code=0) [releng]
17:07 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.rebuild_dbinstance (exit_code=0) [deployment-prep]
17:05 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.rebuild_dbinstance [releng]
17:05 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.rebuild_dbinstance [deployment-prep]
17:03 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=0) [tools]
17:03 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.openstack.cloudvirt.vm_console [tools]
17:02 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.rebuild_dbinstance (exit_code=0) [deployment-prep]
17:02 <andrew@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.rebuild_dbinstance (exit_code=0) [releng]
17:00 <dcaro> moving the toolforge apt repo to tools-services-06 [tools]
17:00 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.rebuild_dbinstance [releng]
17:00 <andrew@cloudcumin1001> START - Cookbook wmcs.openstack.rebuild_dbinstance [deployment-prep]
16:58 <andrewbogott> upgrading db servers copypatrol-prod-db-01 and copypatrol-dev-db-01 to latest Trove guest image [copypatrol]
16:55 <wmbot~dcaro@urcuchillay> END (PASS) - Cookbook wmcs.vps.refresh_puppet_certs (exit_code=0) on tools-services-06.tools.eqiad1.wikimedia.cloud [tools]
16:53 <wmbot~dcaro@urcuchillay> START - Cookbook wmcs.vps.refresh_puppet_certs on tools-services-06.tools.eqiad1.wikimedia.cloud [tools]
16:37 <sukhe> [doh1001] upgrade anycast-healthchecker to 0.9.8-1+wmf12u1: T370068 [production]
16:32 <cgoubert@cumin1002> conftool action : set/weight=10:pooled=yes; selector: name=(wikikube-worker2035.codfw.wmnet|wikikube-worker2036.codfw.wmnet|wikikube-worker2037.codfw.wmnet|wikikube-worker2038.codfw.wmnet),cluster=kubernetes,service=kubesvc [production]
16:31 <claime> Pooling and uncordoning wikikube-worker2035.codfw.wmnet wikikube-worker2036.codfw.wmnet wikikube-worker2037.codfw.wmnet wikikube-worker2038.codfw.wmnet - T351074 [production]
16:31 <sukhe> restart anycast-hc on durum1001 [production]
16:13 <pt1979@cumin1002> END (FAIL) - Cookbook sre.hosts.dhcp (exit_code=99) for host cloudcephmon1004.eqiad.wmnet [production]
16:08 <pt1979@cumin1002> START - Cookbook sre.hosts.dhcp for host cloudcephmon1004.eqiad.wmnet [production]
16:08 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.tofu (exit_code=0) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/21 [admin]
16:07 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/21 [admin]
16:05 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.tofu (exit_code=99) running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/21 [admin]
16:05 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.tofu running tofu plan for https://gitlab.wikimedia.org/repos/cloud/cloud-vps/tofu-infra/-/merge_requests/21 [admin]
16:02 <elukey> remove /srv/kafka/data/eqiad.resource-purge-3 on kafka-main2001 to force a refetch of data from good replicas and circumvent data corruption - T370574 [production]
15:58 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on kafka-main2001.codfw.wmnet with reason: attempt to remove a data dir on disk [production]