4551-4600 of 10000 results (44ms)
2024-06-21 ยง
11:37 <hnowlan@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) shellbox-video.discovery.wmnet on all recursors [production]
11:37 <hnowlan@cumin1002> START - Cookbook sre.dns.wipe-cache shellbox-video.discovery.wmnet on all recursors [production]
11:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2214 (T367856)', diff saved to https://phabricator.wikimedia.org/P65303 and previous config saved to /var/cache/conftool/dbconfig/20240621-110638-marostegui.json [production]
11:06 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2214.codfw.wmnet with reason: Maintenance [production]
11:06 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2214.codfw.wmnet with reason: Maintenance [production]
10:57 <Emperor> restart swift-proxy on ms-fe2011 ms-fe2012 T360913 [production]
10:56 <Emperor> restart swift-proxy on ms-fe1010 T360913 [production]
10:49 <arturo> force reschedule the humaniki-prod instance to a non-OVS hypervisor with `wmcs-openstack server migrate 0bd43b56-75ba-411b-980b-b8d8f06837a8` [wikidumpparse]
10:44 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.migrate_project_to_ovs (exit_code=1) [wikidumpparse]
10:43 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.migrate_project_to_ovs [wikidumpparse]
10:39 <arturo> force-reboot humaniki-prod, it had lost the network [wikidumpparse]
10:36 <kamila@cumin1002> conftool action : set/pooled=yes; selector: name=wikikube-ctrl2002.codfw.wmnet [production]
10:36 <kamila@cumin1002> conftool action : set/pooled=yes; selector: name=wikikube-ctrl2001.codfw.wmnet [production]
10:36 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=99) [admin]
10:36 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.vm_console [admin]
10:35 <wmbot~arturo@nostromo> END (ERROR) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=97) [admin]
10:35 <wmbot~arturo@nostromo> START - Cookbook wmcs.openstack.cloudvirt.vm_console [admin]
10:35 <wmbot~arturo@nostromo> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=99) [admin]
10:35 <wmbot~arturo@nostromo> START - Cookbook wmcs.openstack.cloudvirt.vm_console [admin]
10:28 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-ctrl2002.codfw.wmnet with OS bullseye [production]
10:05 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2163 (T364069)', diff saved to https://phabricator.wikimedia.org/P65302 and previous config saved to /var/cache/conftool/dbconfig/20240621-100554-marostegui.json [production]
10:05 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2163.codfw.wmnet with reason: Maintenance [production]
10:05 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2163.codfw.wmnet with reason: Maintenance [production]
10:05 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2162 (T364069)', diff saved to https://phabricator.wikimedia.org/P65301 and previous config saved to /var/cache/conftool/dbconfig/20240621-100531-marostegui.json [production]
09:50 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2162', diff saved to https://phabricator.wikimedia.org/P65300 and previous config saved to /var/cache/conftool/dbconfig/20240621-095024-marostegui.json [production]
09:45 <brouberol@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12 days, 0:00:00 on karapace[1001-1002].eqiad.wmnet with reason: The hosts are soon to be decommissioned [production]
09:45 <brouberol@cumin2002> START - Cookbook sre.hosts.downtime for 12 days, 0:00:00 on karapace[1001-1002].eqiad.wmnet with reason: The hosts are soon to be decommissioned [production]
09:43 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.unset_maintenance (exit_code=0) [admin]
09:43 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.unset_maintenance [admin]
09:41 <aborrero@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1053.eqiad.wmnet with OS bookworm [production]
09:40 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=0) on eqiad1, with recreate True, for hosts list: ['cloudvirt1053'] [cloudvirt-canary]
09:39 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate True, for hosts list: ['cloudvirt1053'] [cloudvirt-canary]
09:35 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2162', diff saved to https://phabricator.wikimedia.org/P65299 and previous config saved to /var/cache/conftool/dbconfig/20240621-093517-marostegui.json [production]
09:31 <ryankemper@cumin2002> END (PASS) - Cookbook sre.wdqs.data-reload (exit_code=0) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/data/discovery/wikidata/munged_n3_dump/wikidata/full/20240603/ using stat1009.eqiad.wmnet) [production]
09:20 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2162 (T364069)', diff saved to https://phabricator.wikimedia.org/P65298 and previous config saved to /var/cache/conftool/dbconfig/20240621-092009-marostegui.json [production]
09:16 <aborrero@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1053.eqiad.wmnet with reason: host reimage [production]
09:14 <aborrero@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1053.eqiad.wmnet with reason: host reimage [production]
09:02 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-ctrl2002.codfw.wmnet with reason: host reimage [production]
08:57 <kamila@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-ctrl2002.codfw.wmnet with reason: host reimage [production]
08:56 <aborrero@cumin1002> START - Cookbook sre.hosts.reimage for host cloudvirt1053.eqiad.wmnet with OS bookworm [production]
08:47 <aborrero@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1053.eqiad.wmnet [production]
08:41 <kamila@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2002.codfw.wmnet with OS bullseye [production]
08:39 <aborrero@cumin1002> START - Cookbook sre.hosts.reboot-single for host cloudvirt1053.eqiad.wmnet [production]
08:31 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.drain (exit_code=0) on host 'cloudvirt1053.eqiad.wmnet' (T368129) [admin]
08:28 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.drain on host 'cloudvirt1053.eqiad.wmnet' (T368129) [admin]
08:14 <vgutierrez> restarting logrotate.service on cp[3068,3070-3071].esams.wmnet [production]
08:04 <akosiaris@deploy1002> helmfile [eqiad] DONE helmfile.d/services/mobileapps: apply [production]
08:04 <akosiaris@deploy1002> helmfile [eqiad] START helmfile.d/services/mobileapps: apply [production]
08:03 <akosiaris@deploy1002> helmfile [codfw] DONE helmfile.d/services/mobileapps: apply [production]
08:03 <akosiaris@deploy1002> helmfile [codfw] START helmfile.d/services/mobileapps: apply [production]