2024-06-21
13:21 <btullis@deploy1002> Finished deploy [performance/asoranking@febfb9f]: (no justification provided) (duration: 00m 04s) [production]
13:21 <btullis@deploy1002> Started deploy [performance/asoranking@febfb9f]: (no justification provided) [production]
13:08 <dani@deploy1002> helmfile [codfw] DONE helmfile.d/services/miscweb: apply [production]
13:07 <dani@deploy1002> helmfile [codfw] START helmfile.d/services/miscweb: apply [production]
13:07 <dani@deploy1002> helmfile [eqiad] DONE helmfile.d/services/miscweb: apply [production]
13:07 <dani@deploy1002> helmfile [eqiad] START helmfile.d/services/miscweb: apply [production]
13:07 <dani@deploy1002> helmfile [staging] DONE helmfile.d/services/miscweb: apply [production]
13:06 <dani@deploy1002> helmfile [staging] START helmfile.d/services/miscweb: apply [production]
11:37 <hnowlan@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) shellbox-video.discovery.wmnet on all recursors [production]
11:37 <hnowlan@cumin1002> START - Cookbook sre.dns.wipe-cache shellbox-video.discovery.wmnet on all recursors [production]
11:06 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2214 (T367856)', diff saved to https://phabricator.wikimedia.org/P65303 and previous config saved to /var/cache/conftool/dbconfig/20240621-110638-marostegui.json [production]
11:06 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2214.codfw.wmnet with reason: Maintenance [production]
11:06 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2214.codfw.wmnet with reason: Maintenance [production]
10:57 <Emperor> restart swift-proxy on ms-fe2011 ms-fe2012 T360913 [production]
10:56 <Emperor> restart swift-proxy on ms-fe1010 T360913 [production]
10:49 <arturo> force reschedule the humaniki-prod instance to a non-OVS hypervisor with `wmcs-openstack server migrate 0bd43b56-75ba-411b-980b-b8d8f06837a8` [wikidumpparse]
10:44 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.migrate_project_to_ovs (exit_code=1) [wikidumpparse]
10:43 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.migrate_project_to_ovs [wikidumpparse]
10:39 <arturo> force-reboot humaniki-prod, it had lost the network [wikidumpparse]
10:36 <kamila@cumin1002> conftool action : set/pooled=yes; selector: name=wikikube-ctrl2002.codfw.wmnet [production]
10:36 <kamila@cumin1002> conftool action : set/pooled=yes; selector: name=wikikube-ctrl2001.codfw.wmnet [production]
10:36 <aborrero@cloudcumin1001> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=99) [admin]
10:36 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.vm_console [admin]
10:35 <wmbot~arturo@nostromo> END (ERROR) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=97) [admin]
10:35 <wmbot~arturo@nostromo> START - Cookbook wmcs.openstack.cloudvirt.vm_console [admin]
10:35 <wmbot~arturo@nostromo> END (FAIL) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=99) [admin]
10:35 <wmbot~arturo@nostromo> START - Cookbook wmcs.openstack.cloudvirt.vm_console [admin]
10:28 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wikikube-ctrl2002.codfw.wmnet with OS bullseye [production]
10:05 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2163 (T364069)', diff saved to https://phabricator.wikimedia.org/P65302 and previous config saved to /var/cache/conftool/dbconfig/20240621-100554-marostegui.json [production]
10:05 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2163.codfw.wmnet with reason: Maintenance [production]
10:05 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2163.codfw.wmnet with reason: Maintenance [production]
10:05 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2162 (T364069)', diff saved to https://phabricator.wikimedia.org/P65301 and previous config saved to /var/cache/conftool/dbconfig/20240621-100531-marostegui.json [production]
09:50 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2162', diff saved to https://phabricator.wikimedia.org/P65300 and previous config saved to /var/cache/conftool/dbconfig/20240621-095024-marostegui.json [production]
09:45 <brouberol@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12 days, 0:00:00 on karapace[1001-1002].eqiad.wmnet with reason: The hosts are soon to be decommissioned [production]
09:45 <brouberol@cumin2002> START - Cookbook sre.hosts.downtime for 12 days, 0:00:00 on karapace[1001-1002].eqiad.wmnet with reason: The hosts are soon to be decommissioned [production]
09:43 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.unset_maintenance (exit_code=0) [admin]
09:43 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.unset_maintenance [admin]
09:41 <aborrero@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1053.eqiad.wmnet with OS bookworm [production]
09:40 <aborrero@cloudcumin1001> END (PASS) - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary (exit_code=0) on eqiad1, with recreate True, for hosts list: ['cloudvirt1053'] [cloudvirt-canary]
09:39 <aborrero@cloudcumin1001> START - Cookbook wmcs.openstack.cloudvirt.lib.ensure_canary on eqiad1, with recreate True, for hosts list: ['cloudvirt1053'] [cloudvirt-canary]
09:35 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2162', diff saved to https://phabricator.wikimedia.org/P65299 and previous config saved to /var/cache/conftool/dbconfig/20240621-093517-marostegui.json [production]
09:31 <ryankemper@cumin2002> END (PASS) - Cookbook sre.wdqs.data-reload (exit_code=0) reloading wikidata_full on wdqs2023.codfw.wmnet from DumpsSource.HDFS (hdfs:///wmf/data/discovery/wikidata/munged_n3_dump/wikidata/full/20240603/ using stat1009.eqiad.wmnet) [production]
09:20 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2162 (T364069)', diff saved to https://phabricator.wikimedia.org/P65298 and previous config saved to /var/cache/conftool/dbconfig/20240621-092009-marostegui.json [production]
09:16 <aborrero@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudvirt1053.eqiad.wmnet with reason: host reimage [production]
09:14 <aborrero@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on cloudvirt1053.eqiad.wmnet with reason: host reimage [production]
09:02 <kamila@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-ctrl2002.codfw.wmnet with reason: host reimage [production]
08:57 <kamila@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-ctrl2002.codfw.wmnet with reason: host reimage [production]
08:56 <aborrero@cumin1002> START - Cookbook sre.hosts.reimage for host cloudvirt1053.eqiad.wmnet with OS bookworm [production]
08:47 <aborrero@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudvirt1053.eqiad.wmnet [production]
08:41 <kamila@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-ctrl2002.codfw.wmnet with OS bullseye [production]