701-750 of 10000 results (116ms)
2024-09-04 ยง
15:55 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on 10 hosts with reason: network maintenance T370852 [production]
15:55 <arnaudb@cumin1002> dbctl commit (dc=all): 'depool db2125 db2138 db2149 db2190 db2206 db2207 es2031 es2032 es2036 - T370852', diff saved to https://phabricator.wikimedia.org/P68650 and previous config saved to /var/cache/conftool/dbconfig/20240904-155459-arnaudb.json [production]
15:53 <ladsgroup@deploy1003> Finished scap sync-world: Backport for [[gerrit:1070324|Fix bug causing review form to disappear on unreviewed pages (T373582)]] (duration: 10m 31s) [production]
15:53 <swfrench@cumin2002> START - Cookbook sre.hosts.decommission for hosts mw[2260,2267].codfw.wmnet [production]
15:48 <ladsgroup@deploy1003> ladsgroup: Continuing with sync [production]
15:47 <ladsgroup@deploy1003> ladsgroup: Backport for [[gerrit:1070324|Fix bug causing review form to disappear on unreviewed pages (T373582)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
15:46 <bking@cumin2002> END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) [production]
15:43 <aqu@deploy1003> Finished deploy [airflow-dags/analytics@3b0d8ba]: Regular analytics weekly train [airflow-dags@3b0d8ba1] (duration: 00m 48s) [production]
15:43 <topranks> configure lsw1-c1-codfw interfaces for servers in advance of move T373095 [production]
15:43 <ladsgroup@deploy1003> Started scap sync-world: Backport for [[gerrit:1070324|Fix bug causing review form to disappear on unreviewed pages (T373582)]] [production]
15:42 <aqu@deploy1003> Started deploy [airflow-dags/analytics@3b0d8ba]: Regular analytics weekly train [airflow-dags@3b0d8ba1] [production]
15:25 <swfrench@cumin2002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host mw2267.codfw.wmnet [production]
15:24 <swfrench@cumin2002> START - Cookbook sre.k8s.pool-depool-node depool for host mw2267.codfw.wmnet [production]
15:21 <hnowlan@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker2081.codfw.wmnet [production]
15:20 <hnowlan@cumin1002> START - Cookbook sre.k8s.pool-depool-node pool for host wikikube-worker2081.codfw.wmnet [production]
15:15 <vgutierrez@cumin1002> conftool action : set/pooled=yes; selector: name=cp7015.magru.wmnet [production]
15:14 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ml-lab1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:13 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cp7015.magru.wmnet [production]
15:13 <vgutierrez@cumin1002> START - Cookbook sre.hosts.remove-downtime for cp7015.magru.wmnet [production]
15:12 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host phab1005.eqiad.wmnet with OS bookworm [production]
15:12 <jclark@cumin1002> END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host phab1005.eqiad.wmnet with OS bookworm [production]
15:11 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host ml-lab1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:09 <hashar@deploy1003> Finished scap sync-world: Backport for [[gerrit:1070548|ParserOutput::collectMetadata: Log if given value is non-numeric and also non-string, for easier debugging, and don't fatal (T373920)]] (duration: 08m 37s) [production]
15:06 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ml-lab1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:06 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wikikube-worker2036.codfw.wmnet with reason: host reimage [production]
15:05 <jclark@cumin1002> START - Cookbook sre.hosts.reimage for host ml-lab1001.eqiad.wmnet with OS bookworm [production]
15:04 <hashar@deploy1003> hashar: Continuing with sync [production]
15:04 <hashar@deploy1003> hashar: Backport for [[gerrit:1070548|ParserOutput::collectMetadata: Log if given value is non-numeric and also non-string, for easier debugging, and don't fatal (T373920)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) [production]
15:02 <cgoubert@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on wikikube-worker2036.codfw.wmnet with reason: host reimage [production]
15:02 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host ml-lab1001.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
15:00 <hashar@deploy1003> Started scap sync-world: Backport for [[gerrit:1070548|ParserOutput::collectMetadata: Log if given value is non-numeric and also non-string, for easier debugging, and don't fatal (T373920)]] [production]
14:49 <claime> homer cr*codfw* commit 'T372878' [production]
14:44 <cgoubert@cumin1002> END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wikikube-worker2036 [production]
14:44 <cgoubert@cumin1002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker2036 [production]
14:44 <cgoubert@cumin1002> START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker2036 [production]
14:44 <cgoubert@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wikikube-worker2036.codfw.wmnet 121.16.192.10.in-addr.arpa 1.2.1.0.6.1.0.0.2.9.1.0.0.1.0.0.2.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
14:44 <cgoubert@cumin1002> START - Cookbook sre.dns.wipe-cache wikikube-worker2036.codfw.wmnet 121.16.192.10.in-addr.arpa 1.2.1.0.6.1.0.0.2.9.1.0.0.1.0.0.2.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors [production]
14:44 <cgoubert@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:44 <cgoubert@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wikikube-worker2036 - cgoubert@cumin1002" [production]
14:44 <cgoubert@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wikikube-worker2036 - cgoubert@cumin1002" [production]
14:40 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:40 <cgoubert@cumin1002> START - Cookbook sre.dns.netbox [production]
14:40 <bking@cumin2002> START - Cookbook sre.wdqs.restart [production]
14:40 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:40 <cgoubert@cumin1002> START - Cookbook sre.hosts.move-vlan for host wikikube-worker2036 [production]
14:40 <bking@cumin2002> END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) [production]
14:39 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2036.codfw.wmnet with OS bullseye [production]
14:39 <arnaudb@cumin1002> dbctl commit (dc=all): 'swap masters for es1 - T373095', diff saved to https://phabricator.wikimedia.org/P68648 and previous config saved to /var/cache/conftool/dbconfig/20240904-143928-arnaudb.json [production]
14:38 <cgoubert@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker2036.codfw.wmnet [production]
14:38 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]