2651-2700 of 10000 results (114ms)
2024-08-29 ยง
09:48 <cgoubert@cumin1002> START - Cookbook sre.hosts.reimage for host wikikube-worker2010.codfw.wmnet with OS bullseye [production]
09:47 <cgoubert@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host wikikube-worker2010.codfw.wmnet [production]
09:46 <cgoubert@cumin1002> START - Cookbook sre.k8s.pool-depool-node depool for host wikikube-worker2010.codfw.wmnet [production]
09:46 <cgoubert@cumin1002> START - Cookbook sre.k8s.renumber-node Renumbering for host wikikube-worker2010.codfw.wmnet [production]
09:44 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'. [production]
09:44 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
09:44 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'. [production]
09:43 <stevemunene@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
09:32 <arnaudb@cumin1002> END (PASS) - Cookbook sre.switchdc.databases.prepare (exit_code=0) for the switch from test-s4 to test-s4 [production]
09:32 <arnaudb@cumin1002> START - Cookbook sre.switchdc.databases.prepare for the switch from test-s4 to test-s4 [production]
09:28 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1167 (T370903)', diff saved to https://phabricator.wikimedia.org/P68126 and previous config saved to /var/cache/conftool/dbconfig/20240829-092819-ladsgroup.json [production]
09:28 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 16:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
09:28 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 16:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
09:28 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
09:27 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1167.eqiad.wmnet with reason: Maintenance [production]
09:25 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1161 (T371742)', diff saved to https://phabricator.wikimedia.org/P68125 and previous config saved to /var/cache/conftool/dbconfig/20240829-092547-ladsgroup.json [production]
09:25 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
09:25 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1016,1020].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance [production]
09:25 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1161.eqiad.wmnet with reason: Maintenance [production]
09:25 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1161.eqiad.wmnet with reason: Maintenance [production]
09:24 <topranks> apply qos classifers and scedulers to interfaces on asw2-ulsfo T339850 [production]
09:24 <ayounsi@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "idp-test2005 - ayounsi@cumin1002" [production]
09:24 <ayounsi@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "idp-test2005 - ayounsi@cumin1002" [production]
09:15 <ayounsi@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on idp-test2005.wikimedia.org with reason: host reimage [production]
09:14 <hnowlan@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host mw2380.codfw.wmnet [production]
09:13 <hnowlan@cumin1002> START - Cookbook sre.k8s.pool-depool-node depool for host mw2380.codfw.wmnet [production]
09:13 <ayounsi@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on idp-test2005.wikimedia.org with reason: host reimage [production]
09:06 <aqu@deploy1003> Finished deploy [airflow-dags/analytics_test@cb0bc4d]: Test Refine through Airflow (duration: 00m 11s) [production]
09:06 <aqu@deploy1003> Started deploy [airflow-dags/analytics_test@cb0bc4d]: Test Refine through Airflow [production]
08:59 <ayounsi@cumin1002> START - Cookbook sre.hosts.reimage for host idp-test2005.wikimedia.org with OS bookworm [production]
08:58 <ayounsi@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM idp-test2005.wikimedia.org - ayounsi@cumin1002" [production]
08:58 <ayounsi@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.ganeti.makevm: created new VM idp-test2005.wikimedia.org - ayounsi@cumin1002" [production]
08:58 <ayounsi@cumin1002> END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) idp-test2005.wikimedia.org on all recursors [production]
08:58 <ayounsi@cumin1002> START - Cookbook sre.dns.wipe-cache idp-test2005.wikimedia.org on all recursors [production]
08:58 <ayounsi@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
08:58 <ayounsi@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM idp-test2005.wikimedia.org - ayounsi@cumin1002" [production]
08:58 <ayounsi@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add records for VM idp-test2005.wikimedia.org - ayounsi@cumin1002" [production]
08:51 <ayounsi@cumin1002> START - Cookbook sre.dns.netbox [production]
08:51 <ayounsi@cumin1002> START - Cookbook sre.ganeti.makevm for new host idp-test2005.wikimedia.org [production]
08:41 <hashar@deploy1003> rebuilt and synchronized wikiversions files: group2 to 1.43.0-wmf.20 refs T366965 [production]
07:53 <brouberol@cumin1002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host snapshot1011.eqiad.wmnet [production]
07:47 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host snapshot1011.eqiad.wmnet [production]
07:46 <brouberol@cumin1002> END (FAIL) - Cookbook sre.hosts.reboot-single (exit_code=99) for host snapshot1011.eqiad.wmnet [production]
07:46 <brouberol@cumin1002> START - Cookbook sre.hosts.reboot-single for host snapshot1011.eqiad.wmnet [production]
07:39 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: Testing [production]
07:39 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 1 day, 6:00:00 on db1125.eqiad.wmnet with reason: Testing [production]
07:00 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2219 (T371742)', diff saved to https://phabricator.wikimedia.org/P68124 and previous config saved to /var/cache/conftool/dbconfig/20240829-070017-ladsgroup.json [production]
06:55 <kcvelaga@deploy1003> Finished deploy [airflow-dags/analytics_product@cb0bc4d]: (no justification provided) (duration: 00m 03s) [production]
06:55 <kcvelaga@deploy1003> Started deploy [airflow-dags/analytics_product@cb0bc4d]: (no justification provided) [production]
06:45 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2219', diff saved to https://phabricator.wikimedia.org/P68123 and previous config saved to /var/cache/conftool/dbconfig/20240829-064508-ladsgroup.json [production]