2025-06-10 §
15:02 <jmm@cumin1003> START - Cookbook sre.hosts.reimage for host install7002.wikimedia.org with OS bookworm [production]
15:02 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1028.eqiad.wmnet [production]
15:02 <jmm@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1028.eqiad.wmnet [production]
15:01 <klausman@cumin1003> START - Cookbook sre.hosts.reboot-single for host ml-lab1001.eqiad.wmnet [production]
15:01 <taavi@dns1004> END - running authdns-update [production]
15:00 <taavi@dns1004> START - running authdns-update [production]
14:58 <taavi@cumin1003> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:58 <taavi@cumin1003> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add wiki replica cloudlb v6 addresses - taavi@cumin1003" [production]
14:58 <taavi@cumin1003> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: add wiki replica cloudlb v6 addresses - taavi@cumin1003" [production]
14:56 <jmm@cumin1003> START - Cookbook sre.hosts.reboot-single for host ganeti1028.eqiad.wmnet [production]
14:55 <jmm@cumin1003> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1028.eqiad.wmnet [production]
14:54 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226', diff saved to https://phabricator.wikimedia.org/P77542 and previous config saved to /var/cache/conftool/dbconfig/20250610-145424-marostegui.json [production]
14:54 <taavi@cumin1003> START - Cookbook sre.dns.netbox [production]
14:53 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:53 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:51 <marostegui@cumin1002> dbctl commit (dc=all): 'Repool pc1 T378715', diff saved to https://phabricator.wikimedia.org/P77541 and previous config saved to /var/cache/conftool/dbconfig/20250610-145137-marostegui.json [production]
14:49 <bking@cumin2002> END (FAIL) - Cookbook sre.hosts.decommission (exit_code=1) for hosts cirrussearch1063.eqiad.wmnet [production]
14:49 <bking@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:49 <bking@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cirrussearch1063.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - bking@cumin2002" [production]
14:49 <bking@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: cirrussearch1063.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - bking@cumin2002" [production]
14:39 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226', diff saved to https://phabricator.wikimedia.org/P77539 and previous config saved to /var/cache/conftool/dbconfig/20250610-143917-marostegui.json [production]
14:36 <bking@cumin2002> START - Cookbook sre.dns.netbox [production]
14:36 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:36 <fceratto@cumin1002> dbctl commit (dc=all): 'Depooling db1227 (T395241)', diff saved to https://phabricator.wikimedia.org/P77538 and previous config saved to /var/cache/conftool/dbconfig/20250610-143623-fceratto.json [production]
14:36 <fceratto@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1227.eqiad.wmnet with reason: Maintenance [production]
14:36 <brouberol@deploy1003> helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow-test-k8s: apply [production]
14:35 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202 (T395241)', diff saved to https://phabricator.wikimedia.org/P77537 and previous config saved to /var/cache/conftool/dbconfig/20250610-143558-fceratto.json [production]
14:29 <jmm@cumin1003> END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host install7002.wikimedia.org with OS bullseye [production]
14:28 <bking@cumin2002> START - Cookbook sre.hosts.decommission for hosts cirrussearch1063.eqiad.wmnet [production]
14:24 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2226 (T396130)', diff saved to https://phabricator.wikimedia.org/P77536 and previous config saved to /var/cache/conftool/dbconfig/20250610-142410-marostegui.json [production]
14:20 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P77535 and previous config saved to /var/cache/conftool/dbconfig/20250610-142051-fceratto.json [production]
14:20 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db2226 (T396130)', diff saved to https://phabricator.wikimedia.org/P77534 and previous config saved to /var/cache/conftool/dbconfig/20250610-142009-marostegui.json [production]
14:20 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2226.codfw.wmnet with reason: Maintenance [production]
14:19 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2225 (T396130)', diff saved to https://phabricator.wikimedia.org/P77533 and previous config saved to /var/cache/conftool/dbconfig/20250610-141946-marostegui.json [production]
14:19 <jmm@cumin1003> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1028.eqiad.wmnet [production]
14:13 <fabfur@dns1004> END - running authdns-update [production]
14:13 <jmm@cumin1003> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1028.eqiad.wmnet [production]
14:13 <jmm@cumin1003> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1028.eqiad.wmnet [production]
14:12 <fabfur@dns1004> START - running authdns-update [production]
14:05 <fceratto@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1202', diff saved to https://phabricator.wikimedia.org/P77532 and previous config saved to /var/cache/conftool/dbconfig/20250610-140544-fceratto.json [production]
14:04 <taavi@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudlb1001.eqiad.wmnet [production]
14:04 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db2225', diff saved to https://phabricator.wikimedia.org/P77531 and previous config saved to /var/cache/conftool/dbconfig/20250610-140439-marostegui.json [production]
13:57 <jmm@cumin1003> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1028.eqiad.wmnet [production]
13:56 <taavi@cumin1003> START - Cookbook sre.hosts.reboot-single for host cloudlb1001.eqiad.wmnet [production]
13:55 <taavi@cumin1003> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cloudlb1002.eqiad.wmnet [production]
13:51 <jforrester@deploy1003> helmfile [eqiad] DONE helmfile.d/services/wikifunctions: apply [production]
13:51 <jforrester@deploy1003> helmfile [eqiad] START helmfile.d/services/wikifunctions: apply [production]
13:51 <jforrester@deploy1003> helmfile [codfw] DONE helmfile.d/services/wikifunctions: apply [production]
13:50 <jforrester@deploy1003> helmfile [codfw] START helmfile.d/services/wikifunctions: apply [production]
13:50 <jmm@cumin1003> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1027.eqiad.wmnet [production]