651-700 of 10000 results (108ms)
2024-12-04 ยง
10:22 <jayme@cumin2002> START - Cookbook sre.hosts.rename from mw2442 to wikikube-worker20160 [production]
10:22 <brouberol@cumin2002> START - Cookbook sre.hosts.decommission for hosts an-presto1004.eqiad.wmnet [production]
10:21 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host ms-be1086.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
10:21 <jayme@cumin2002> START - Cookbook sre.dns.netbox [production]
10:20 <jayme@cumin2002> START - Cookbook sre.hosts.rename from mw2440 to wikikube-worker2015 [production]
10:19 <brouberol@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts an-presto1003.eqiad.wmnet [production]
10:19 <brouberol@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:19 <brouberol@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-presto1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin2002" [production]
10:19 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be1086.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
10:19 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host ms-be1086.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
10:19 <elukey@cumin1002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host ms-be1086.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
10:19 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host ms-be1086.mgmt.eqiad.wmnet with chassis set policy FORCE_RESTART [production]
10:13 <brouberol@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-presto1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin2002" [production]
10:10 <brouberol@cumin2002> START - Cookbook sre.dns.netbox [production]
10:09 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on 8 hosts with reason: Rebooting [production]
10:09 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on 8 hosts with reason: Rebooting [production]
10:04 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti2018.codfw.wmnet [production]
10:04 <brouberol@cumin2002> START - Cookbook sre.hosts.decommission for hosts an-presto1003.eqiad.wmnet [production]
10:03 <brouberol@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts an-presto1002.eqiad.wmnet [production]
10:03 <brouberol@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
10:03 <brouberol@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-presto1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin2002" [production]
10:02 <brouberol@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-presto1002.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin2002" [production]
09:58 <brouberol@cumin2002> START - Cookbook sre.dns.netbox [production]
09:56 <godog> bump space for prometheus k8s-mlserve in eqiad [production]
09:50 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2444.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:50 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2443.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:46 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2440.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:39 <brouberol@cumin2002> START - Cookbook sre.hosts.decommission for hosts an-presto1002.eqiad.wmnet [production]
09:36 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on es2024.codfw.wmnet with reason: cloning [production]
09:35 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on es2024.codfw.wmnet with reason: cloning [production]
09:35 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool es2024 to clone es2045', diff saved to https://phabricator.wikimedia.org/P71535 and previous config saved to /var/cache/conftool/dbconfig/20241204-093541-marostegui.json [production]
09:35 <marostegui@cumin1002> dbctl commit (dc=all): 'Promote es2023 to es5 master T381259', diff saved to https://phabricator.wikimedia.org/P71534 and previous config saved to /var/cache/conftool/dbconfig/20241204-093519-marostegui.json [production]
09:35 <brouberol@cumin2002> END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts an-presto1001.eqiad.wmnet [production]
09:35 <brouberol@cumin2002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
09:35 <brouberol@cumin2002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-presto1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin2002" [production]
09:34 <brouberol@cumin2002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: an-presto1001.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - brouberol@cumin2002" [production]
09:33 <jayme@cumin2002> START - Cookbook sre.hosts.provision for host mw2444.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:32 <jayme@cumin2002> START - Cookbook sre.hosts.provision for host mw2443.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:32 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host mw2442.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:30 <brouberol@cumin2002> START - Cookbook sre.dns.netbox [production]
09:21 <brouberol@cumin2002> START - Cookbook sre.hosts.decommission for hosts an-presto1001.eqiad.wmnet [production]
09:15 <jayme@cumin2002> START - Cookbook sre.hosts.provision for host mw2442.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:14 <jayme@cumin2002> START - Cookbook sre.hosts.provision for host mw2440.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART and with Dell SCP reboot policy GRACEFUL [production]
09:12 <jayme@cumin2002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on mw[2440,2442-2444].codfw.wmnet with reason: T377877 [production]
09:12 <marostegui@cumin1002> dbctl commit (dc=all): 'es2046 (re)pooling @ 100%: Pooling in es5', diff saved to https://phabricator.wikimedia.org/P71533 and previous config saved to /var/cache/conftool/dbconfig/20241204-091229-root.json [production]
09:12 <jayme@cumin2002> START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on mw[2440,2442-2444].codfw.wmnet with reason: T377877 [production]
09:07 <jayme@cumin2002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host mw[2440,2442-2444].codfw.wmnet [production]
09:05 <jayme@cumin2002> START - Cookbook sre.k8s.pool-depool-node depool for host mw[2440,2442-2444].codfw.wmnet [production]
08:57 <marostegui@cumin1002> dbctl commit (dc=all): 'es2046 (re)pooling @ 75%: Pooling in es5', diff saved to https://phabricator.wikimedia.org/P71532 and previous config saved to /var/cache/conftool/dbconfig/20241204-085724-root.json [production]
08:52 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on es2022.codfw.wmnet with reason: cloning [production]