4551-4600 of 10000 results (119ms)
2024-08-27 ยง
15:15 <kamila@cumin1002> START - Cookbook sre.dns.netbox [production]
15:15 <kamila@cumin1002> START - Cookbook sre.hosts.rename from kubernetes2019 to wikikube-worker2044 [production]
15:11 <elukey> restart httpd and librenms-syslog.service on netmon1003 for libaom upgrades [production]
15:11 <elukey> restart httpd on crm2001 for libaom upgrades [production]
15:09 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1247 (T371742)', diff saved to https://phabricator.wikimedia.org/P67933 and previous config saved to /var/cache/conftool/dbconfig/20240827-150952-ladsgroup.json [production]
15:09 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1247.eqiad.wmnet with reason: Maintenance [production]
15:09 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 12:00:00 on db1247.eqiad.wmnet with reason: Maintenance [production]
15:05 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2176 (re)pooling @ 75%: post maintenance', diff saved to https://phabricator.wikimedia.org/P67932 and previous config saved to /var/cache/conftool/dbconfig/20240827-150525-arnaudb.json [production]
15:02 <elukey@puppetserver1001> conftool action : set/pooled=yes; selector: name=wikikube-ctrl2003.codfw.wmnet [production]
15:01 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host wikikube-ctrl2003.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
15:00 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P67931 and previous config saved to /var/cache/conftool/dbconfig/20240827-150041-ladsgroup.json [production]
14:57 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2232.codfw.wmnet with OS bookworm [production]
14:54 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2231.codfw.wmnet with OS bookworm [production]
14:51 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db2230.codfw.wmnet with OS bookworm [production]
14:50 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2176 (re)pooling @ 50%: post maintenance', diff saved to https://phabricator.wikimedia.org/P67930 and previous config saved to /var/cache/conftool/dbconfig/20240827-145020-arnaudb.json [production]
14:45 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191', diff saved to https://phabricator.wikimedia.org/P67929 and previous config saved to /var/cache/conftool/dbconfig/20240827-144534-ladsgroup.json [production]
14:44 <elukey@cumin1002> START - Cookbook sre.hosts.provision for host wikikube-ctrl2003.mgmt.codfw.wmnet with reboot policy GRACEFUL [production]
14:42 <akosiaris@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) depool for host mw2293.codfw.wmnet [production]
14:41 <akosiaris@cumin1002> START - Cookbook sre.k8s.pool-depool-node depool for host mw2293.codfw.wmnet [production]
14:41 <elukey@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 0:30:00 on wikikube-ctrl2003.codfw.wmnet with reason: running provision again [production]
14:41 <elukey@cumin1002> START - Cookbook sre.hosts.downtime for 0:30:00 on wikikube-ctrl2003.codfw.wmnet with reason: running provision again [production]
14:41 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2232.codfw.wmnet with reason: host reimage [production]
14:40 <elukey@puppetserver1001> conftool action : set/pooled=no; selector: name=wikikube-ctrl2003.codfw.wmnet [production]
14:37 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2231.codfw.wmnet with reason: host reimage [production]
14:35 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2176 (re)pooling @ 25%: post maintenance', diff saved to https://phabricator.wikimedia.org/P67928 and previous config saved to /var/cache/conftool/dbconfig/20240827-143514-arnaudb.json [production]
14:35 <arnaudb@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2230.codfw.wmnet with reason: host reimage [production]
14:32 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2231.codfw.wmnet with reason: host reimage [production]
14:32 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2232.codfw.wmnet with reason: host reimage [production]
14:31 <arnaudb@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on db2230.codfw.wmnet with reason: host reimage [production]
14:30 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1191 (T370903)', diff saved to https://phabricator.wikimedia.org/P67927 and previous config saved to /var/cache/conftool/dbconfig/20240827-143027-ladsgroup.json [production]
14:29 <brouberol@cumin1002> END (PASS) - Cookbook sre.dns.netbox (exit_code=0) [production]
14:29 <brouberol@cumin1002> END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Adding AAAA field to wdqs101[1-3] and wdqs200[7-8] - brouberol@cumin1002" [production]
14:29 <brouberol@cumin1002> START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Adding AAAA field to wdqs101[1-3] and wdqs200[7-8] - brouberol@cumin1002" [production]
14:26 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8 days, 0:00:00 on db2186.codfw.wmnet with reason: Schema change [production]
14:26 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 8 days, 0:00:00 on db2186.codfw.wmnet with reason: Schema change [production]
14:26 <marostegui@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 12:00:00 on db2186.codfw.wmnet with reason: Schema change [production]
14:26 <brouberol@cumin1002> START - Cookbook sre.dns.netbox [production]
14:26 <marostegui@cumin1002> START - Cookbook sre.hosts.downtime for 2 days, 12:00:00 on db2186.codfw.wmnet with reason: Schema change [production]
14:25 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Depooling db1191 (T370903)', diff saved to https://phabricator.wikimedia.org/P67926 and previous config saved to /var/cache/conftool/dbconfig/20240827-142516-ladsgroup.json [production]
14:25 <ladsgroup@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db1191.eqiad.wmnet with reason: Maintenance [production]
14:24 <ladsgroup@cumin1002> START - Cookbook sre.hosts.downtime for 8:00:00 on db1191.eqiad.wmnet with reason: Maintenance [production]
14:24 <ladsgroup@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1181 (T370903)', diff saved to https://phabricator.wikimedia.org/P67925 and previous config saved to /var/cache/conftool/dbconfig/20240827-142454-ladsgroup.json [production]
14:24 <marostegui> Update zarcillo db for pc4 master T373340 [production]
14:20 <akosiaris> T372878 uncordon wikikube-worker2043 [production]
14:20 <akosiaris> T327878 uncordon wikikube-worker2043 [production]
14:20 <arnaudb@cumin1002> dbctl commit (dc=all): 'db2176 (re)pooling @ 15%: post maintenance', diff saved to https://phabricator.wikimedia.org/P67924 and previous config saved to /var/cache/conftool/dbconfig/20240827-142009-arnaudb.json [production]
14:18 <tappof@cumin2002> END (FAIL) - Cookbook sre.opensearch.roll-restart-reboot (exit_code=99) rolling restart_daemons on P{O:logging::opensearch::data and logs*.codfw.wmnet} and (A:datahubsearch or A:logstash-eqiad or A:logstash-codfw) [production]
14:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Switch pc4 master to pc2015 T373340', diff saved to https://phabricator.wikimedia.org/P67923 and previous config saved to /var/cache/conftool/dbconfig/20240827-141845-marostegui.json [production]
14:18 <arnaudb@cumin1002> START - Cookbook sre.hosts.reimage for host db2232.codfw.wmnet with OS bookworm [production]
14:18 <arnaudb@cumin1002> START - Cookbook sre.hosts.reimage for host db2231.codfw.wmnet with OS bookworm [production]