2251-2300 of 10000 results (120ms)
2025-01-28 ยง
18:03 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1236 (T384592)', diff saved to https://phabricator.wikimedia.org/P72652 and previous config saved to /var/cache/conftool/dbconfig/20250128-180335-marostegui.json [production]
17:48 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1236', diff saved to https://phabricator.wikimedia.org/P72651 and previous config saved to /var/cache/conftool/dbconfig/20250128-174828-marostegui.json [production]
17:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1236', diff saved to https://phabricator.wikimedia.org/P72650 and previous config saved to /var/cache/conftool/dbconfig/20250128-173321-marostegui.json [production]
17:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1236 (T384592)', diff saved to https://phabricator.wikimedia.org/P72649 and previous config saved to /var/cache/conftool/dbconfig/20250128-171814-marostegui.json [production]
17:12 <marostegui@cumin1002> dbctl commit (dc=all): 'db1166 (re)pooling @ 100%: Repooling', diff saved to https://phabricator.wikimedia.org/P72648 and previous config saved to /var/cache/conftool/dbconfig/20250128-171205-root.json [production]
17:10 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs4010.ulsfo.wmnet with OS bookworm [production]
17:06 <rzl> stopping puppet on A:cp-text [production]
17:05 <cmooney@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on netflow3003.esams.wmnet with reason: disabling alerts as I'm running gnmic manually rather than with systemd [production]
17:04 <marostegui@cumin1002> dbctl commit (dc=all): 'Depooling db1236 (T384592)', diff saved to https://phabricator.wikimedia.org/P72647 and previous config saved to /var/cache/conftool/dbconfig/20250128-170412-marostegui.json [production]
17:04 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1236.eqiad.wmnet with reason: Maintenance [production]
17:03 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227 (T384592)', diff saved to https://phabricator.wikimedia.org/P72646 and previous config saved to /var/cache/conftool/dbconfig/20250128-170350-marostegui.json [production]
17:01 <cmooney@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for cr[1-2]-magru,cr[1-2]-magru IPv6 [production]
17:01 <cmooney@cumin1002> START - Cookbook sre.hosts.remove-downtime for cr[1-2]-magru,cr[1-2]-magru IPv6 [production]
16:57 <marostegui@cumin1002> dbctl commit (dc=all): 'db1166 (re)pooling @ 75%: Repooling', diff saved to https://phabricator.wikimedia.org/P72645 and previous config saved to /var/cache/conftool/dbconfig/20250128-165700-root.json [production]
16:53 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:53 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:52 <elukey> restart kartotherian on maps1009 as test [production]
16:51 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:51 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:48 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P72644 and previous config saved to /var/cache/conftool/dbconfig/20250128-164843-marostegui.json [production]
16:42 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:42 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:42 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:42 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:41 <marostegui@cumin1002> dbctl commit (dc=all): 'db1166 (re)pooling @ 50%: Repooling', diff saved to https://phabricator.wikimedia.org/P72643 and previous config saved to /var/cache/conftool/dbconfig/20250128-164154-root.json [production]
16:41 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:41 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:41 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:41 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:40 <jhancock@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:40 <jhancock@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy FORCE_RESTART [production]
16:37 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs4010.ulsfo.wmnet with reason: host reimage [production]
16:33 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227', diff saved to https://phabricator.wikimedia.org/P72642 and previous config saved to /var/cache/conftool/dbconfig/20250128-163336-marostegui.json [production]
16:33 <vgutierrez@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs4010.ulsfo.wmnet with reason: host reimage [production]
16:26 <root@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2177.codfw.wmnet with reason: Index rebuild [production]
16:26 <marostegui@cumin1002> dbctl commit (dc=all): 'db1166 (re)pooling @ 25%: Repooling', diff saved to https://phabricator.wikimedia.org/P72641 and previous config saved to /var/cache/conftool/dbconfig/20250128-162649-root.json [production]
16:26 <root@cumin1002> END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db2177.codfw.wmnet [production]
16:25 <jhancock@cumin2002> END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host gerrit2002 [production]
16:25 <jhancock@cumin2002> START - Cookbook sre.network.configure-switch-interfaces for host gerrit2002 [production]
16:25 <marostegui@cumin1002> DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2177.codfw.wmnet with reason: maintenance [production]
16:20 <reedy@deploy2002> Finished scap sync-world: Backport for [[gerrit:1114701|FormatMetadata: Prevent running preg_match() on null (T384879)]], [[gerrit:1114702|FormatMetadata: Prevent running preg_match() on null (T384879)]] (duration: 12m 12s) [production]
16:19 <root@cumin1002> START - Cookbook sre.mysql.upgrade for db2177.codfw.wmnet [production]
16:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Depool db2177 T382842', diff saved to https://phabricator.wikimedia.org/P72640 and previous config saved to /var/cache/conftool/dbconfig/20250128-161857-marostegui.json [production]
16:18 <marostegui@cumin1002> dbctl commit (dc=all): 'Repooling after maintenance db1227 (T384592)', diff saved to https://phabricator.wikimedia.org/P72639 and previous config saved to /var/cache/conftool/dbconfig/20250128-161829-marostegui.json [production]
16:15 <cgoubert@deploy2002> helmfile [eqiad] DONE helmfile.d/services/mw-videoscaler: apply [production]
16:15 <cgoubert@deploy2002> helmfile [eqiad] START helmfile.d/services/mw-videoscaler: apply [production]
16:15 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reimage for host lvs4010.ulsfo.wmnet with OS bookworm [production]
16:14 <jayme@cumin1002> END (PASS) - Cookbook sre.k8s.pool-depool-node (exit_code=0) pool for host wikikube-worker[2013,2036,2088].codfw.wmnet [production]
16:14 <jayme@cumin1002> END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for wikikube-worker[2013,2036,2088].codfw.wmnet [production]
16:14 <jayme@cumin1002> START - Cookbook sre.hosts.remove-downtime for wikikube-worker[2013,2036,2088].codfw.wmnet [production]