production SAL

451-500 of 10000 results (165ms)

2026-07-29 §
19:24	<cwilliams@cumin1003>	dbctl commit (dc=all): 'Depooling db1231 (T431660)', diff saved to https://phabricator.wikimedia.org/P95714 and previous config saved to /var/cache/conftool/dbconfig/20260729-192454-cwilliams.json	[production]
19:24	<cwilliams@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1231.eqiad.wmnet with reason: Maintenance	[production]
19:24	<root@cumin1003>	END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1227: Maintenance	[production]
19:22	<vriley@cumin1003>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cloudvirt1048	[production]
19:22	<vriley@cumin1003>	START - Cookbook sre.network.configure-switch-interfaces for host cloudvirt1048	[production]
19:21	<vriley@cumin1003>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
19:21	<vriley@cumin1003>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt [cloudvirt1048] - vriley@cumin1003"	[production]
19:21	<vriley@cumin1003>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update mgmt [cloudvirt1048] - vriley@cumin1003"	[production]
19:19	<dduvall@deploy1003>	rebuilt and synchronized wikiversions files: group0 to 1.47.0-wmf.13 refs T430832	[production]
19:18	<root@cumin1003>	END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1251: Maintenance	[production]
19:17	<cwilliams@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1252 (T431660)', diff saved to https://phabricator.wikimedia.org/P95711 and previous config saved to /var/cache/conftool/dbconfig/20260729-191756-cwilliams.json	[production]
19:16	<vriley@cumin1003>	START - Cookbook sre.dns.netbox	[production]
19:11	<dduvall>	rolling back wmf.13 to group0 due to T433457 (cc T430832)	[production]
19:07	<cwilliams@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1252', diff saved to https://phabricator.wikimedia.org/P95709 and previous config saved to /var/cache/conftool/dbconfig/20260729-190748-cwilliams.json	[production]
19:01	<bking@cumin2003>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs2022.codfw.wmnet with OS bookworm	[production]
19:01	<bking@cumin2003>	END (PASS) - Cookbook sre.wdqs.data-transfer (exit_code=0) (T430880, restore data on newly-reimaged host) xfer wdqs-all from wdqs2015.codfw.wmnet -> wdqs2021.codfw.wmnet, repooling source-only afterwards	[production]
18:57	<cwilliams@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1252', diff saved to https://phabricator.wikimedia.org/P95707 and previous config saved to /var/cache/conftool/dbconfig/20260729-185740-cwilliams.json	[production]
18:47	<cwilliams@cumin1003>	dbctl commit (dc=all): 'Repooling after maintenance db1252 (T431660)', diff saved to https://phabricator.wikimedia.org/P95704 and previous config saved to /var/cache/conftool/dbconfig/20260729-184732-cwilliams.json	[production]
18:37	<root@cumin1003>	START - Cookbook sre.mysql.pool pool db1227: Maintenance	[production]
18:34	<bking@cumin2003>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2022.codfw.wmnet with reason: host reimage	[production]
18:31	<cwilliams@cumin1003>	dbctl commit (dc=all): 'Depooling db1227 (T431660)', diff saved to https://phabricator.wikimedia.org/P95701 and previous config saved to /var/cache/conftool/dbconfig/20260729-183117-cwilliams.json	[production]
18:31	<cwilliams@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1227.eqiad.wmnet with reason: Maintenance	[production]
18:30	<root@cumin1003>	END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1202: Maintenance	[production]
18:30	<root@cumin1003>	START - Cookbook sre.mysql.pool pool db1251: Maintenance	[production]
18:27	<bking@cumin2003>	START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2022.codfw.wmnet with reason: host reimage	[production]
18:24	<cwilliams@cumin1003>	dbctl commit (dc=all): 'Depooling db1251 (T431660)', diff saved to https://phabricator.wikimedia.org/P95698 and previous config saved to /var/cache/conftool/dbconfig/20260729-182428-cwilliams.json	[production]
18:24	<brett@cumin2002>	END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for lvs2014.codfw.wmnet	[production]
18:24	<brett@cumin2002>	START - Cookbook sre.hosts.remove-downtime for lvs2014.codfw.wmnet	[production]
18:24	<cwilliams@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1251.eqiad.wmnet with reason: Maintenance	[production]
18:23	<root@cumin1003>	END (PASS) - Cookbook sre.mysql.pool (exit_code=0) pool db1235: Maintenance	[production]
18:22	<brett@cumin2002>	END (PASS) - Cookbook sre.loadbalancer.upgrade (exit_code=0) restart A:liberica-eqsin (T428495)	[production]
18:19	<brett@cumin2002>	START - Cookbook sre.loadbalancer.upgrade restart A:liberica-eqsin (T428495)	[production]
18:19	<brett@cumin2002>	END (PASS) - Cookbook sre.loadbalancer.upgrade (exit_code=0) restart A:liberica-ulsfo (T428495)	[production]
18:17	<brett@cumin2002>	START - Cookbook sre.loadbalancer.upgrade restart A:liberica-ulsfo (T428495)	[production]
18:17	<dduvall@deploy1003>	rebuilt and synchronized wikiversions files: group1 to 1.47.0-wmf.13 refs T430832	[production]
18:16	<mutante>	removing jenkins during the train - living on the edge - no, just kidding, jenkins has migrated to dedicated machines, nothing should happen	[production]
18:15	<brett@cumin2002>	END (ERROR) - Cookbook sre.loadbalancer.restart-pybal (exit_code=97) rolling-restart of pybal on P{lvs2014.codfw.wmnet} and A:lvs (T428495)	[production]
18:15	<mutante>	CI: contint1002/contint2002: apt-get remove --purge jenkins - jenkins be gone - T418521	[production]
18:13	<brett@cumin2002>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on P{lvs2014.codfw.wmnet} and A:lvs (T428495)	[production]
18:08	<bking@cumin2003>	END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host wdqs2022	[production]
18:08	<bking@cumin2003>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wdqs2022	[production]
18:03	<swfrench-wmf>	restarted navtiming on webperf2003 - T428495	[production]
18:03	<bking@cumin2003>	START - Cookbook sre.network.configure-switch-interfaces for host wdqs2022	[production]
18:02	<bking@cumin2003>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) wdqs2022.codfw.wmnet 211.48.192.10.in-addr.arpa 1.1.2.0.8.4.0.0.2.9.1.0.0.1.0.0.4.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors	[production]
18:02	<bking@cumin2003>	START - Cookbook sre.dns.wipe-cache wdqs2022.codfw.wmnet 211.48.192.10.in-addr.arpa 1.1.2.0.8.4.0.0.2.9.1.0.0.1.0.0.4.0.1.0.0.6.8.0.0.0.0.0.0.2.6.2.ip6.arpa on all recursors	[production]
18:02	<bking@cumin2003>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
18:02	<bking@cumin2003>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wdqs2022 - bking@cumin2003"	[production]
18:02	<bking@cumin2003>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Update records for host wdqs2022 - bking@cumin2003"	[production]
17:57	<bking@cumin2003>	START - Cookbook sre.dns.netbox	[production]
17:56	<brett@cumin2002>	END (FAIL) - Cookbook sre.loadbalancer.restart-pybal (exit_code=1) rolling-restart of pybal on A:lvs-codfw and A:lvs (T428495)	[production]