production SAL

2025-10-01
13:29	<cgoubert@deploy2002>	helmfile [staging-codfw] DONE helmfile.d/admin 'apply'.	[production]
13:28	<cgoubert@deploy2002>	helmfile [staging-codfw] START helmfile.d/admin 'apply'.	[production]
13:28	<cgoubert@deploy2002>	helmfile [staging-eqiad] DONE helmfile.d/admin 'apply'.	[production]
13:24	<cgoubert@deploy2002>	helmfile [staging-eqiad] START helmfile.d/admin 'apply'.	[production]
13:24	<cgoubert@deploy2002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
13:24	<cgoubert@deploy2002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
13:23	<cgoubert@deploy2002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
13:18	<fceratto@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1254 (T401906)', diff saved to https://phabricator.wikimedia.org/P83566 and previous config saved to /var/cache/conftool/dbconfig/20251001-131836-fceratto.json	[production]
13:17	<fceratto@cumin1002>	dbctl commit (dc=all): 'Depooling db1254 (T401906)', diff saved to https://phabricator.wikimedia.org/P83565 and previous config saved to /var/cache/conftool/dbconfig/20251001-131719-fceratto.json	[production]
13:17	<fceratto@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1254.eqiad.wmnet with reason: Maintenance	[production]
13:16	<fceratto@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1239.eqiad.wmnet with reason: Maintenance	[production]
13:16	<fceratto@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1233 (T401906)', diff saved to https://phabricator.wikimedia.org/P83564 and previous config saved to /var/cache/conftool/dbconfig/20251001-131639-fceratto.json	[production]
13:13	<elukey@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['cp2048.codfw.wmnet']	[production]
13:10	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Repool db1172 after upgrade T406008', diff saved to https://phabricator.wikimedia.org/P83563 and previous config saved to /var/cache/conftool/dbconfig/20251001-131033-ladsgroup.json	[production]
13:07	<ladsgroup@cumin1003>	END (PASS) - Cookbook sre.mysql.pool (exit_code=0) db1258* gradually with 4 steps - Work done	[production]
13:01	<fceratto@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1233', diff saved to https://phabricator.wikimedia.org/P83561 and previous config saved to /var/cache/conftool/dbconfig/20251001-130131-fceratto.json	[production]
12:56	<cgoubert@cumin1003>	conftool action : set/pooled=false; selector: dnsdisc=thumbor.*,name=eqiad	[production]
12:53	<cgoubert@cumin1003>	conftool action : set/pooled=false; selector: dnsdisc=swift.*,name=eqiad	[production]
12:51	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Depool db1172 for upgrade T406008', diff saved to https://phabricator.wikimedia.org/P83559 and previous config saved to /var/cache/conftool/dbconfig/20251001-125120-ladsgroup.json	[production]
12:50	<ladsgroup@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on db1172.eqiad.wmnet with reason: Upgrade to 10.11	[production]
12:46	<fceratto@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1233', diff saved to https://phabricator.wikimedia.org/P83558 and previous config saved to /var/cache/conftool/dbconfig/20251001-124622-fceratto.json	[production]
12:31	<fceratto@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1233 (T401906)', diff saved to https://phabricator.wikimedia.org/P83556 and previous config saved to /var/cache/conftool/dbconfig/20251001-123115-fceratto.json	[production]
12:31	<cgoubert@cumin1003>	conftool action : set/pooled=true; selector: dnsdisc=swift.*,name=eqiad	[production]
12:30	<fceratto@cumin1002>	dbctl commit (dc=all): 'Depooling db1233 (T401906)', diff saved to https://phabricator.wikimedia.org/P83555 and previous config saved to /var/cache/conftool/dbconfig/20251001-122959-fceratto.json	[production]
12:29	<fceratto@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db1233.eqiad.wmnet with reason: Maintenance	[production]
12:29	<cgoubert@cumin1003>	conftool action : set/pooled=true; selector: dnsdisc=thumbor.*,name=eqiad	[production]
12:29	<fceratto@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1229 (T401906)', diff saved to https://phabricator.wikimedia.org/P83554 and previous config saved to /var/cache/conftool/dbconfig/20251001-122936-fceratto.json	[production]
12:27	<mvernon@cumin2002>	END (PASS) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=0) rolling restart_daemons on A:swift-fe-codfw	[production]
12:21	<ladsgroup@cumin1003>	START - Cookbook sre.mysql.pool db1258* gradually with 4 steps - Work done	[production]
12:21	<ladsgroup@cumin1003>	END (PASS) - Cookbook sre.mysql.upgrade (exit_code=0) for db1258.eqiad.wmnet	[production]
12:19	<mvernon@cumin2002>	START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-codfw	[production]
12:19	<mvernon@cumin2002>	END (ERROR) - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies (exit_code=97) rolling restart_daemons on A:swift-fe-eqiad	[production]
12:19	<mvernon@cumin2002>	START - Cookbook sre.swift.roll-restart-reboot-swift-ms-proxies rolling restart_daemons on A:swift-fe-eqiad	[production]
12:15	<ladsgroup@cumin1003>	END (PASS) - Cookbook sre.mysql.depool (exit_code=0) db1258 - Upgrading db1258.eqiad.wmnet	[production]
12:15	<ladsgroup@cumin1003>	START - Cookbook sre.mysql.depool db1258 - Upgrading db1258.eqiad.wmnet	[production]
12:15	<ladsgroup@cumin1003>	START - Cookbook sre.mysql.upgrade for db1258.eqiad.wmnet	[production]
12:14	<fceratto@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1229', diff saved to https://phabricator.wikimedia.org/P83552 and previous config saved to /var/cache/conftool/dbconfig/20251001-121429-fceratto.json	[production]
12:13	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Depool db1258 T406116', diff saved to https://phabricator.wikimedia.org/P83551 and previous config saved to /var/cache/conftool/dbconfig/20251001-121339-ladsgroup.json	[production]
12:12	<hnowlan@deploy2002>	helmfile [codfw] DONE helmfile.d/services/thumbor: sync	[production]
12:11	<hnowlan@deploy2002>	helmfile [codfw] START helmfile.d/services/thumbor: sync	[production]
12:08	<hnowlan@deploy2002>	helmfile [codfw] DONE helmfile.d/services/thumbor: apply	[production]
12:08	<hnowlan@deploy2002>	helmfile [codfw] START helmfile.d/services/thumbor: apply	[production]
12:06	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Promote db1255 to x3 primary T406116', diff saved to https://phabricator.wikimedia.org/P83550 and previous config saved to /var/cache/conftool/dbconfig/20251001-120629-ladsgroup.json	[production]
12:06	<hnowlan@deploy2002>	helmfile [codfw] DONE helmfile.d/services/thumbor: apply	[production]
12:06	<hnowlan@deploy2002>	helmfile [codfw] START helmfile.d/services/thumbor: apply	[production]
12:05	<Amir1>	Starting x3 eqiad failover from db1258 to db1255 - T406116	[production]
12:05	<hnowlan@deploy2002>	helmfile [codfw] DONE helmfile.d/admin 'apply'.	[production]
12:04	<hnowlan@deploy2002>	helmfile [codfw] START helmfile.d/admin 'apply'.	[production]
12:01	<ladsgroup@cumin1003>	dbctl commit (dc=all): 'Set db1255 with weight 0 T406116', diff saved to https://phabricator.wikimedia.org/P83549 and previous config saved to /var/cache/conftool/dbconfig/20251001-120140-ladsgroup.json	[production]
12:00	<ladsgroup@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on 16 hosts with reason: Primary switchover x3 T406116	[production]