production SAL

2022-12-01
00:17	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2153 (T322618)', diff saved to https://phabricator.wikimedia.org/P41972 and previous config saved to /var/cache/conftool/dbconfig/20221201-001659-ladsgroup.json	[production]
00:14	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2153 (T322618)', diff saved to https://phabricator.wikimedia.org/P41971 and previous config saved to /var/cache/conftool/dbconfig/20221201-001449-ladsgroup.json	[production]
00:14	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2153.codfw.wmnet with reason: Maintenance	[production]
00:14	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db2153.codfw.wmnet with reason: Maintenance	[production]
00:14	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2146 (T322618)', diff saved to https://phabricator.wikimedia.org/P41970 and previous config saved to /var/cache/conftool/dbconfig/20221201-001427-ladsgroup.json	[production]
00:10	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1206.eqiad.wmnet with reason: host reimage	[production]
00:07	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on db1206.eqiad.wmnet with reason: host reimage	[production]
00:04	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1119', diff saved to https://phabricator.wikimedia.org/P41969 and previous config saved to /var/cache/conftool/dbconfig/20221201-000458-ladsgroup.json	[production]
2022-11-30
23:59	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2146', diff saved to https://phabricator.wikimedia.org/P41968 and previous config saved to /var/cache/conftool/dbconfig/20221130-235921-ladsgroup.json	[production]
23:54	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host db1206.eqiad.wmnet with OS bullseye	[production]
23:49	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1119 (T322618)', diff saved to https://phabricator.wikimedia.org/P41967 and previous config saved to /var/cache/conftool/dbconfig/20221130-234952-ladsgroup.json	[production]
23:48	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['db1206']	[production]
23:48	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1119 (T322618)', diff saved to https://phabricator.wikimedia.org/P41966 and previous config saved to /var/cache/conftool/dbconfig/20221130-234844-ladsgroup.json	[production]
23:48	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1119.eqiad.wmnet with reason: Maintenance	[production]
23:48	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1119.eqiad.wmnet with reason: Maintenance	[production]
23:48	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1118 (T322618)', diff saved to https://phabricator.wikimedia.org/P41965 and previous config saved to /var/cache/conftool/dbconfig/20221130-234821-ladsgroup.json	[production]
23:44	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2146', diff saved to https://phabricator.wikimedia.org/P41964 and previous config saved to /var/cache/conftool/dbconfig/20221130-234414-ladsgroup.json	[production]
23:36	<pt1979@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['db1206']	[production]
23:36	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['db1206']	[production]
23:35	<pt1979@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['db1206']	[production]
23:33	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['db1206']	[production]
23:33	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1118', diff saved to https://phabricator.wikimedia.org/P41963 and previous config saved to /var/cache/conftool/dbconfig/20221130-233314-ladsgroup.json	[production]
23:32	<pt1979@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['db1206']	[production]
23:32	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['db1206']	[production]
23:30	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['sretest2001']	[production]
23:30	<pt1979@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['sretest2001']	[production]
23:29	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2146 (T322618)', diff saved to https://phabricator.wikimedia.org/P41962 and previous config saved to /var/cache/conftool/dbconfig/20221130-232908-ladsgroup.json	[production]
23:26	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2146 (T322618)', diff saved to https://phabricator.wikimedia.org/P41961 and previous config saved to /var/cache/conftool/dbconfig/20221130-232658-ladsgroup.json	[production]
23:26	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db2146.codfw.wmnet with reason: Maintenance	[production]
23:26	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db2146.codfw.wmnet with reason: Maintenance	[production]
23:26	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2145 (T322618)', diff saved to https://phabricator.wikimedia.org/P41960 and previous config saved to /var/cache/conftool/dbconfig/20221130-232637-ladsgroup.json	[production]
23:24	<pt1979@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['db1206']	[production]
23:24	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host db1206.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
23:22	<brett@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cp5025.eqsin.wmnet with OS buster	[production]
23:18	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1118', diff saved to https://phabricator.wikimedia.org/P41959 and previous config saved to /var/cache/conftool/dbconfig/20221130-231808-ladsgroup.json	[production]
23:11	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2145', diff saved to https://phabricator.wikimedia.org/P41958 and previous config saved to /var/cache/conftool/dbconfig/20221130-231130-ladsgroup.json	[production]
23:06	<tgr>	running GrowthExperiments refreshUserImpactData.php (and generating a bunch of AQS requests) for T323958	[production]
23:03	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1118 (T322618)', diff saved to https://phabricator.wikimedia.org/P41957 and previous config saved to /var/cache/conftool/dbconfig/20221130-230301-ladsgroup.json	[production]
23:01	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db1118 (T322618)', diff saved to https://phabricator.wikimedia.org/P41956 and previous config saved to /var/cache/conftool/dbconfig/20221130-230154-ladsgroup.json	[production]
23:01	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on db1118.eqiad.wmnet with reason: Maintenance	[production]
23:01	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on db1118.eqiad.wmnet with reason: Maintenance	[production]
23:01	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1107 (T322618)', diff saved to https://phabricator.wikimedia.org/P41955 and previous config saved to /var/cache/conftool/dbconfig/20221130-230132-ladsgroup.json	[production]
22:57	<tgr>	UTC late deploys done	[production]
22:56	<tgr@deploy1002>	Finished scap: Backport for [[gerrit:862352|Use the right load balancer for UserImpactStore]] (duration: 07m 15s)	[production]
22:56	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2145', diff saved to https://phabricator.wikimedia.org/P41954 and previous config saved to /var/cache/conftool/dbconfig/20221130-225623-ladsgroup.json	[production]
22:50	<tgr@deploy1002>	tgr and tgr: Backport for [[gerrit:862352|Use the right load balancer for UserImpactStore]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet	[production]
22:50	<brett@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cp5025.eqsin.wmnet with reason: host reimage	[production]
22:49	<tgr@deploy1002>	Started scap: Backport for [[gerrit:862352|Use the right load balancer for UserImpactStore]]	[production]
22:46	<brett@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on cp5025.eqsin.wmnet with reason: host reimage	[production]
22:46	<tgr@deploy1002>	Finished scap: Backport for [[gerrit:862351|Use the right load balancer for UserImpactStore]] (duration: 05m 59s)	[production]