production SAL

2026-06-25
09:15	<cwilliams@cumin1003>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db2236.codfw.wmnet with reason: host reimage	[production]
09:13	<elukey@cumin1003>	START - Cookbook sre.hosts.reimage for host kafka-logging2008.codfw.wmnet with OS trixie	[production]
09:12	<elukey@cumin1003>	START - Cookbook sre.hosts.reimage for host kafka-logging2007.codfw.wmnet with OS trixie	[production]
09:11	<cwilliams@cumin1003>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1221.eqiad.wmnet with reason: host reimage	[production]
09:11	<cwilliams@cumin1003>	START - Cookbook sre.hosts.downtime for 2:00:00 on db2236.codfw.wmnet with reason: host reimage	[production]
09:08	<elukey@cumin1003>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on kafka-logging2006.codfw.wmnet with reason: host reimage	[production]
09:06	<marostegui@cumin1003>	conftool action : set/weight=100; selector: name=clouddb1026.eqiad.wmnet	[production]
09:06	<cwilliams@cumin1003>	START - Cookbook sre.hosts.downtime for 2:00:00 on db1221.eqiad.wmnet with reason: host reimage	[production]
09:05	<jforrester@deploy1003>	Finished scap sync-world: Backport for [[gerrit:1305599|On AW article deletion, clear all AWArticleStore from sections and metadata (T429873)]], [[gerrit:1305600|AWStorage: Use global stash keys (T430060)]] (duration: 07m 29s)	[production]
09:05	<elukey@cumin1003>	START - Cookbook sre.hosts.downtime for 2:00:00 on kafka-logging2006.codfw.wmnet with reason: host reimage	[production]
09:00	<jforrester@deploy1003>	jforrester: Continuing with deployment	[production]
09:00	<jforrester@deploy1003>	jforrester: Backport for [[gerrit:1305599|On AW article deletion, clear all AWArticleStore from sections and metadata (T429873)]], [[gerrit:1305600|AWStorage: Use global stash keys (T430060)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there.	[production]
08:58	<brouberol@deploy1003>	helmfile [dse-k8s-codfw] DONE helmfile.d/admin 'apply'.	[production]
08:58	<jforrester@deploy1003>	Started scap sync-world: Backport for [[gerrit:1305599|On AW article deletion, clear all AWArticleStore from sections and metadata (T429873)]], [[gerrit:1305600|AWStorage: Use global stash keys (T430060)]]	[production]
08:57	<brouberol@deploy1003>	helmfile [dse-k8s-codfw] START helmfile.d/admin 'apply'.	[production]
08:57	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
08:56	<brouberol@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
08:55	<marostegui@cumin1003>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host db2234.codfw.wmnet with OS trixie	[production]
08:54	<cwilliams@cumin1003>	START - Cookbook sre.hosts.reimage for host db2236.codfw.wmnet with OS trixie	[production]
08:53	<cwilliams@cumin1003>	END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db2236: Upgrading db2236.codfw.wmnet	[production]
08:52	<cwilliams@cumin1003>	START - Cookbook sre.mysql.depool depool db2236: Upgrading db2236.codfw.wmnet	[production]
08:52	<cwilliams@cumin1003>	dbmaint on s4@codfw T429893	[production]
08:52	<cwilliams@cumin1003>	START - Cookbook sre.mysql.major-upgrade	[production]
08:50	<cwilliams@cumin1003>	START - Cookbook sre.hosts.reimage for host db1221.eqiad.wmnet with OS trixie	[production]
08:48	<cwilliams@cumin1003>	END (PASS) - Cookbook sre.mysql.depool (exit_code=0) depool db1221: Upgrading db1221.eqiad.wmnet	[production]
08:48	<cwilliams@cumin1003>	START - Cookbook sre.mysql.depool depool db1221: Upgrading db1221.eqiad.wmnet	[production]
08:47	<cwilliams@cumin1003>	dbmaint on s4@eqiad T429893	[production]
08:47	<cwilliams@cumin1003>	START - Cookbook sre.mysql.major-upgrade	[production]
08:47	<elukey@cumin1003>	START - Cookbook sre.hosts.reimage for host kafka-logging2006.codfw.wmnet with OS trixie	[production]
08:45	<marostegui@cumin1003>	conftool action : set/weight=30; selector: name=clouddb1026.eqiad.wmnet	[production]
08:44	<cwilliams@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on an-redacteddb1001.eqiad.wmnet,clouddb[1015,1024-1025].eqiad.wmnet,db1155.eqiad.wmnet with reason: Reimaging db1221	[production]
08:10	<jnuche@deploy1003>	Finished deploy [releng/jenkins-deploy@ec879e3] (releasing): T430110 deploy to Jenkins primary (duration: 00m 52s)	[production]
08:10	<jnuche@deploy1003>	Started deploy [releng/jenkins-deploy@ec879e3] (releasing): T430110 deploy to Jenkins primary	[production]
08:07	<jnuche@deploy1003>	Finished deploy [releng/jenkins-deploy@ec879e3] (releasing): T430110 retry Jenkins secondary (duration: 00m 53s)	[production]
08:07	<jnuche@deploy1003>	Started deploy [releng/jenkins-deploy@ec879e3] (releasing): T430110 retry Jenkins secondary	[production]
08:03	<marostegui>	Pool clouddb1026:s1 with a bit of weight T409557	[production]
08:03	<marostegui@cumin1003>	conftool action : set/pooled=yes; selector: name=clouddb1026.eqiad.wmnet,service=s1	[production]
08:02	<marostegui@cumin1003>	conftool action : set/weight=10; selector: name=clouddb1026.eqiad.wmnet	[production]
07:52	<filippo@cumin1003>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
07:52	<filippo@cumin1003>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Allocate IPs for cloudvirt1077 - filippo@cumin1003"	[production]
07:52	<filippo@cumin1003>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Allocate IPs for cloudvirt1077 - filippo@cumin1003"	[production]
07:51	<arnaudb@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on releases2003.codfw.wmnet with reason: T410849	[production]
07:47	<filippo@cumin1003>	START - Cookbook sre.dns.netbox	[production]
07:41	<marostegui@cumin1003>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2160.codfw.wmnet with reason: Upgrading	[production]
07:35	<marostegui@cumin1003>	START - Cookbook sre.hosts.reimage for host db2234.codfw.wmnet with OS trixie	[production]
07:35	<marostegui@cumin1003>	END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host db2234.codfw.wmnet with OS trixie	[production]
07:29	<jnuche@deploy1003>	Finished deploy [releng/jenkins-deploy@86ab691] (releasing): T430110 Test on Jenkins secondary (duration: 00m 50s)	[production]
07:29	<jnuche@deploy1003>	Started deploy [releng/jenkins-deploy@86ab691] (releasing): T430110 Test on Jenkins secondary	[production]
07:24	<moritzm>	installing nginx security updates	[production]
07:20	<dcausse>	T423993: dropping ttmserver indices from the cirrussearch opensearch clusters	[production]