production SAL

2024-05-21
14:26	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
14:24	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1192', diff saved to https://phabricator.wikimedia.org/P62781 and previous config saved to /var/cache/conftool/dbconfig/20240521-142412-marostegui.json	[production]
14:19	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
14:16	<jiji@cumin1002>	START - Cookbook sre.hosts.reimage for host mc2055.codfw.wmnet with OS bookworm	[production]
14:09	<taavi@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cloudnet2006-dev.codfw.wmnet with reason: host reimage	[production]
14:09	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1192 (T364299)', diff saved to https://phabricator.wikimedia.org/P62780 and previous config saved to /var/cache/conftool/dbconfig/20240521-140904-marostegui.json	[production]
14:06	<taavi@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cloudnet2006-dev.codfw.wmnet with reason: host reimage	[production]
14:04	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
14:04	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
14:03	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1010.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
13:55	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host mc-gp2003.codfw.wmnet with OS bookworm	[production]
13:47	<taavi@cumin1002>	START - Cookbook sre.hosts.reimage for host cloudnet2006-dev.codfw.wmnet with OS bookworm	[production]
13:39	<kormat@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host db1246.eqiad.wmnet with OS bookworm	[production]
13:38	<jiji@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mc-gp2003.codfw.wmnet with reason: host reimage	[production]
13:34	<jiji@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mc-gp2003.codfw.wmnet with reason: host reimage	[production]
13:34	<zabe@deploy1002>	Finished scap: Backport for [[gerrit:1034433|arwiki: Disable Extension:ContentTranslation for non-autoreview users (T255022)]] (duration: 23m 20s)	[production]
13:30	<jclark@cumin1002>	START - Cookbook sre.hosts.reimage for host kafka-main1009.eqiad.wmnet with OS bullseye	[production]
13:29	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
13:27	<tchin@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/datasets-config-next: apply	[production]
13:26	<tchin@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/datasets-config-next: apply	[production]
13:23	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance	[production]
13:22	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance	[production]
13:20	<zabe@deploy1002>	zabe and gergesshamon: Continuing with sync	[production]
13:18	<kormat@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on db1246.eqiad.wmnet with reason: host reimage	[production]
13:17	<jiji@cumin1002>	START - Cookbook sre.hosts.reimage for host mc-gp2003.codfw.wmnet with OS bookworm	[production]
13:15	<kormat@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on db1246.eqiad.wmnet with reason: host reimage	[production]
13:13	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
13:13	<zabe@deploy1002>	zabe and gergesshamon: Backport for [[gerrit:1034433|arwiki: Disable Extension:ContentTranslation for non-autoreview users (T255022)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
13:13	<jclark@cumin1002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
13:13	<marostegui>	Deploy schema change on s3 eqiad with replication dbmaint T365465	[production]
13:12	<vgutierrez>	re-enable puppet on acme-chief clients - T364589	[production]
13:11	<jclark@cumin1002>	START - Cookbook sre.hosts.provision for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED	[production]
13:11	<jclark@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host kafka-main1006.eqiad.wmnet with OS bullseye	[production]
13:11	<jclark@cumin1002>	END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002"	[production]
13:10	<zabe@deploy1002>	Started scap: Backport for [[gerrit:1034433|arwiki: Disable Extension:ContentTranslation for non-autoreview users (T255022)]]	[production]
13:09	<marostegui>	Deploy schema change on s5 (azwikimedia wikifunctionswiki vewikimedia) eqiad with replication dbmaint T365465	[production]
13:08	<marostegui>	Deploy schema change on s7 (metawiki and frwiktionary) eqiad with replication dbmaint T365465	[production]
13:08	<marostegui@cumin1002>	dbctl commit (dc=all): 'Depooling db1192 (T364299)', diff saved to https://phabricator.wikimedia.org/P62778 and previous config saved to /var/cache/conftool/dbconfig/20240521-130838-marostegui.json	[production]
13:08	<marostegui@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1192.eqiad.wmnet with reason: Maintenance	[production]
13:08	<vgutierrez>	upgrading to acme-chief 0.37 on acmechief instances - T364589	[production]
13:08	<marostegui@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1192.eqiad.wmnet with reason: Maintenance	[production]
13:08	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1178 (T364299)', diff saved to https://phabricator.wikimedia.org/P62777 and previous config saved to /var/cache/conftool/dbconfig/20240521-130814-marostegui.json	[production]
13:07	<marostegui>	Deploy schema change on s4 eqiad with replication dbmaint T365465	[production]
13:05	<marostegui>	Deploy schema change on s8 eqiad with replication dbmaint T365465	[production]
13:04	<vgutierrez>	disable puppet on acme-chief clients - T364589	[production]
13:01	<kormat@cumin1002>	START - Cookbook sre.hosts.reimage for host db1246.eqiad.wmnet with OS bookworm	[production]
12:59	<vgutierrez>	upgrading to acme-chief 0.37 on acmechief-test instances - T364589	[production]
12:55	<vgutierrez>	upload acme-chief 0.37 to apt.wm.org (bookworm-wikimedia) - T364589	[production]
12:53	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1178', diff saved to https://phabricator.wikimedia.org/P62775 and previous config saved to /var/cache/conftool/dbconfig/20240521-125306-marostegui.json	[production]
12:22	<marostegui@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1178 (T364299)', diff saved to https://phabricator.wikimedia.org/P62771 and previous config saved to /var/cache/conftool/dbconfig/20240521-122250-marostegui.json	[production]