production SAL

2024-02-14
14:44	<hnowlan@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw2335.codfw.wmnet with reason: host reimage	[production]
14:44	<claime>	Restarted rsyslog on A:wikikube-master	[production]
14:44	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host mw2380.codfw.wmnet with OS bullseye	[production]
14:43	<hnowlan@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host mw2380.codfw.wmnet with OS bullseye	[production]
14:43	<hnowlan@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host mw2379.codfw.wmnet with OS bullseye	[production]
14:42	<hnowlan@cumin2002>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host mw2383.codfw.wmnet with OS bullseye	[production]
14:42	<hnowlan@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw2311.codfw.wmnet with reason: host reimage	[production]
14:41	<hnowlan@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on mw2335.codfw.wmnet with reason: host reimage	[production]
14:40	<jmm@cumin2002>	START - Cookbook sre.puppet.migrate-host for host restbase1037.eqiad.wmnet	[production]
14:38	<jmm@cumin2002>	END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host restbase1035.eqiad.wmnet	[production]
14:35	<cgoubert@cumin2002>	conftool action : set/pooled=inactive; selector: name=(mw2402|mw2403|mw2404|mw2405|mw2407|mw2408|mw2409|mw2401|mw2410|mw2411|parse2001|parse2002|parse2003).*	[production]
14:34	<claime>	Depooling mw2402|mw2403|mw2404|mw2405|mw2407|mw2408|mw2409|mw2401|mw2410|mw2411|parse2001|parse2002|parse2003 for T355864	[production]
14:33	<TheresNoTime>	close UTC afternoon backport window	[production]
14:32	<jhancock@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
14:31	<jmm@cumin2002>	START - Cookbook sre.puppet.migrate-host for host restbase1035.eqiad.wmnet	[production]
14:31	<ayounsi@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest2005.codfw.wmnet with reason: host reimage	[production]
14:31	<jhancock@cumin2002>	START - Cookbook sre.dns.netbox	[production]
14:31	<jmm@cumin2002>	END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host restbase1034.eqiad.wmnet	[production]
14:30	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1212', diff saved to https://phabricator.wikimedia.org/P56774 and previous config saved to /var/cache/conftool/dbconfig/20240214-143006-ladsgroup.json	[production]
14:29	<samtar@deploy2002>	Finished scap: Backport for [[gerrit:991352|prod: Stop setting $wgCampaignEventsEnableParticipantQuestions (T347608)]] (duration: 23m 37s)	[production]
14:27	<ayounsi@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on sretest2005.codfw.wmnet with reason: host reimage	[production]
14:26	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host mw2335.codfw.wmnet with OS bullseye	[production]
14:26	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host mw2383.codfw.wmnet with OS bullseye	[production]
14:26	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host mw2380.codfw.wmnet with OS bullseye	[production]
14:26	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host mw2379.codfw.wmnet with OS bullseye	[production]
14:25	<hnowlan@cumin2002>	START - Cookbook sre.hosts.reimage for host mw2311.codfw.wmnet with OS bullseye	[production]
14:22	<samtar@deploy2002>	samtar and daimona: Continuing with sync	[production]
14:15	<claime>	Draining and cordoning kubernetes2019.codfw.wmnet kubernetes2018.codfw.wmnet mw2420.codfw.wmnet mw2421.codfw.wmnet mw2406.codfw.wmnet mw2422.codfw.wmnet mw2423.codfw.wmnet for T355864	[production]
14:15	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1212', diff saved to https://phabricator.wikimedia.org/P56773 and previous config saved to /var/cache/conftool/dbconfig/20240214-141459-ladsgroup.json	[production]
14:14	<ayounsi@cumin1002>	START - Cookbook sre.hosts.reimage for host sretest2005.codfw.wmnet with OS bookworm	[production]
14:10	<samtar@deploy2002>	samtar and daimona: Backport for [[gerrit:991352|prod: Stop setting $wgCampaignEventsEnableParticipantQuestions (T347608)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug)	[production]
14:09	<jmm@cumin2002>	START - Cookbook sre.puppet.migrate-host for host restbase1034.eqiad.wmnet	[production]
14:06	<samtar@deploy2002>	Started scap: Backport for [[gerrit:991352|prod: Stop setting $wgCampaignEventsEnableParticipantQuestions (T347608)]]	[production]
14:05	<jelto@deploy2002>	helmfile [staging] DONE helmfile.d/services/miscweb: apply	[production]
14:03	<jelto@deploy2002>	helmfile [staging] START helmfile.d/services/miscweb: apply	[production]
13:59	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1212 (T352010)', diff saved to https://phabricator.wikimedia.org/P56772 and previous config saved to /var/cache/conftool/dbconfig/20240214-135953-ladsgroup.json	[production]
13:59	<brouberol@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host eventlog1003.eqiad.wmnet with OS bullseye	[production]
13:58	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1241 (T352010)', diff saved to https://phabricator.wikimedia.org/P56771 and previous config saved to /var/cache/conftool/dbconfig/20240214-135813-ladsgroup.json	[production]
13:58	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1241.eqiad.wmnet with reason: Maintenance	[production]
13:57	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1241.eqiad.wmnet with reason: Maintenance	[production]
13:57	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1221 (T352010)', diff saved to https://phabricator.wikimedia.org/P56770 and previous config saved to /var/cache/conftool/dbconfig/20240214-135750-ladsgroup.json	[production]
13:52	<jelto@deploy2002>	helmfile [staging] DONE helmfile.d/services/miscweb: apply	[production]
13:50	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Depooling db1212 (T352010)', diff saved to https://phabricator.wikimedia.org/P56769 and previous config saved to /var/cache/conftool/dbconfig/20240214-134959-ladsgroup.json	[production]
13:49	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance	[production]
13:49	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on clouddb[1013,1017,1021].eqiad.wmnet,db1154.eqiad.wmnet with reason: Maintenance	[production]
13:49	<ladsgroup@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1212.eqiad.wmnet with reason: Maintenance	[production]
13:49	<ladsgroup@cumin1002>	START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1212.eqiad.wmnet with reason: Maintenance	[production]
13:49	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1198 (T352010)', diff saved to https://phabricator.wikimedia.org/P56768 and previous config saved to /var/cache/conftool/dbconfig/20240214-134929-ladsgroup.json	[production]
13:42	<ladsgroup@cumin1002>	dbctl commit (dc=all): 'Repooling after maintenance db1221', diff saved to https://phabricator.wikimedia.org/P56767 and previous config saved to /var/cache/conftool/dbconfig/20240214-134244-ladsgroup.json	[production]
13:42	<jelto@deploy2002>	helmfile [staging] START helmfile.d/services/miscweb: apply	[production]