production SAL

8301-8350 of 10000 results (89ms)

2023-04-12 §
19:54	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 16:00:00 on db2187.codfw.wmnet with reason: Maintenance	[production]
19:54	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2159.codfw.wmnet with reason: Maintenance	[production]
19:54	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 8:00:00 on db2159.codfw.wmnet with reason: Maintenance	[production]
19:54	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2150 (T333332)', diff saved to https://phabricator.wikimedia.org/P46604 and previous config saved to /var/cache/conftool/dbconfig/20230412-195423-ladsgroup.json	[production]
19:51	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1002.eqiad.wmnet with OS bullseye	[production]
19:43	<zabe@deploy2002>	Finished scap: Backport for [[gerrit:908292\|Revert "Ensure ApiHelp correctly types values in TOCData objects"]], [[gerrit:908293\|Revert "Ensure ApiHelp correctly types values in TOCData objects"]] (duration: 06m 40s)	[production]
19:41	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1001.eqiad.wmnet with OS bullseye	[production]
19:41	<otto@deploy2002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/admin 'apply'.	[production]
19:40	<otto@deploy2002>	helmfile [dse-k8s-eqiad] START helmfile.d/admin 'apply'.	[production]
19:39	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P46603 and previous config saved to /var/cache/conftool/dbconfig/20230412-193917-ladsgroup.json	[production]
19:38	<zabe@deploy2002>	zabe: Backport for [[gerrit:908292\|Revert "Ensure ApiHelp correctly types values in TOCData objects"]], [[gerrit:908293\|Revert "Ensure ApiHelp correctly types values in TOCData objects"]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet	[production]
19:37	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirtlocal1001.eqiad.wmnet with OS bullseye	[production]
19:37	<zabe@deploy2002>	Started scap: Backport for [[gerrit:908292\|Revert "Ensure ApiHelp correctly types values in TOCData objects"]], [[gerrit:908293\|Revert "Ensure ApiHelp correctly types values in TOCData objects"]]	[production]
19:37	<urandom>	sessionstore1001: systemctl stop cassandra-a.service && systemctl start cassandra-a.service — T327954	[production]
19:36	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirtlocal1002.eqiad.wmnet with OS bullseye	[production]
19:35	<zabe@deploy2002>	Sync cancelled.	[production]
19:32	<zabe@deploy2002>	jforrester and zabe: Backport for [[gerrit:908291\|composer.json: Explicitly pin psr/http-message to 1.0.1 (T333993)]], [[gerrit:908290\|Ensure ApiHelp correctly types values in TOCData objects (T334551)]], [[gerrit:908289\|Ensure ApiHelp correctly types values in TOCData objects (T334551)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1002.	[production]
19:30	<zabe@deploy2002>	Started scap: Backport for [[gerrit:908291\|composer.json: Explicitly pin psr/http-message to 1.0.1 (T333993)]], [[gerrit:908290\|Ensure ApiHelp correctly types values in TOCData objects (T334551)]], [[gerrit:908289\|Ensure ApiHelp correctly types values in TOCData objects (T334551)]]	[production]
19:28	<urandom>	restart Cassandra —sessionstore1001— to disable native transport for testing — T327954	[production]
19:24	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2150', diff saved to https://phabricator.wikimedia.org/P46602 and previous config saved to /var/cache/conftool/dbconfig/20230412-192411-ladsgroup.json	[production]
19:17	<eevans@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 6:00:00 on sessionstore1001.eqiad.wmnet with reason: Reproducing dissonant cluster state	[production]
19:16	<eevans@cumin1001>	START - Cookbook sre.hosts.downtime for 6:00:00 on sessionstore1001.eqiad.wmnet with reason: Reproducing dissonant cluster state	[production]
19:09	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2150 (T333332)', diff saved to https://phabricator.wikimedia.org/P46601 and previous config saved to /var/cache/conftool/dbconfig/20230412-190904-ladsgroup.json	[production]
18:42	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1001.eqiad.wmnet with OS bullseye	[production]
18:42	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirtlocal1001.eqiad.wmnet with OS bullseye	[production]
18:41	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1002.eqiad.wmnet with OS bullseye	[production]
18:39	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=93) for host cloudvirtlocal1002.eqiad.wmnet with OS bullseye	[production]
18:38	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2150 (T333332)', diff saved to https://phabricator.wikimedia.org/P46600 and previous config saved to /var/cache/conftool/dbconfig/20230412-183822-ladsgroup.json	[production]
18:38	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2150.codfw.wmnet with reason: Maintenance	[production]
18:38	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 8:00:00 on db2150.codfw.wmnet with reason: Maintenance	[production]
18:37	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2122 (T333332)', diff saved to https://phabricator.wikimedia.org/P46599 and previous config saved to /var/cache/conftool/dbconfig/20230412-183758-ladsgroup.json	[production]
18:22	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2122', diff saved to https://phabricator.wikimedia.org/P46598 and previous config saved to /var/cache/conftool/dbconfig/20230412-182252-ladsgroup.json	[production]
18:16	<dancy@deploy2002>	Synchronized php: group1 wikis to 1.41.0-wmf.4 refs T330210 (duration: 06m 02s)	[production]
18:10	<dancy@deploy2002>	rebuilt and synchronized wikiversions files: group1 wikis to 1.41.0-wmf.4 refs T330210	[production]
18:07	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2122', diff saved to https://phabricator.wikimedia.org/P46597 and previous config saved to /var/cache/conftool/dbconfig/20230412-180746-ladsgroup.json	[production]
17:52	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2122 (T333332)', diff saved to https://phabricator.wikimedia.org/P46596 and previous config saved to /var/cache/conftool/dbconfig/20230412-175240-ladsgroup.json	[production]
17:48	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2122 (T333332)', diff saved to https://phabricator.wikimedia.org/P46595 and previous config saved to /var/cache/conftool/dbconfig/20230412-174806-ladsgroup.json	[production]
17:48	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2122.codfw.wmnet with reason: Maintenance	[production]
17:47	<ladsgroup@cumin1001>	START - Cookbook sre.hosts.downtime for 8:00:00 on db2122.codfw.wmnet with reason: Maintenance	[production]
17:47	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2121 (T333332)', diff saved to https://phabricator.wikimedia.org/P46594 and previous config saved to /var/cache/conftool/dbconfig/20230412-174743-ladsgroup.json	[production]
17:47	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1001.eqiad.wmnet with OS bullseye	[production]
17:46	<andrew@cumin1001>	END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host cloudvirtlocal1001.eqiad.wmnet with OS bullseye	[production]
17:44	<andrew@cumin1001>	START - Cookbook sre.hosts.reimage for host cloudvirtlocal1002.eqiad.wmnet with OS bullseye	[production]
17:32	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2121', diff saved to https://phabricator.wikimedia.org/P46593 and previous config saved to /var/cache/conftool/dbconfig/20230412-173237-ladsgroup.json	[production]
17:17	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2121', diff saved to https://phabricator.wikimedia.org/P46592 and previous config saved to /var/cache/conftool/dbconfig/20230412-171730-ladsgroup.json	[production]
17:12	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2179 (T333332)', diff saved to https://phabricator.wikimedia.org/P46591 and previous config saved to /var/cache/conftool/dbconfig/20230412-171219-ladsgroup.json	[production]
17:06	<ejegg>	payments-wiki upgraded from efe7e408 to 4dcba0a9	[production]
17:02	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db2121 (T333332)', diff saved to https://phabricator.wikimedia.org/P46590 and previous config saved to /var/cache/conftool/dbconfig/20230412-170224-ladsgroup.json	[production]
16:59	<ladsgroup@cumin1001>	dbctl commit (dc=all): 'Depooling db2121 (T333332)', diff saved to https://phabricator.wikimedia.org/P46589 and previous config saved to /var/cache/conftool/dbconfig/20230412-165951-ladsgroup.json	[production]
16:59	<ladsgroup@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 8:00:00 on db2121.codfw.wmnet with reason: Maintenance	[production]