production SAL

551-600 of 10000 results (55ms)

2023-08-14 §
16:02	<marostegui@cumin1001>	dbctl commit (dc=all): 'es2025 (re)pooling @ 25%: Repooling after onsite maintenance', diff saved to https://phabricator.wikimedia.org/P50577 and previous config saved to /var/cache/conftool/dbconfig/20230814-160213-root.json	[production]
16:01	<sukhe@cumin2002>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cr2-esams.wikimedia.org on all recursors	[production]
16:00	<sukhe@cumin2002>	START - Cookbook sre.dns.wipe-cache cr2-esams.wikimedia.org on all recursors	[production]
15:58	<btullis@cumin1001>	START - Cookbook sre.hosts.reimage for host an-worker1094.eqiad.wmnet with OS bullseye	[production]
15:55	<btullis@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1093.eqiad.wmnet with OS bullseye	[production]
15:53	<cmooney@cumin1001>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Rename cr3-knams to cr2-esams - cmooney@cumin1001"	[production]
15:50	<cmooney@cumin1001>	START - Cookbook sre.dns.netbox	[production]
15:47	<marostegui@cumin1001>	dbctl commit (dc=all): 'es2025 (re)pooling @ 10%: Repooling after onsite maintenance', diff saved to https://phabricator.wikimedia.org/P50576 and previous config saved to /var/cache/conftool/dbconfig/20230814-154708-root.json	[production]
15:47	<bking@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
15:45	<bking@deploy1002>	Finished deploy [wdqs/wdqs@f1a6177]: deploying WDQS on newly-reimaged Bullseye hosts T343124 (duration: 00m 15s)	[production]
15:45	<bking@deploy1002>	Started deploy [wdqs/wdqs@f1a6177]: deploying WDQS on newly-reimaged Bullseye hosts T343124	[production]
15:38	<bking@cumin1001>	START - Cookbook sre.wdqs.data-transfer	[production]
15:36	<urandom>	upgrading Cassandra to 4.1.1, restbase1016-{a,b,c} — T339298	[production]
15:32	<btullis@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1093.eqiad.wmnet with reason: host reimage	[production]
15:32	<marostegui@cumin1001>	dbctl commit (dc=all): 'es2025 (re)pooling @ 5%: Repooling after onsite maintenance', diff saved to https://phabricator.wikimedia.org/P50575 and previous config saved to /var/cache/conftool/dbconfig/20230814-153203-root.json	[production]
15:30	<bking@deploy1002>	Finished deploy [wdqs/wdqs@f1a6177]: deploying WDQS on newly-reimaged Bullseye hosts T343124 (duration: 00m 43s)	[production]
15:29	<bking@deploy1002>	Started deploy [wdqs/wdqs@f1a6177]: deploying WDQS on newly-reimaged Bullseye hosts T343124	[production]
15:29	<btullis@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1093.eqiad.wmnet with reason: host reimage	[production]
15:16	<marostegui@cumin1001>	dbctl commit (dc=all): 'es2025 (re)pooling @ 3%: Repooling after onsite maintenance', diff saved to https://phabricator.wikimedia.org/P50574 and previous config saved to /var/cache/conftool/dbconfig/20230814-151659-root.json	[production]
15:16	<btullis@cumin1001>	START - Cookbook sre.hosts.reimage for host an-worker1093.eqiad.wmnet with OS bullseye	[production]
15:01	<marostegui@cumin1001>	dbctl commit (dc=all): 'es2025 (re)pooling @ 1%: Repooling after onsite maintenance', diff saved to https://phabricator.wikimedia.org/P50572 and previous config saved to /var/cache/conftool/dbconfig/20230814-150154-root.json	[production]
14:57	<bking@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs2012.codfw.wmnet with OS bullseye	[production]
14:47	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host durum1002.eqiad.wmnet with OS bookworm	[production]
14:42	<bking@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host wdqs1016.eqiad.wmnet with OS bullseye	[production]
14:34	<jgiannelos@deploy1002>	Finished deploy [kartotherian/deploy@ee544cb] (eqiad): (no justification provided) (duration: 00m 00s)	[production]
14:34	<jgiannelos@deploy1002>	Started deploy [kartotherian/deploy@ee544cb] (eqiad): (no justification provided)	[production]
14:33	<jgiannelos@deploy1002>	Finished deploy [kartotherian/deploy@ee544cb] (eqiad): (no justification provided) (duration: 00m 03s)	[production]
14:33	<jgiannelos@deploy1002>	Started deploy [kartotherian/deploy@ee544cb] (eqiad): (no justification provided)	[production]
14:31	<sukhe@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on durum1002.eqiad.wmnet with reason: host reimage	[production]
14:30	<jgiannelos@deploy1002>	Finished deploy [kartotherian/deploy@ee544cb] (eqiad): (no justification provided) (duration: 00m 00s)	[production]
14:30	<jgiannelos@deploy1002>	Started deploy [kartotherian/deploy@ee544cb] (eqiad): (no justification provided)	[production]
14:27	<jgiannelos@deploy1002>	Finished deploy [kartotherian/deploy@ee544cb]: (no justification provided) (duration: 00m 01s)	[production]
14:27	<jgiannelos@deploy1002>	Started deploy [kartotherian/deploy@ee544cb]: (no justification provided)	[production]
14:26	<sukhe@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on durum1002.eqiad.wmnet with reason: host reimage	[production]
14:26	<sukhe>	running authdns-update for CR 948195: T344073	[production]
14:26	<sukhe>	running authdns-update for CR 948195	[production]
14:25	<jgiannelos@deploy1002>	deploy aborted: (no justification provided) (duration: 00m 10s)	[production]
14:25	<jgiannelos@deploy1002>	Started deploy [kartotherian/deploy@ee544cb]: (no justification provided)	[production]
14:19	<bking@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs1016.eqiad.wmnet with reason: host reimage	[production]
14:16	<bking@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs1016.eqiad.wmnet with reason: host reimage	[production]
14:13	<sukhe@cumin2002>	START - Cookbook sre.hosts.reimage for host durum1002.eqiad.wmnet with OS bookworm	[production]
14:04	<urandom>	upgrading Cassandra to 4.1.1, restbase2013-{a,b,c} — T339298	[production]
14:04	<bking@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on wdqs2012.codfw.wmnet with reason: host reimage	[production]
14:01	<bking@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on wdqs2012.codfw.wmnet with reason: host reimage	[production]
13:53	<bking@cumin1001>	START - Cookbook sre.hosts.reimage for host wdqs1016.eqiad.wmnet with OS bullseye	[production]
13:40	<bking@cumin1001>	START - Cookbook sre.hosts.reimage for host wdqs2012.codfw.wmnet with OS bullseye	[production]
13:27	<derick@deploy1002>	Finished scap: Backport for [[gerrit:930798\|wmf-config: Remove wgContentTranslationDefaultParsoidClient cleanup]] (duration: 16m 56s)	[production]
13:20	<derick@deploy1002>	d3r1ck01 and derick: Continuing with sync	[production]
13:19	<derick@deploy1002>	d3r1ck01 and derick: Backport for [[gerrit:930798\|wmf-config: Remove wgContentTranslationDefaultParsoidClient cleanup]] synced to the testservers mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug2002.codfw.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option)	[production]
13:10	<derick@deploy1002>	Started scap: Backport for [[gerrit:930798\|wmf-config: Remove wgContentTranslationDefaultParsoidClient cleanup]]	[production]