production SAL

2023-01-18
22:03	<bking@cumin1001>	START - Cookbook sre.elasticsearch.rolling-operation Operation.RESTART (1 nodes at a time) for ElasticSearch cluster cloudelastic: raise heap memory to 12G - bking@cumin1001 - T323646	[production]
21:50	<kindrobot>	close UTC late backport window	[production]
21:50	<kindrobot@deploy1002>	Finished scap: Backport for [[gerrit:881462|[config]: Undeploy GDI Safety Survey Wave 4 (T327296)]] (duration: 10m 45s)	[production]
21:41	<kindrobot@deploy1002>	essexigyan and kindrobot: Backport for [[gerrit:881462|[config]: Undeploy GDI Safety Survey Wave 4 (T327296)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet	[production]
21:39	<kindrobot@deploy1002>	Started scap: Backport for [[gerrit:881462|[config]: Undeploy GDI Safety Survey Wave 4 (T327296)]]	[production]
21:36	<kindrobot@deploy1002>	Finished scap: Backport for [[gerrit:881451|Bump English Wikipedia event logging from 0.5 to 1% (T326892)]], [[gerrit:881431|Legacy Vector is not a responsive skin (T327256)]] (duration: 13m 01s)	[production]
21:25	<kindrobot@deploy1002>	kindrobot and jdlrobson: Backport for [[gerrit:881451|Bump English Wikipedia event logging from 0.5 to 1% (T326892)]], [[gerrit:881431|Legacy Vector is not a responsive skin (T327256)]] synced to the testservers: mwdebug2002.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet	[production]
21:23	<kindrobot@deploy1002>	Started scap: Backport for [[gerrit:881451|Bump English Wikipedia event logging from 0.5 to 1% (T326892)]], [[gerrit:881431|Legacy Vector is not a responsive skin (T327256)]]	[production]
21:08	<cwhite@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1037.eqiad.wmnet with OS bullseye	[production]
21:05	<cwhite@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1036.eqiad.wmnet with OS bullseye	[production]
21:03	<kindrobot>	start UTC late backport window	[production]
20:54	<cwhite@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1037.eqiad.wmnet with reason: host reimage	[production]
20:51	<cwhite@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1036.eqiad.wmnet with reason: host reimage	[production]
20:49	<cwhite@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1037.eqiad.wmnet with reason: host reimage	[production]
20:48	<cwhite@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1036.eqiad.wmnet with reason: host reimage	[production]
20:36	<cwhite@cumin2002>	START - Cookbook sre.hosts.reimage for host logstash1037.eqiad.wmnet with OS bullseye	[production]
20:35	<cwhite@cumin2002>	START - Cookbook sre.hosts.reimage for host logstash1036.eqiad.wmnet with OS bullseye	[production]
20:34	<aokoth@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 14 days, 0:00:00 on vrts2001.codfw.wmnet with reason: installation failed due to read-only database	[production]
20:34	<aokoth@cumin1001>	START - Cookbook sre.hosts.downtime for 14 days, 0:00:00 on vrts2001.codfw.wmnet with reason: installation failed due to read-only database	[production]
19:54	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1037.eqiad.wmnet with OS buster	[production]
19:54	<pt1979@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
19:52	<bblack>	db1129 and lvs1017: removed misconfigured IP address in wrong vlan from eno1 and /e/n/i	[production]
19:51	<pt1979@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
19:47	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host logstash1036.eqiad.wmnet with OS buster	[production]
19:47	<pt1979@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
19:40	<pt1979@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - pt1979@cumin2002"	[production]
19:35	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1037.eqiad.wmnet with reason: host reimage	[production]
19:32	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1037.eqiad.wmnet with reason: host reimage	[production]
19:26	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on logstash1036.eqiad.wmnet with reason: host reimage	[production]
19:23	<pt1979@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on logstash1036.eqiad.wmnet with reason: host reimage	[production]
19:19	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host logstash1037.eqiad.wmnet with OS buster	[production]
18:59	<pt1979@cumin2002>	START - Cookbook sre.hosts.reimage for host logstash1036.eqiad.wmnet with OS buster	[production]
18:21	<lucaswerkmeister-wmde@deploy1002>	Finished scap: Backport for [[gerrit:878927|Enable the REST API on test-wikidata (T324999)]] (duration: 09m 38s)	[production]
18:14	<lucaswerkmeister-wmde@deploy1002>	migr and lucaswerkmeister-wmde: Backport for [[gerrit:878927|Enable the REST API on test-wikidata (T324999)]] synced to the testservers: mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet	[production]
18:12	<lucaswerkmeister-wmde@deploy1002>	Started scap: Backport for [[gerrit:878927|Enable the REST API on test-wikidata (T324999)]]	[production]
17:55	<otto@deploy1002>	helmfile [dse-k8s-eqiad] DONE helmfile.d/services/flink-app-example: apply	[production]
17:55	<otto@deploy1002>	helmfile [dse-k8s-eqiad] START helmfile.d/services/flink-app-example: apply	[production]
17:44	<jnuche@deploy1002>	Installation of scap version "4.33.0" completed for 560 hosts	[production]
17:44	<jnuche@deploy1002>	Installing scap version "4.33.0" for 560 hosts	[production]
17:42	<jnuche@deploy1002>	install-world aborted: (duration: 07m 17s)	[production]
17:42	<btullis@deploy1002>	Installation of scap version "4.33.0" completed for 1 hosts	[production]
17:41	<btullis@deploy1002>	Installing scap version "4.33.0" for 1 hosts	[production]
17:35	<jnuche@deploy1002>	Installing scap version "4.33.0" for 561 hosts	[production]
17:19	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=False) upgrade firmware for hosts ['logstash1037']	[production]
17:10	<pt1979@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logstash1037']	[production]
17:10	<pt1979@cumin2002>	END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['logstash1037']	[production]
17:09	<pt1979@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logstash1037']	[production]
17:05	<pt1979@cumin2002>	END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=False) upgrade firmware for hosts ['logstash1036']	[production]
16:57	<pt1979@cumin2002>	START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['logstash1036']	[production]
16:45	<jnuche@deploy1002>	Installation of scap version "4.33.0" completed for 1 hosts	[production]