production SAL

6401-6450 of 10000 results (51ms)

2022-06-07 §
14:00	<jbond@cumin1001>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox-exports.wikimedia.org on all recursors	[production]
14:00	<jbond@cumin1001>	START - Cookbook sre.dns.wipe-cache netbox-exports.wikimedia.org on all recursors	[production]
13:58	<btullis@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/eventgate-main: apply	[production]
13:57	<btullis@deploy1002>	helmfile [eqiad] START helmfile.d/services/eventgate-main: apply	[production]
13:55	<btullis@deploy1002>	helmfile [codfw] DONE helmfile.d/services/eventgate-main: apply	[production]
13:54	<btullis@deploy1002>	helmfile [codfw] START helmfile.d/services/eventgate-main: apply	[production]
13:54	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/eventgate-main: apply	[production]
13:53	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/eventgate-main: apply	[production]
13:53	<jbond@cumin1001>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) netbox-exports.discovery.wmnet on all recursors	[production]
13:53	<jbond@cumin1001>	START - Cookbook sre.dns.wipe-cache netbox-exports.discovery.wmnet on all recursors	[production]
13:53	<btullis@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/eventgate-logging-external: apply	[production]
13:52	<btullis@deploy1002>	helmfile [eqiad] START helmfile.d/services/eventgate-logging-external: apply	[production]
13:51	<btullis@deploy1002>	helmfile [codfw] DONE helmfile.d/services/eventgate-logging-external: apply	[production]
13:51	<btullis@deploy1002>	helmfile [codfw] START helmfile.d/services/eventgate-logging-external: apply	[production]
13:50	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/eventgate-logging-external: apply	[production]
13:50	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/eventgate-logging-external: apply	[production]
13:48	<btullis@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics: apply	[production]
13:47	<btullis@deploy1002>	helmfile [eqiad] START helmfile.d/services/eventgate-analytics: apply	[production]
13:47	<btullis@deploy1002>	helmfile [codfw] DONE helmfile.d/services/eventgate-analytics: apply	[production]
13:46	<btullis@deploy1002>	helmfile [codfw] START helmfile.d/services/eventgate-analytics: apply	[production]
13:45	<btullis@deploy1002>	helmfile [staging] DONE helmfile.d/services/eventgate-analytics: apply	[production]
13:45	<btullis@deploy1002>	helmfile [staging] START helmfile.d/services/eventgate-analytics: apply	[production]
13:33	<jbond@cumin1001>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host sretest1001.eqiad.wmnet with OS buster	[production]
13:21	<ayounsi@cumin1001>	END (FAIL) - Cookbook sre.dns.netbox (exit_code=99)	[production]
13:21	<ayounsi@cumin1001>	START - Cookbook sre.dns.netbox	[production]
13:20	<ayounsi@cumin1001>	END (FAIL) - Cookbook sre.dns.netbox (exit_code=99)	[production]
13:20	<ayounsi@cumin1001>	START - Cookbook sre.dns.netbox	[production]
13:20	<ayounsi@cumin1001>	END (FAIL) - Cookbook sre.dns.netbox (exit_code=99)	[production]
13:20	<ayounsi@cumin1001>	START - Cookbook sre.dns.netbox	[production]
13:20	<jbond@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage	[production]
13:16	<jbond@cumin1001>	START - Cookbook sre.hosts.downtime for 2:00:00 on sretest1001.eqiad.wmnet with reason: host reimage	[production]
13:05	<jbond@cumin1001>	START - Cookbook sre.hosts.reimage for host sretest1001.eqiad.wmnet with OS buster	[production]
12:49	<moritzm>	installing python-virtualenv updates from Buster point release	[production]
12:12	<btullis@cumin1001>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons.	[production]
12:06	<btullis@cumin1001>	START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons.	[production]
12:02	<btullis@cumin1001>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-analytics cluster: Roll restart of jvm daemons.	[production]
12:02	<marostegui@cumin1001>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1140.eqiad.wmnet with reason: Maintenance	[production]
12:02	<marostegui@cumin1001>	START - Cookbook sre.hosts.downtime for 12:00:00 on db1140.eqiad.wmnet with reason: Maintenance	[production]
12:02	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1131 (T310011)', diff saved to https://phabricator.wikimedia.org/P29485 and previous config saved to /var/cache/conftool/dbconfig/20220607-120212-marostegui.json	[production]
11:58	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
11:56	<btullis@cumin1001>	START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-analytics cluster: Roll restart of jvm daemons.	[production]
11:54	<hnowlan@cumin1001>	START - Cookbook sre.dns.netbox	[production]
11:54	<btullis@cumin1001>	END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons.	[production]
11:48	<btullis@cumin1001>	START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons.	[production]
11:47	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1131', diff saved to https://phabricator.wikimedia.org/P29484 and previous config saved to /var/cache/conftool/dbconfig/20220607-114707-marostegui.json	[production]
11:42	<btullis@cumin1001>	END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid public cluster: Roll restart of Druid jvm daemons.	[production]
11:35	<btullis@cumin1001>	START - Cookbook sre.kafka.roll-restart-brokers for Kafka A:kafka-jumbo-eqiad cluster: Roll restart of jvm daemons.	[production]
11:33	<hnowlan@cumin1001>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
11:32	<marostegui@cumin1001>	dbctl commit (dc=all): 'Repooling after maintenance db1131', diff saved to https://phabricator.wikimedia.org/P29483 and previous config saved to /var/cache/conftool/dbconfig/20220607-113202-marostegui.json	[production]
11:31	<btullis@deploy1002>	helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: apply	[production]