__all__ SAL

2025-05-20
15:07	<andrew@cloudcumin1001>	START - Cookbook wmcs.openstack.restart_openstack on deployment eqiad1 for all services	[admin]
15:05	<btullis@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply	[production]
15:05	<btullis@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply	[production]
15:01	<wmbot~dcaro@acme>	END (FAIL) - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry (exit_code=99)	[tools]
15:00	<wmbot~dcaro@acme>	Updating container image toolforge-kyverno-kyverno:v1.13.6	[tools]
15:00	<wmbot~dcaro@acme>	START - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry	[tools]
14:59	<wmbot~dcaro@acme>	END (PASS) - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry (exit_code=0)	[tools]
14:59	<wmbot~dcaro@acme>	START - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry	[tools]
14:59	<btullis@deploy1003>	helmfile [dse-k8s-eqiad] DONE helmfile.d/services/mediawiki-dumps-legacy: apply	[production]
14:59	<wmbot~dcaro@acme>	END (PASS) - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry (exit_code=0)	[tools]
14:59	<wmbot~dcaro@acme>	START - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry	[tools]
14:58	<wmbot~dcaro@acme>	END (ERROR) - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry (exit_code=97)	[tools]
14:58	<wmbot~dcaro@acme>	Updating container image toolforge-kyverno-kyverno:v1.13.6	[tools]
14:58	<wmbot~dcaro@acme>	START - Cookbook wmcs.toolforge.k8s.kyverno.copy_images_to_registry	[tools]
14:58	<brouberol@cumin2002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 21 days, 0:00:00 on kafka-jumbo[1016-1018].eqiad.wmnet with reason: Parted config is broken causing the hosts to have no data disk	[production]
14:57	<andrewbogott>	resetting eqiad1 rabbitmq in an attempt to resolve T394790	[admin]
14:56	<btullis@deploy1003>	helmfile [dse-k8s-eqiad] START helmfile.d/services/mediawiki-dumps-legacy: apply	[production]
14:54	<bking@cumin2002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 7 days, 0:00:00 on 65 hosts with reason: eqiad is depooled, noisy alerts	[production]
14:53	<topranks>	shutting down control board 1 on cr2-codfw (T393552)	[production]
14:52	<topranks>	shutting down backup RE1 on cr2-codfw (T393552)	[production]
14:51	<klausman@cumin1002>	START - Cookbook sre.cassandra.roll-restart for nodes matching A:ml-cache-codfw: Enable Java security updates - klausman@cumin1002	[production]
14:50	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch1087.eqiad.wmnet with OS bullseye	[production]
14:48	<moritzm>	installing expat security updates	[production]
14:39	<topranks>	switching active routing-engine to RE0 on cr2-codfw (this will cause protocol adjacencies to flap) (T364092)	[production]
14:30	<James_F>	Docker: [quibble-bullseye-php74-coverage] Bump phpunit-patch-coverage to 0.0.15	[releng]
14:28	<hashar>	integration: cleared Docker build cache on integration-agent-docker-1052 and integration-agent-docker-1061	[releng]
14:25	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cirrussearch1087.eqiad.wmnet with reason: host reimage	[production]
14:21	<bking@cumin2002>	START - Cookbook sre.hosts.downtime for 2:00:00 on cirrussearch1087.eqiad.wmnet with reason: host reimage	[production]
14:21	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch1083.eqiad.wmnet with OS bullseye	[production]
14:18	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cirrussearch1082.eqiad.wmnet with OS bullseye	[production]
14:06	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.move-vlan (exit_code=0) for host cirrussearch1087	[production]
14:06	<bking@cumin2002>	START - Cookbook sre.hosts.move-vlan for host cirrussearch1087	[production]
14:06	<bking@cumin2002>	START - Cookbook sre.hosts.reimage for host cirrussearch1087.eqiad.wmnet with OS bullseye	[production]
14:04	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from elastic1087 to cirrussearch1087	[production]
14:03	<topranks>	switching active routing-engine to RE1 on cr2-codfw (this will cause protocol adjacencies to flap) (T364092)	[production]
14:03	<bking@cumin2002>	END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host cirrussearch1087	[production]
14:02	<bking@cumin2002>	START - Cookbook sre.network.configure-switch-interfaces for host cirrussearch1087	[production]
14:02	<bking@cumin2002>	END (PASS) - Cookbook sre.dns.wipe-cache (exit_code=0) cirrussearch1087 on all recursors	[production]
14:02	<bking@cumin2002>	START - Cookbook sre.dns.wipe-cache cirrussearch1087 on all recursors	[production]
14:02	<bking@cumin2002>	END (PASS) - Cookbook sre.dns.netbox (exit_code=0)	[production]
14:02	<bking@cumin2002>	END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic1087 to cirrussearch1087 - bking@cumin2002"	[production]
14:01	<phuedx@deploy1003>	Finished scap sync-world: Backport for [[gerrit:1147796|ext-EventStreamConfig: Update product_metrics.web_base stream (T394457)]] (duration: 14m 14s)	[production]
14:01	<jynus@cumin1002>	DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on db2200.codfw.wmnet,db1216.eqiad.wmnet with reason: Move s8 to s3	[production]
13:59	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cirrussearch1083.eqiad.wmnet with reason: host reimage	[production]
13:58	<bking@cumin2002>	START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming elastic1087 to cirrussearch1087 - bking@cumin2002"	[production]
13:57	<taavi>	disable host-based authentication in sshd config, not used since grid shutdown	[tools]
13:56	<topranks>	rebooting backup routing-engine RE1 on cr2-codfw to install JunOS upgrade (T364092)	[production]
13:55	<bking@cumin2002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on cirrussearch1082.eqiad.wmnet with reason: host reimage	[production]
13:54	<bking@cumin2002>	START - Cookbook sre.dns.netbox	[production]
13:54	<bking@cumin2002>	START - Cookbook sre.hosts.rename from elastic1087 to cirrussearch1087	[production]