production SAL

2025-03-12
13:20	<brouberol@deploy2002>	helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
13:20	<brouberol@deploy2002>	helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
13:19	<brouberol@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply	[production]
13:19	<brouberol@deploy2002>	helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply	[production]
13:01	<Emperor>	fio testing on ms-be2088 24 disks at once whilst resetting the controller T384003	[production]
12:27	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1034.eqiad.wmnet	[production]
12:24	<jmm@cumin2002>	END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1034.eqiad.wmnet	[production]
12:23	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1034.eqiad.wmnet	[production]
12:23	<jmm@cumin2002>	END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1034.eqiad.wmnet	[production]
12:22	<jmm@cumin2002>	START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1034.eqiad.wmnet	[production]
12:10	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs6003.drmrs.wmnet with OS bookworm	[production]
11:55	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for role: wdqs::internal@eqiad	[production]
11:55	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs	[production]
11:54	<vgutierrez@cumin1002>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs	[production]
11:50	<Emperor>	fio testing on ms-be2088 24 disks at once T384003	[production]
11:44	<vgutierrez@cumin1002>	START - Cookbook sre.loadbalancer.migrate-service-ipip for role: wdqs::internal@eqiad	[production]
11:42	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs6003.drmrs.wmnet with reason: host reimage	[production]
11:39	<vgutierrez@cumin1002>	START - Cookbook sre.hosts.downtime for 2:00:00 on lvs6003.drmrs.wmnet with reason: host reimage	[production]
11:39	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for role: wdqs::internal@codfw	[production]
11:39	<vgutierrez@cumin1002>	END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs	[production]
11:38	<vgutierrez@cumin1002>	START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs	[production]
11:31	<vgutierrez@cumin1002>	START - Cookbook sre.loadbalancer.migrate-service-ipip for role: wdqs::internal@codfw	[production]
11:21	<vgutierrez@cumin1002>	START - Cookbook sre.hosts.reimage for host lvs6003.drmrs.wmnet with OS bookworm	[production]
11:18	<vgutierrez>	reimage lvs6003 as a liberica instance - T384477	[production]
11:17	<mvolz@deploy2002>	helmfile [eqiad] DONE helmfile.d/services/citoid: apply	[production]
11:16	<mvolz@deploy2002>	helmfile [eqiad] START helmfile.d/services/citoid: apply	[production]
11:16	<mvolz@deploy2002>	helmfile [codfw] DONE helmfile.d/services/citoid: apply	[production]
11:15	<mvolz@deploy2002>	helmfile [codfw] START helmfile.d/services/citoid: apply	[production]
11:13	<mvolz@deploy2002>	helmfile [staging] DONE helmfile.d/services/citoid: apply	[production]
11:13	<mvolz@deploy2002>	helmfile [staging] START helmfile.d/services/citoid: apply	[production]
11:11	<Emperor>	fio testing on ms-be2088 while resetting controller T384003	[production]
11:05	<elukey@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be1091.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART	[production]
11:05	<elukey@cumin2002>	START - Cookbook sre.hosts.provision for host ms-be1091.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:57	<jiji@deploy2002>	scap failed: <KeyError> 'production' (scap version: 4.140.0) (duration: 13m 54s)	[production]
10:53	<elukey@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:48	<elukey@cumin2002>	START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:44	<jiji@deploy2002>	Started scap sync-world: (T383845) mw-(api-int|parsoid|jobrunner): switch all releases to PHP 8.1	[production]
10:43	<elukey@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:42	<jynus>	removing backup1002, backup2002 dbbackups user @ m1 T387892	[production]
10:38	<elukey@cumin2002>	START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:36	<elukey@cumin2002>	END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:36	<elukey@cumin2002>	START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:19	<jmm@cumin2002>	END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1037.eqiad.wmnet to cluster eqiad and group C	[production]
10:18	<jmm@cumin2002>	START - Cookbook sre.ganeti.addnode for new host ganeti1037.eqiad.wmnet to cluster eqiad and group C	[production]
10:17	<jmm@cumin2002>	END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1037.eqiad.wmnet	[production]
10:14	<jynus>	removing backup1002, backup2002 dump user on es6,es7 T387892	[production]
10:14	<elukey@cumin2002>	END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:13	<moritzm>	installing systemd bugfix updates from Bookworm point release	[production]
10:08	<elukey@cumin2002>	START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART	[production]
10:07	<jmm@cumin2002>	START - Cookbook sre.hosts.reboot-single for host ganeti1037.eqiad.wmnet	[production]