201-250 of 10000 results (93ms)
2025-03-12 ยง
13:20 <brouberol@deploy2002> helmfile [codfw] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
13:20 <brouberol@deploy2002> helmfile [codfw] START helmfile.d/services/cirrus-streaming-updater: apply [production]
13:19 <brouberol@deploy2002> helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply [production]
13:19 <brouberol@deploy2002> helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply [production]
13:01 <Emperor> fio testing on ms-be2088 24 disks at once whilst resetting the controller T384003 [production]
12:27 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1034.eqiad.wmnet [production]
12:24 <jmm@cumin2002> END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti1034.eqiad.wmnet [production]
12:23 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1034.eqiad.wmnet [production]
12:23 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.drain-node (exit_code=99) for draining ganeti node ganeti1034.eqiad.wmnet [production]
12:22 <jmm@cumin2002> START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti1034.eqiad.wmnet [production]
12:10 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host lvs6003.drmrs.wmnet with OS bookworm [production]
11:55 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for role: wdqs::internal@eqiad [production]
11:55 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs [production]
11:54 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-eqiad or A:lvs-secondary-eqiad) and A:bullseye and A:lvs [production]
11:50 <Emperor> fio testing on ms-be2088 24 disks at once T384003 [production]
11:44 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.migrate-service-ipip for role: wdqs::internal@eqiad [production]
11:42 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on lvs6003.drmrs.wmnet with reason: host reimage [production]
11:39 <vgutierrez@cumin1002> START - Cookbook sre.hosts.downtime for 2:00:00 on lvs6003.drmrs.wmnet with reason: host reimage [production]
11:39 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.migrate-service-ipip (exit_code=0) for role: wdqs::internal@codfw [production]
11:39 <vgutierrez@cumin1002> END (PASS) - Cookbook sre.loadbalancer.restart-pybal (exit_code=0) rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs [production]
11:38 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.restart-pybal rolling-restart of pybal on (A:lvs-low-traffic-codfw or A:lvs-secondary-codfw) and A:bullseye and A:lvs [production]
11:31 <vgutierrez@cumin1002> START - Cookbook sre.loadbalancer.migrate-service-ipip for role: wdqs::internal@codfw [production]
11:21 <vgutierrez@cumin1002> START - Cookbook sre.hosts.reimage for host lvs6003.drmrs.wmnet with OS bookworm [production]
11:18 <vgutierrez> reimage lvs6003 as a liberica instance - T384477 [production]
11:17 <mvolz@deploy2002> helmfile [eqiad] DONE helmfile.d/services/citoid: apply [production]
11:16 <mvolz@deploy2002> helmfile [eqiad] START helmfile.d/services/citoid: apply [production]
11:16 <mvolz@deploy2002> helmfile [codfw] DONE helmfile.d/services/citoid: apply [production]
11:15 <mvolz@deploy2002> helmfile [codfw] START helmfile.d/services/citoid: apply [production]
11:13 <mvolz@deploy2002> helmfile [staging] DONE helmfile.d/services/citoid: apply [production]
11:13 <mvolz@deploy2002> helmfile [staging] START helmfile.d/services/citoid: apply [production]
11:11 <Emperor> fio testing on ms-be2088 while resetting controller T384003 [production]
11:05 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host ms-be1091.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART [production]
11:05 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host ms-be1091.mgmt.eqiad.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:57 <jiji@deploy2002> scap failed: <KeyError> 'production' (scap version: 4.140.0) (duration: 13m 54s) [production]
10:53 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:48 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:44 <jiji@deploy2002> Started scap sync-world: (T383845) mw-(api-int|parsoid|jobrunner): switch all releases to PHP 8.1 [production]
10:43 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:42 <jynus> removing backup1002, backup2002 dbbackups user @ m1 T387892 [production]
10:38 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:36 <elukey@cumin2002> END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:36 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:19 <jmm@cumin2002> END (FAIL) - Cookbook sre.ganeti.addnode (exit_code=99) for new host ganeti1037.eqiad.wmnet to cluster eqiad and group C [production]
10:18 <jmm@cumin2002> START - Cookbook sre.ganeti.addnode for new host ganeti1037.eqiad.wmnet to cluster eqiad and group C [production]
10:17 <jmm@cumin2002> END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti1037.eqiad.wmnet [production]
10:14 <jynus> removing backup1002, backup2002 dump user on es6,es7 T387892 [production]
10:14 <elukey@cumin2002> END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:13 <moritzm> installing systemd bugfix updates from Bookworm point release [production]
10:08 <elukey@cumin2002> START - Cookbook sre.hosts.provision for host puppetserver2004.mgmt.codfw.wmnet with chassis set policy GRACEFUL_RESTART [production]
10:07 <jmm@cumin2002> START - Cookbook sre.hosts.reboot-single for host ganeti1037.eqiad.wmnet [production]