| 2025-09-24
      
      § | 
    
  | 09:04 | <filippo@cloudcumin1001> | END (FAIL) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=99) for toolsbeta-test-k8s-worker-nfs-9,toolsbeta-test-k8s-worker-nfs-7 | [toolsbeta] | 
            
  | 09:04 | <filippo@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-9,toolsbeta-test-k8s-worker-nfs-7 | [toolsbeta] | 
            
  | 09:04 | <filippo@cloudcumin1001> | END (FAIL) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=99) for toolsbeta-test-k8s-worker-nfs-9, toolsbeta-test-k8s-worker-nfs-7 | [toolsbeta] | 
            
  | 09:04 | <filippo@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-test-k8s-worker-nfs-9, toolsbeta-test-k8s-worker-nfs-7 | [toolsbeta] | 
            
  | 09:03 | <filippo@cloudcumin1001> | END (ERROR) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=97) for tools-k8s-worker-nfs-14 | [toolsbeta] | 
            
  | 09:03 | <filippo@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.reboot for tools-k8s-worker-nfs-14 | [toolsbeta] | 
            
  | 09:00 | <jclark@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 08:59 | <jclark@cumin1002> | END (FAIL) - Cookbook sre.network.configure-switch-interfaces (exit_code=99) for host an-worker1095 | [production] | 
            
  | 08:59 | <jclark@cumin1002> | START - Cookbook sre.network.configure-switch-interfaces for host an-worker1095 | [production] | 
            
  | 08:57 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti3006.esams.wmnet | [production] | 
            
  | 08:56 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3006.esams.wmnet | [production] | 
            
  | 08:54 | <elukey@cumin1003> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on ml-serve1013.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 08:53 | <volans@cloudcumin1001> | END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1061.eqiad.wmnet}' | [admin] | 
            
  | 08:46 | <elukey@cumin1003> | START - Cookbook sre.hosts.downtime for 2:00:00 on ml-serve1013.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 08:42 | <filippo@cloudcumin1001> | END (FAIL) - Cookbook wmcs.toolforge.k8s.reboot (exit_code=99) for toolsbeta-bastion-7 | [toolsbeta] | 
            
  | 08:41 | <filippo@cloudcumin1001> | START - Cookbook wmcs.toolforge.k8s.reboot for toolsbeta-bastion-7 | [toolsbeta] | 
            
  | 08:41 | <moritzm> | failover Ganeti master in esams to ganeti3005 | [production] | 
            
  | 08:40 | <moritzm> | failover Ganeti master in magru to ganeti3005 | [production] | 
            
  | 08:37 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3008.esams.wmnet | [production] | 
            
  | 08:37 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3008.esams.wmnet | [production] | 
            
  | 08:35 | <volans@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1061.eqiad.wmnet}' | [admin] | 
            
  | 08:35 | <volans@cloudcumin1001> | END (PASS) - Cookbook wmcs.openstack.cloudvirt.safe_reboot (exit_code=0) on hosts matched by 'D{cloudvirt1060.eqiad.wmnet}' | [admin] | 
            
  | 08:33 | <elukey@cumin1003> | START - Cookbook sre.hosts.reimage for host ml-serve1013.eqiad.wmnet with OS trixie | [production] | 
            
  | 08:26 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti3008.esams.wmnet | [production] | 
            
  | 08:24 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3008.esams.wmnet | [production] | 
            
  | 08:21 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3007.esams.wmnet | [production] | 
            
  | 08:21 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3007.esams.wmnet | [production] | 
            
  | 08:19 | <wmbot~godog@r5> | END (FAIL) - Cookbook wmcs.nfs.migrate_service (exit_code=99) | [toolsbeta] | 
            
  | 08:19 | <wmbot~godog@r5> | START - Cookbook wmcs.nfs.migrate_service | [toolsbeta] | 
            
  | 08:14 | <mvernon@cumin2002> | DONE (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on ms-be1066.eqiad.wmnet with reason: vacuum | [production] | 
            
  | 08:13 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti3007.esams.wmnet | [production] | 
            
  | 08:13 | <Emperor> | VACUUM large container dbs on ms-be1066 T377827 | [production] | 
            
  | 08:13 | <volans@cloudcumin1001> | START - Cookbook wmcs.openstack.cloudvirt.safe_reboot on hosts matched by 'D{cloudvirt1060.eqiad.wmnet}' | [admin] | 
            
  | 08:09 | <dcaro@cloudcumin1001> | END (ERROR) - Cookbook wmcs.toolforge.component.deploy (exit_code=97) for component maintain-kubeusers | [toolsbeta] | 
            
  | 08:09 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3007.esams.wmnet | [production] | 
            
  | 08:05 | <wmbot~godog@r5> | END (FAIL) - Cookbook wmcs.nfs.migrate_service (exit_code=99) | [toolsbeta] | 
            
  | 08:03 | <wmbot~godog@r5> | START - Cookbook wmcs.nfs.migrate_service | [toolsbeta] | 
            
  | 08:02 | <dcaro@cloudcumin1001> | START - Cookbook wmcs.toolforge.component.deploy for component maintain-kubeusers | [toolsbeta] | 
            
  | 08:02 | <dcaro@cloudcumin1001> | END (ERROR) - Cookbook wmcs.toolforge.component.deploy (exit_code=97) for component maintain-kubeusers | [toolsbeta] | 
            
  | 07:58 | <fceratto@cumin1002> | START - Cookbook sre.mysql.sanitize-wiki Checking sanitization for wikis tokwiki in section s5 | [production] | 
            
  | 07:52 | <dcaro@cloudcumin1001> | START - Cookbook wmcs.toolforge.component.deploy for component maintain-kubeusers | [toolsbeta] | 
            
  | 07:49 | <jmm@cumin2002> | END (PASS) - Cookbook sre.ganeti.drain-node (exit_code=0) for draining ganeti node ganeti3005.esams.wmnet | [production] | 
            
  | 07:49 | <jmm@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host ganeti3005.esams.wmnet | [production] | 
            
  | 07:41 | <jmm@cumin2002> | START - Cookbook sre.hosts.reboot-single for host ganeti3005.esams.wmnet | [production] | 
            
  | 07:33 | <jmm@cumin2002> | START - Cookbook sre.ganeti.drain-node for draining ganeti node ganeti3005.esams.wmnet | [production] | 
            
  | 07:31 | <mlitn@deploy1003> | Finished scap sync-world: Backport for [[gerrit:1190684|Add MediaSearch custommatch:linked_from keyword (T403613)]] (duration: 13m 04s) | [production] | 
            
  | 07:26 | <mlitn@deploy1003> | mlitn: Continuing with sync | [production] | 
            
  | 07:25 | <mlitn@deploy1003> | mlitn: Backport for [[gerrit:1190684|Add MediaSearch custommatch:linked_from keyword (T403613)]] synced to the testservers (see https://wikitech.wikimedia.org/wiki/Mwdebug). Changes can now be verified there. | [production] | 
            
  | 07:21 | <stevemunene> | change the Druid public (AQS) connection string to druid1011 as we decommission druid1007 T405446 | [analytics] | 
            
  | 07:18 | <mlitn@deploy1003> | Started scap sync-world: Backport for [[gerrit:1190684|Add MediaSearch custommatch:linked_from keyword (T403613)]] | [production] |