| 2024-06-20 |
    
  | 14:44 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2169.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:44 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:43 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 2 days, 0:00:00 on db2187.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:43 | <kamila@cumin1002> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host wikikube-ctrl2002.codfw.wmnet with OS bullseye | [production] | 
            
  | 14:43 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2158.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:43 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2158.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 14:43 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2151 (T367856)', diff saved to https://phabricator.wikimedia.org/P65246 and previous config saved to /var/cache/conftool/dbconfig/20240620-144341-marostegui.json | [production] | 
            
  | 14:42 | <eevans@cumin1002> | END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore200[5-6].codfw.wmnet: Upgrade to Java 11 — T350567 - eevans@cumin1002 | [production] | 
            
  | 14:40 | <cgoubert@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 14:40 | <cgoubert@cumin1002> | START - Cookbook sre.hosts.rename from mw2364 to wikikube-worker2024 | [production] | 
            
  | 14:39 | <eevans@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on aqs1013.eqiad.wmnet with reason: Main board swap — T362033 | [production] | 
            
  | 14:39 | <eevans@cumin1002> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on aqs1013.eqiad.wmnet with reason: Main board swap — T362033 | [production] | 
            
  | 14:38 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from mw2363 to wikikube-worker2023 | [production] | 
            
  | 14:38 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker2023 | [production] | 
            
  | 14:38 | <taavi@cumin1002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudvirt1051.eqiad.wmnet with OS bookworm | [production] | 
            
  | 14:38 | <kamila@cumin1002> | START - Cookbook sre.hosts.reimage for host wikikube-ctrl2002.codfw.wmnet with OS bullseye | [production] | 
            
  | 14:37 | <cgoubert@cumin1002> | START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker2023 | [production] | 
            
  | 14:37 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 14:37 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw2363 to wikikube-worker2023 - cgoubert@cumin1002" | [production] | 
            
  | 14:37 | <jmm@cumin2002> | END (PASS) - Cookbook sre.debmonitor.remove-hosts (exit_code=0) for 1 hosts: mw2324.codfw.wmnet | [production] | 
            
  | 14:37 | <jmm@cumin2002> | START - Cookbook sre.debmonitor.remove-hosts for 1 hosts: mw2324.codfw.wmnet | [production] | 
            
  | 14:36 | <jmm@cumin2002> | END (PASS) - Cookbook sre.debmonitor.remove-hosts (exit_code=0) for 1 hosts: mw2323.codfw.wmnet | [production] | 
            
  | 14:36 | <jmm@cumin2002> | START - Cookbook sre.debmonitor.remove-hosts for 1 hosts: mw2323.codfw.wmnet | [production] | 
            
  | 14:36 | <jmm@cumin2002> | END (PASS) - Cookbook sre.debmonitor.remove-hosts (exit_code=0) for 1 hosts: mw1489.eqiad.wmnet | [production] | 
            
  | 14:36 | <jmm@cumin2002> | START - Cookbook sre.debmonitor.remove-hosts for 1 hosts: mw1489.eqiad.wmnet | [production] | 
            
  | 14:35 | <btullis@deploy1002> | helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/airflow: apply | [production] | 
            
  | 14:35 | <sukhe> | running authdns-update for CR 1047074 | [production] | 
            
  | 14:35 | <cgoubert@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw2363 to wikikube-worker2023 - cgoubert@cumin1002" | [production] | 
            
  | 14:34 | <btullis@deploy1002> | helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/airflow: apply | [production] | 
            
  | 14:32 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'db1165 (re)pooling @ 75%: post T367854 repool', diff saved to https://phabricator.wikimedia.org/P65245 and previous config saved to /var/cache/conftool/dbconfig/20240620-143244-arnaudb.json | [production] | 
            
  | 14:32 | <cgoubert@cumin1002> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 14:32 | <cgoubert@cumin1002> | START - Cookbook sre.hosts.rename from mw2363 to wikikube-worker2023 | [production] | 
            
  | 14:31 | <moritzm> | imported python-pymysql 1.0.2-2~wmf11u2 to apt.wikimedia.org (merge of the security fix from DSA 5700 on top of our internal backport) | [production] | 
            
  | 14:31 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'es1036 depool ahead of T365987', diff saved to https://phabricator.wikimedia.org/P65244 and previous config saved to /var/cache/conftool/dbconfig/20240620-143109-arnaudb.json | [production] | 
            
  | 14:30 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:30:00 on es1036.eqiad.wmnet with reason: T365987 | [production] | 
            
  | 14:30 | <eevans@cumin1002> | START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore200[5-6].codfw.wmnet: Upgrade to Java 11 — T350567 - eevans@cumin1002 | [production] | 
            
  | 14:30 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 1:30:00 on es1036.eqiad.wmnet with reason: T365987 | [production] | 
            
  | 14:29 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.hosts.rename (exit_code=0) from mw2362 to wikikube-worker2022 | [production] | 
            
  | 14:29 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host wikikube-worker2022 | [production] | 
            
  | 14:29 | <eevans@cumin1002> | END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore2004.codfw.wmnet: Upgrade to Java 11 — T350567 - eevans@cumin1002 | [production] | 
            
  | 14:28 | <cgoubert@cumin1002> | START - Cookbook sre.network.configure-switch-interfaces for host wikikube-worker2022 | [production] | 
            
  | 14:28 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 14:28 | <cgoubert@cumin1002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw2362 to wikikube-worker2022 - cgoubert@cumin1002" | [production] | 
            
  | 14:28 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2151', diff saved to https://phabricator.wikimedia.org/P65243 and previous config saved to /var/cache/conftool/dbconfig/20240620-142834-marostegui.json | [production] | 
            
  | 14:27 | <cgoubert@cumin1002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Renaming mw2362 to wikikube-worker2022 - cgoubert@cumin1002" | [production] | 
            
  | 14:27 | <cdanis@deploy1002> | helmfile [codfw] DONE helmfile.d/services/mw-api-int: apply | [production] | 
            
  | 14:26 | <sukhe> | sudo cumin 'O:alerting_host' 'run-puppet-agent' | [production] | 
            
  | 14:25 | <cdanis@deploy1002> | helmfile [codfw] START helmfile.d/services/mw-api-int: apply | [production] | 
            
  | 14:25 | <cdanis@deploy1002> | helmfile [eqiad] DONE helmfile.d/services/mw-api-int: apply | [production] | 
            
  | 14:25 | <elukey@cumin1002> | END (PASS) - Cookbook sre.deploy.python-code (exit_code=0) homer to cumin2002.codfw.wmnet,cumin1002.eqiad.wmnet with reason: Update wmf-plugin for K8s ml-staging - elukey@cumin1002 | [production] |