| 
      
        2024-12-18
      
      §
     | 
  
    
  | 23:41 | 
  <jhancock@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host es1043.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 22:58 | 
  <jhancock@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host es1043.eqiad.wmnet with OS bookworm | 
  [production] | 
            
  | 21:31 | 
  <mfossati@deploy2002> | 
  Finished deploy [airflow-dags/platform_eng@a43cacf]: bump image suggestions, section topics, and SEAL (duration: 01m 43s) | 
  [production] | 
            
  | 21:30 | 
  <mfossati@deploy2002> | 
  Started deploy [airflow-dags/platform_eng@a43cacf]: bump image suggestions, section topics, and SEAL | 
  [production] | 
            
  | 20:44 | 
  <cjming@deploy2002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic: apply | 
  [production] | 
            
  | 20:44 | 
  <cjming@deploy2002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic: apply | 
  [production] | 
            
  | 20:36 | 
  <cjming@deploy2002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply | 
  [production] | 
            
  | 20:36 | 
  <cjming@deploy2002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply | 
  [production] | 
            
  | 20:29 | 
  <otto@deploy2002> | 
  helmfile [codfw] DONE helmfile.d/services/eventgate-analytics-external: sync | 
  [production] | 
            
  | 20:28 | 
  <otto@deploy2002> | 
  helmfile [codfw] START helmfile.d/services/eventgate-analytics-external: sync | 
  [production] | 
            
  | 20:28 | 
  <otto@deploy2002> | 
  helmfile [eqiad] DONE helmfile.d/services/eventgate-analytics-external: sync | 
  [production] | 
            
  | 20:27 | 
  <otto@deploy2002> | 
  helmfile [eqiad] START helmfile.d/services/eventgate-analytics-external: sync | 
  [production] | 
            
  | 20:27 | 
  <otto@deploy2002> | 
  helmfile [staging] DONE helmfile.d/services/eventgate-analytics-external: sync | 
  [production] | 
            
  | 20:27 | 
  <otto@deploy2002> | 
  helmfile [staging] START helmfile.d/services/eventgate-analytics-external: sync | 
  [production] | 
            
  | 20:26 | 
  <ottomata> | 
  restarting eventgate-analytics-external to clear schema cache - T382113 |  https://phabricator.wikimedia.org/T382113#10414005 | 
  [production] | 
            
  | 19:28 | 
  <dancy@deploy2002> | 
  rebuilt and synchronized wikiversions files: group1 to 1.44.0-wmf.8  refs T375667 | 
  [production] | 
            
  | 18:55 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" | 
  [production] | 
            
  | 18:41 | 
  <wmbot~anticomposite@tools-bastion-13> | 
  kubectl rollout restart deployment flr # bot not processing files | 
  [tools.yifeibot] | 
            
  | 18:40 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1069.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 18:37 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1069.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 18:25 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host an-worker1069.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:25 | 
  <btullis@cumin1002> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1069.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:23 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.hadoop.init-hadoop-workers (exit_code=0) for hosts an-worker1068.eqiad.wmnet | 
  [production] | 
            
  | 18:21 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hadoop.init-hadoop-workers for hosts an-worker1068.eqiad.wmnet | 
  [production] | 
            
  | 18:20 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host an-worker1069.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:18 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host an-worker1068.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:18 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" | 
  [production] | 
            
  | 18:18 | 
  <eevans@cumin1002> | 
  END (PASS) - Cookbook sre.cassandra.roll-restart (exit_code=0) for nodes matching sessionstore1*.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 | 
  [production] | 
            
  | 18:16 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - btullis@cumin1002" | 
  [production] | 
            
  | 18:16 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host an-worker1067.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:15 | 
  <btullis@cumin1002> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1067.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:13 | 
  <btullis@cumin1002> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1069.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:09 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host an-worker1067.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:06 | 
  <lucaswerkmeister> | 
  add samtar and remove toolforge-standards-committee per T380537 | 
  [tools.bullseye] | 
            
  | 18:05 | 
  <btullis@cumin1002> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host an-worker1067.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:04 | 
  <lucaswerkmeister> | 
  sudo rm __pycache__/settings*.pyc # T380537 | 
  [tools.bullseye] | 
            
  | 18:03 | 
  <lucaswerkmeister> | 
  sed -i -E "/(SPUR|SHODAN)_KEY/ s/'[^']*'/'expunged (T380537)'/" settings.py | 
  [tools.bullseye] | 
            
  | 18:01 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on an-worker1068.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 18:01 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host an-worker1069.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 18:00 | 
  <lucaswerkmeister> | 
  sudo install -m600 settings.py{,-before-T380537} | 
  [tools.bullseye] | 
            
  | 18:00 | 
  <eevans@cumin1002> | 
  START - Cookbook sre.cassandra.roll-restart for nodes matching sessionstore1*.eqiad.wmnet: Upgrading to Cassandra 4.1.7 — T380420 - eevans@cumin1002 | 
  [production] | 
            
  | 17:59 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host an-worker1069 | 
  [production] | 
            
  | 17:58 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on an-worker1068.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 17:58 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.network.configure-switch-interfaces for host an-worker1069 | 
  [production] | 
            
  | 17:57 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 17:57 | 
  <btullis@cumin1002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Re-commissioning an-presto1005 as an-worker1069 - btullis@cumin1002" | 
  [production] | 
            
  | 17:57 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Re-commissioning an-presto1005 as an-worker1069 - btullis@cumin1002" | 
  [production] | 
            
  | 17:57 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host an-worker1067.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 17:55 | 
  <btullis@cumin1002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1067.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 17:51 | 
  <btullis@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] |