| 
      
        2024-03-04
      
      ยง
     | 
  
    
  | 13:17 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . | 
  [production] | 
            
  | 13:17 | 
  <cgoubert@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1353.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 13:17 | 
  <dcaro@cumin1002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 13:17 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . | 
  [production] | 
            
  | 13:17 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-drafttopic' for release 'main' . | 
  [production] | 
            
  | 13:16 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . | 
  [production] | 
            
  | 13:15 | 
  <dcaro@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 13:15 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-draftquality' for release 'main' . | 
  [production] | 
            
  | 13:14 | 
  <cgoubert@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1351.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 13:14 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . | 
  [production] | 
            
  | 13:14 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articletopic' for release 'main' . | 
  [production] | 
            
  | 13:13 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . | 
  [production] | 
            
  | 13:12 | 
  <moritzm> | 
  installing jqueryui security updates | 
  [production] | 
            
  | 13:12 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'revscoring-articlequality' for release 'main' . | 
  [production] | 
            
  | 13:12 | 
  <cgoubert@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1352.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 13:10 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-eqiad] Ran 'sync' command on namespace 'experimental' for release 'main' . | 
  [production] | 
            
  | 13:10 | 
  <elukey@deploy2002> | 
  helmfile [ml-serve-codfw] Ran 'sync' command on namespace 'experimental' for release 'main' . | 
  [production] | 
            
  | 13:09 | 
  <cgoubert@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on mw1350.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 13:07 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on mw1354.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 13:07 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on mw1351.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 13:07 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on mw1353.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 13:07 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on mw1352.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 13:06 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on mw1350.eqiad.wmnet with reason: host reimage | 
  [production] | 
            
  | 12:54 | 
  <wmbot~dcaro@urcuchillay> | 
  END (PASS) - Cookbook wmcs.ceph.reboot_node (exit_code=0) (T359049) | 
  [admin] | 
            
  | 12:53 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host mw1354.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 12:53 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host mw1353.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 12:53 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host mw1352.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 12:52 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host mw1351.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 12:52 | 
  <cgoubert@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host mw1350.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 12:48 | 
  <wmbot~dcaro@urcuchillay> | 
  START - Cookbook wmcs.ceph.reboot_node (T359049) | 
  [admin] | 
            
  | 12:45 | 
  <jelto@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts etherpad1003.eqiad.wmnet | 
  [production] | 
            
  | 12:45 | 
  <jelto@cumin1002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 12:45 | 
  <jelto@cumin1002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: etherpad1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jelto@cumin1002" | 
  [production] | 
            
  | 12:45 | 
  <claime> | 
  Depooling mw1350.eqiad.wmnet,mw1351.eqiad.wmnet,mw1352.eqiad.wmnet,mw1353.eqiad.wmnet,mw1354.eqiad.wmnet for move to kubernetes - T351074 | 
  [production] | 
            
  | 12:43 | 
  <jelto@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: etherpad1003.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - jelto@cumin1002" | 
  [production] | 
            
  | 12:43 | 
  <taavi> | 
  reboot tools-sgegrid-shadow due to high number of procs in D state | 
  [tools] | 
            
  | 12:41 | 
  <jelto@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 12:38 | 
  <claime> | 
  Re-enabling puppet on C:profile::firewall::log::ferm to deploy new ferm_status.py - T354855 | 
  [production] | 
            
  | 12:37 | 
  <brouberol@deploy2002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset: apply | 
  [production] | 
            
  | 12:37 | 
  <brouberol@deploy2002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset: apply | 
  [production] | 
            
  | 12:36 | 
  <brouberol@deploy2002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/superset-next: apply | 
  [production] | 
            
  | 12:36 | 
  <brouberol@deploy2002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/superset-next: apply | 
  [production] | 
            
  | 12:35 | 
  <jelto@cumin1002> | 
  START - Cookbook sre.hosts.decommission for hosts etherpad1003.eqiad.wmnet | 
  [production] | 
            
  | 12:33 | 
  <claime> | 
  Enabling puppet on puppetboard2003 to test new ferm_status.py - T354855 | 
  [production] | 
            
  | 12:30 | 
  <claime> | 
  Enabling puppet on mw2322 to test new ferm_status.py - T354855 | 
  [production] | 
            
  | 12:28 | 
  <claime> | 
  Enabling puppet on kubernetes2019 to test new ferm_status.py - T354855 | 
  [production] | 
            
  | 12:22 | 
  <claime> | 
  Disabling puppet on C:profile::firewall::log::ferm to deploy new ferm_status.py - T354855 | 
  [production] | 
            
  | 12:22 | 
  <btullis> | 
  restarting hive-server2 and hive-metastore service on an-coord1003 | 
  [analytics] | 
            
  | 12:22 | 
  <claime> | 
  Uncordoning mw2314.codfw.wmnet mw2315.codfw.wmnet mw2316.codfw.wmnet mw2320.codfw.wmnet mw2321.codfw.wmnet mw2322.codfw.wmnet - T351074 | 
  [production] | 
            
  | 12:21 | 
  <cgoubert@cumin2002> | 
  conftool action : set/weight=10:pooled=yes; selector: name=(mw2314.codfw.wmnet|mw2315.codfw.wmnet|mw2316.codfw.wmnet|mw2320.codfw.wmnet|mw2321.codfw.wmnet|mw2322.codfw.wmnet),cluster=kubernetes,service=kubesvc | 
  [production] |