| 
      
        2024-02-16
      
      ยง
     | 
  
    
  | 10:50 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1249 (T352010)', diff saved to https://phabricator.wikimedia.org/P56891 and previous config saved to /var/cache/conftool/dbconfig/20240216-105041-ladsgroup.json | 
  [production] | 
            
  | 10:44 | 
  <volans@cumin1002> | 
  END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts sretest1001.eqiad.wmnet | 
  [production] | 
            
  | 10:44 | 
  <volans@cumin1002> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sretest1001.eqiad.wmnet | 
  [production] | 
            
  | 10:43 | 
  <volans@cumin1002> | 
  END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts sretest1001.eqiad.wmnet | 
  [production] | 
            
  | 10:42 | 
  <volans@cumin1002> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts sretest1001.eqiad.wmnet | 
  [production] | 
            
  | 10:41 | 
  <hnowlan@cumin2002> | 
  conftool action : set/pooled=yes; selector: name=mw2379.codfw.wmnet | 
  [production] | 
            
  | 10:37 | 
  <wmbot~dcaro@urcuchillay> | 
  END (PASS) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=0) | 
  [tools] | 
            
  | 10:35 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P56890 and previous config saved to /var/cache/conftool/dbconfig/20240216-103535-ladsgroup.json | 
  [production] | 
            
  | 10:32 | 
  <wmbot~dcaro@urcuchillay> | 
  START - Cookbook wmcs.openstack.cloudvirt.vm_console | 
  [tools] | 
            
  | 10:32 | 
  <wmbot~dcaro@urcuchillay> | 
  END (ERROR) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=255) | 
  [tools] | 
            
  | 10:31 | 
  <wmbot~dcaro@urcuchillay> | 
  START - Cookbook wmcs.openstack.cloudvirt.vm_console | 
  [tools] | 
            
  | 10:31 | 
  <wmbot~dcaro@urcuchillay> | 
  END (ERROR) - Cookbook wmcs.openstack.cloudvirt.vm_console (exit_code=255) | 
  [tools] | 
            
  | 10:31 | 
  <wmbot~dcaro@urcuchillay> | 
  START - Cookbook wmcs.openstack.cloudvirt.vm_console | 
  [tools] | 
            
  | 10:20 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1249', diff saved to https://phabricator.wikimedia.org/P56889 and previous config saved to /var/cache/conftool/dbconfig/20240216-102028-ladsgroup.json | 
  [production] | 
            
  | 10:05 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1249 (T352010)', diff saved to https://phabricator.wikimedia.org/P56888 and previous config saved to /var/cache/conftool/dbconfig/20240216-100521-ladsgroup.json | 
  [production] | 
            
  | 10:03 | 
  <arnaudb@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3 days, 0:00:00 on db2194.codfw.wmnet with reason: Silence for WE | 
  [production] | 
            
  | 10:03 | 
  <arnaudb@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 3 days, 0:00:00 on db2194.codfw.wmnet with reason: Silence for WE | 
  [production] | 
            
  | 09:59 | 
  <taavi@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster | 
  [tools] | 
            
  | 09:59 | 
  <taavi@cloudcumin1001> | 
  Added a new k8s worker-nfs tools-k8s-worker-nfs-36.tools.eqiad1.wikimedia.cloud to the cluster | 
  [tools] | 
            
  | 09:49 | 
  <taavi@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | 
  [tools] | 
            
  | 09:49 | 
  <taavi@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-80 | 
  [tools] | 
            
  | 09:49 | 
  <taavi@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-80 | 
  [tools] | 
            
  | 09:45 | 
  <taavi@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster | 
  [tools] | 
            
  | 09:45 | 
  <taavi@cloudcumin1001> | 
  Added a new k8s worker-nfs tools-k8s-worker-nfs-35.tools.eqiad1.wikimedia.cloud to the cluster | 
  [tools] | 
            
  | 09:45 | 
  <dcaro> | 
  restarted webservice as it was giving 500 errors, seems back online | 
  [tools.admin] | 
            
  | 09:35 | 
  <taavi@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | 
  [tools] | 
            
  | 09:35 | 
  <taavi@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-79 | 
  [tools] | 
            
  | 09:34 | 
  <taavi@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-79 | 
  [tools] | 
            
  | 09:24 | 
  <taavi@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster | 
  [tools] | 
            
  | 09:24 | 
  <taavi@cloudcumin1001> | 
  Added a new k8s worker-nfs tools-k8s-worker-nfs-34.tools.eqiad1.wikimedia.cloud to the cluster | 
  [tools] | 
            
  | 09:13 | 
  <taavi@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | 
  [tools] | 
            
  | 09:07 | 
  <jclark@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host restbase1036.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 09:07 | 
  <jclark@cumin1002> | 
  END (FAIL) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=99) generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002" | 
  [production] | 
            
  | 09:06 | 
  <jclark@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.hosts.reimage: Host reimage - jclark@cumin1002" | 
  [production] | 
            
  | 09:06 | 
  <taavi@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-78 | 
  [tools] | 
            
  | 09:05 | 
  <taavi@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-78 | 
  [tools] | 
            
  | 09:05 | 
  <taavi@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.add_k8s_node (exit_code=0) for a worker-nfs role in the tools cluster | 
  [tools] | 
            
  | 09:05 | 
  <taavi@cloudcumin1001> | 
  Added a new k8s worker-nfs tools-k8s-worker-nfs-33.tools.eqiad1.wikimedia.cloud to the cluster | 
  [tools] | 
            
  | 08:55 | 
  <taavi@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.add_k8s_node for a worker-nfs role in the tools cluster | 
  [tools] | 
            
  | 08:55 | 
  <taavi@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.toolforge.remove_k8s_node (exit_code=0) for host tools-k8s-worker-77 | 
  [tools] | 
            
  | 08:54 | 
  <taavi@cloudcumin1001> | 
  START - Cookbook wmcs.toolforge.remove_k8s_node for host tools-k8s-worker-77 | 
  [tools] | 
            
  | 08:38 | 
  <jclark@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host an-redacteddb1001.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 08:07 | 
  <jclark@cumin1002> | 
  END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['an-redacteddb1001'] | 
  [production] | 
            
  | 08:07 | 
  <jclark@cumin1002> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-redacteddb1001'] | 
  [production] | 
            
  | 06:04 | 
  <apergos> | 
  manually generating 7z files in parallel for wikidata full history dumps run, in screen session, owned by ariel, on snapshot1009 | 
  [production] | 
            
  | 05:20 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Depooling db1249 (T352010)', diff saved to https://phabricator.wikimedia.org/P56887 and previous config saved to /var/cache/conftool/dbconfig/20240216-052044-ladsgroup.json | 
  [production] | 
            
  | 05:20 | 
  <ladsgroup@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1249.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 05:20 | 
  <ladsgroup@cumin1002> | 
  START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1249.eqiad.wmnet with reason: Maintenance | 
  [production] | 
            
  | 05:20 | 
  <ladsgroup@cumin1002> | 
  dbctl commit (dc=all): 'Repooling after maintenance db1248 (T352010)', diff saved to https://phabricator.wikimedia.org/P56886 and previous config saved to /var/cache/conftool/dbconfig/20240216-052021-ladsgroup.json | 
  [production] | 
            
  | 05:05 | 
  <ladsgroup@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2139.codfw.wmnet with reason: Maintenance | 
  [production] |