| 2024-05-14 |
  
    
  | 19:37 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update  mgmt  kafka-main1010 - vriley@cumin1002" | 
  [production] | 
            
  | 19:32 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 19:30 | 
  <vriley@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-main1008.mgmt.eqiad.wmnet with reboot policy FORCED | 
  [production] | 
            
  | 19:26 | 
  <jclark@cumin1002> | 
  START - Cookbook sre.hosts.reimage for host kafka-main1006.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 19:26 | 
  <andrew@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) | 
  [admin] | 
            
  | 19:25 | 
  <vriley@cumin1002> | 
  END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['kafka-main1006'] | 
  [production] | 
            
  | 19:24 | 
  <andrew@cloudcumin1001> | 
  START - Cookbook wmcs.openstack.restart_openstack | 
  [admin] | 
            
  | 19:23 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['kafka-main1006'] | 
  [production] | 
            
  | 19:19 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.hosts.provision for host kafka-main1009.mgmt.eqiad.wmnet with reboot policy FORCED | 
  [production] | 
            
  | 19:18 | 
  <vriley@cumin1002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 19:18 | 
  <cdanis> | 
  T364907 💔cdanis@apt1002.wikimedia.org ~ 🕞🍵 sudo -i reprepro --keepunreferencedfiles includedeb bullseye-wikimedia ~/otelcol-contrib_0.100.0_linux_amd64.deb | 
  [production] | 
            
  | 19:18 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.hosts.provision for host kafka-main1008.mgmt.eqiad.wmnet with reboot policy FORCED | 
  [production] | 
            
  | 19:17 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 19:16 | 
  <vriley@cumin1002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 19:16 | 
  <vriley@cumin1002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update  mgmt  kafka-main1008 - vriley@cumin1002" | 
  [production] | 
            
  | 19:16 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update  mgmt  kafka-main1008 - vriley@cumin1002" | 
  [production] | 
            
  | 19:13 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 18:18 | 
  <sukhe> | 
  restart pybal on backup LVSes | 
  [production] | 
            
  | 18:17 | 
  <sukhe> | 
  [CORRECTION] above pybal restart was NOT run | 
  [production] | 
            
  | 18:15 | 
  <amastilovic@deploy1002> | 
  Finished deploy [airflow-dags/analytics@6270c72]: (no justification provided) (duration: 00m 34s) | 
  [production] | 
            
  | 18:14 | 
  <amastilovic@deploy1002> | 
  Started deploy [airflow-dags/analytics@6270c72]: (no justification provided) | 
  [production] | 
            
  | 18:14 | 
  <andrew@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) | 
  [admin] | 
            
  | 18:14 | 
  <andrew@cloudcumin1001> | 
  START - Cookbook wmcs.openstack.restart_openstack | 
  [admin] | 
            
  | 18:10 | 
  <sukhe> | 
  sudo cumin -b1 -s120 'A:lvs' 'systemctl restart pybal.service': clearing up alert for reverted pybal.conf CR 1031470 | 
  [production] | 
            
  | 18:04 | 
  <andrew@cloudcumin1001> | 
  END (PASS) - Cookbook wmcs.openstack.restart_openstack (exit_code=0) | 
  [admin] | 
            
  | 18:02 | 
  <andrew@cloudcumin1001> | 
  START - Cookbook wmcs.openstack.restart_openstack | 
  [admin] | 
            
  | 17:51 | 
  <taavi> | 
  reload zuul for https://gerrit.wikimedia.org/r/c/integration/config/+/1031519 | 
  [releng] | 
            
  | 17:47 | 
  <ejegg> | 
  donorwiki upgraded from b005071a to fa7de70f | 
  [production] | 
            
  | 17:33 | 
  <ryankemper@cumin2002> | 
  END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-analytics cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 17:27 | 
  <ryankemper@cumin2002> | 
  START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-analytics cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 17:25 | 
  <ryankemper@cumin2002> | 
  END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 17:19 | 
  <ryankemper@cumin2002> | 
  START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-druid-public cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 17:18 | 
  <ryankemper@cumin2002> | 
  END (PASS) - Cookbook sre.zookeeper.roll-restart-zookeeper (exit_code=0) for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 17:12 | 
  <ryankemper@cumin2002> | 
  START - Cookbook sre.zookeeper.roll-restart-zookeeper for Zookeeper A:zookeeper-analytics cluster: Roll restart of jvm daemons. | 
  [production] | 
            
  | 17:11 | 
  <sfaci@deploy1002> | 
  helmfile [dse-k8s-eqiad] DONE helmfile.d/dse-k8s-services/mpic-next: apply | 
  [production] | 
            
  | 17:11 | 
  <sfaci@deploy1002> | 
  helmfile [dse-k8s-eqiad] START helmfile.d/dse-k8s-services/mpic-next: apply | 
  [production] | 
            
  | 17:09 | 
  <ryankemper@cumin2002> | 
  END (PASS) - Cookbook sre.druid.roll-restart-workers (exit_code=0) for Druid test cluster: Roll restart of Druid jvm daemons. | 
  [production] | 
            
  | 17:02 | 
  <vriley@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-main1007.mgmt.eqiad.wmnet with reboot policy FORCED | 
  [production] | 
            
  | 17:00 | 
  <wmbot~bd808@tools-bastion-12> | 
  Configure GITLAB_EVENTS_URL to point to http://gitlab-webhooks.tool-gitlab-webhooks.svc.tools.local:8000/sse/ to bypass k8s ingress and Toolforge front proxy. (T364490) | 
  [tools.wikibugs] | 
            
  | 17:00 | 
  <ryankemper@cumin2002> | 
  START - Cookbook sre.druid.roll-restart-workers for Druid test cluster: Roll restart of Druid jvm daemons. | 
  [production] | 
            
  | 16:51 | 
  <vriley@cumin1002> | 
  END (PASS) - Cookbook sre.hosts.provision (exit_code=0) for host kafka-main1006.mgmt.eqiad.wmnet with reboot policy FORCED | 
  [production] | 
            
  | 16:50 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.hosts.provision for host kafka-main1007.mgmt.eqiad.wmnet with reboot policy FORCED | 
  [production] | 
            
  | 16:49 | 
  <vriley@cumin1002> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 16:49 | 
  <vriley@cumin1002> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update  mgmt  kafka-main1007 - vriley@cumin1002" | 
  [production] | 
            
  | 16:48 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: update  mgmt  kafka-main1007 - vriley@cumin1002" | 
  [production] | 
            
  | 16:46 | 
  <wmbot~bd808@tools-bastion-12> | 
  Configure GITLAB_EVENTS_URL to point to http://gitlab-webhooks.tool-gitlab-webhooks.svc.tools.local:8000/sse/ as a test of bypassing k8s ingress (T364490) | 
  [tools.wikibugs-testing] | 
            
  | 16:46 | 
  <vriley@cumin1002> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 16:44 | 
  <pfischer@deploy1002> | 
  helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | 
  [production] | 
            
  | 16:41 | 
  <dzahn@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on mw2286.codfw.wmnet with reason: T364863 | 
  [production] | 
            
  | 16:40 | 
  <dzahn@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on mw2286.codfw.wmnet with reason: T364863 | 
  [production] |