| 
      
        2023-08-31
      
     | 
  
    
  | 20:36 | 
  <jhuneidi@deploy1002> | 
  Started scap: Backport for [[gerrit:953660|WatchlistManager: Do not require watchlist rights for clearing talk page notification (T345031)]] | 
  [production] | 
            
  | 20:34 | 
  <jhuneidi@deploy1002> | 
  Finished scap: Backport for [[gerrit:950046|Undeploy Research Incentive survey on enwiki (T336092)]], [[gerrit:954079|Pre-deploy Campaigns Event Discovery survey (T345158)]] (duration: 14m 19s) | 
  [production] | 
            
  | 20:32 | 
  <brett@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host doh6002.wikimedia.org with OS bookworm | 
  [production] | 
            
  | 20:29 | 
  <jhuneidi@deploy1002> | 
  jhuneidi and dani: Continuing with sync | 
  [production] | 
            
  | 20:28 | 
  <eevans@cumin1001> | 
  END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:28 | 
  <eevans@cumin1001> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:27 | 
  <ryankemper@cumin1001> | 
  START - Cookbook sre.wdqs.restart | 
  [production] | 
            
  | 20:27 | 
  <eevans@cumin1001> | 
  END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:27 | 
  <eevans@cumin1001> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:26 | 
  <eevans@cumin1001> | 
  END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:26 | 
  <eevans@cumin1001> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:25 | 
  <eevans@cumin1001> | 
  END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:21 | 
  <jhuneidi@deploy1002> | 
  jhuneidi and dani: Backport for [[gerrit:950046|Undeploy Research Incentive survey on enwiki (T336092)]], [[gerrit:954079|Pre-deploy Campaigns Event Discovery survey (T345158)]] synced to the testservers mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet, mwdebug1002.eqiad.wmnet, and mw-debug kubernetes deployment (accessible via k8s-experimental XWD option) | 
  [production] | 
            
  | 20:20 | 
  <jhancock@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2038.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:20 | 
  <jhancock@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host kubernetes2038.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:20 | 
  <jhuneidi@deploy1002> | 
  Started scap: Backport for [[gerrit:950046|Undeploy Research Incentive survey on enwiki (T336092)]], [[gerrit:954079|Pre-deploy Campaigns Event Discovery survey (T345158)]] | 
  [production] | 
            
  | 20:18 | 
  <eevans@cumin1001> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:17 | 
  <eevans@cumin1001> | 
  END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:16 | 
  <inflatador> | 
  'bking@wdqs1004 depool wdqs1004 to test script changes T342361' | 
  [production] | 
            
  | 20:13 | 
  <jhancock@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2039.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:13 | 
  <jhancock@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host kubernetes2039.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:11 | 
  <ryankemper@cumin1001> | 
  END (PASS) - Cookbook sre.hosts.decommission (exit_code=0) for hosts wdqs1005.eqiad.wmnet | 
  [production] | 
            
  | 20:11 | 
  <ryankemper@cumin1001> | 
  END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | 
  [production] | 
            
  | 20:11 | 
  <ryankemper@cumin1001> | 
  END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wdqs1005.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - ryankemper@cumin1001" | 
  [production] | 
            
  | 20:11 | 
  <brett@cumin2002> | 
  END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on doh6002.wikimedia.org with reason: host reimage | 
  [production] | 
            
  | 20:11 | 
  <jhancock@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2039.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:11 | 
  <jhancock@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host kubernetes2039.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:09 | 
  <eevans@cumin1001> | 
  START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['restbase1030.eqiad.wmnet'] | 
  [production] | 
            
  | 20:07 | 
  <brett@cumin2002> | 
  START - Cookbook sre.hosts.downtime for 2:00:00 on doh6002.wikimedia.org with reason: host reimage | 
  [production] | 
            
  | 20:07 | 
  <eevans@cumin1001> | 
  END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host restbase1030.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 20:01 | 
  <jhancock@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2038.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:01 | 
  <jhancock@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host kubernetes2038.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:00 | 
  <jhancock@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host kubernetes2037.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:00 | 
  <jhancock@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2038.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:00 | 
  <jhancock@cumin2002> | 
  END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host kubernetes2039.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 20:00 | 
  <jhancock@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host kubernetes2038.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 19:59 | 
  <jhancock@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host kubernetes2039.codfw.wmnet with OS bullseye | 
  [production] | 
            
  | 19:51 | 
  <eevans@cumin1001> | 
  START - Cookbook sre.hosts.reimage for host restbase1030.eqiad.wmnet with OS bullseye | 
  [production] | 
            
  | 19:48 | 
  <ryankemper@cumin1001> | 
  END (FAIL) - Cookbook sre.wdqs.restart (exit_code=99) | 
  [production] | 
            
  | 19:45 | 
  <brett@cumin2002> | 
  START - Cookbook sre.hosts.reimage for host doh6002.wikimedia.org with OS bookworm | 
  [production] | 
            
  | 19:44 | 
  <ryankemper@cumin1001> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: wdqs1005.eqiad.wmnet decommissioned, removing all IPs except the asset tag one - ryankemper@cumin1001" | 
  [production] | 
            
  | 19:33 | 
  <ryankemper@cumin1001> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 19:30 | 
  <cmooney@cumin1001> | 
  START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Add management record for lsw1-a3-codfw - cmooney@cumin1001" | 
  [production] | 
            
  | 19:30 | 
  <ryankemper@cumin1001> | 
  START - Cookbook sre.wdqs.restart | 
  [production] | 
            
  | 19:28 | 
  <ryankemper@cumin1001> | 
  START - Cookbook sre.hosts.decommission for hosts wdqs1005.eqiad.wmnet | 
  [production] | 
            
  | 19:14 | 
  <cmooney@cumin1001> | 
  START - Cookbook sre.dns.netbox | 
  [production] | 
            
  | 19:14 | 
  <cmooney@cumin1001> | 
  START - Cookbook sre.network.provision for device lsw1-a3-codfw.mgmt.codfw.wmnet | 
  [production] | 
            
  | 19:07 | 
  <ryankemper@cumin1001> | 
  END (PASS) - Cookbook sre.wdqs.restart (exit_code=0) | 
  [production] | 
            
  | 19:03 | 
  <ryankemper> | 
  T344198 on `ryankemper@cumin1001`: `sudo -E cumin 'A:wdqs-all' 'sudo disable-puppet "revoking old cert and generating new one with new alt_names - T344198"'` | 
  [production] | 
            
  | 19:03 | 
  <ryankemper> | 
  T344198 Temporarily disabling puppet on all `wdqs*` hosts in preparation for `wdqs.discovery.wmnet` certificate revocation | 
  [production] |