| 2023-11-17
      
      § | 
    
  | 04:44 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1146:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53534 and previous config saved to /var/cache/conftool/dbconfig/20231117-044443-arnaudb.json | [production] | 
            
  | 04:29 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1146:3314', diff saved to https://phabricator.wikimedia.org/P53533 and previous config saved to /var/cache/conftool/dbconfig/20231117-042937-arnaudb.json | [production] | 
            
  | 04:14 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1146:3314', diff saved to https://phabricator.wikimedia.org/P53532 and previous config saved to /var/cache/conftool/dbconfig/20231117-041430-arnaudb.json | [production] | 
            
  | 03:59 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1146:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53531 and previous config saved to /var/cache/conftool/dbconfig/20231117-035924-arnaudb.json | [production] | 
            
  | 01:19 | <cstone> | payments-wiki upgraded from eae2f35e to 56790715 | [production] | 
            
  | 01:12 | <jclark@cumin1001> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1158.eqiad.wmnet with OS bullseye | [production] | 
            
  | 01:00 | <jclark@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['an-worker1158'] | [production] | 
            
  | 00:55 | <jclark@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1158'] | [production] | 
            
  | 00:50 | <jclark@cumin1001> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1157.eqiad.wmnet with OS bullseye | [production] | 
            
  | 00:48 | <ejegg> | fundraising civiproxy upgraded from c000fc1e to 6625c844 | [production] | 
            
  | 00:39 | <jclark@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=1) upgrade firmware for hosts ['an-worker1157'] | [production] | 
            
  | 00:32 | <jclark@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1157'] | [production] | 
            
  
    | 2023-11-16
      
      § | 
    
  | 23:52 | <jclark@cumin1001> | END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['an-worker1158'] | [production] | 
            
  | 23:51 | <jclark@cumin1001> | START - Cookbook sre.hosts.reimage for host an-worker1158.eqiad.wmnet with OS bullseye | [production] | 
            
  | 23:46 | <jclark@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1158'] | [production] | 
            
  | 23:43 | <samtar@deploy2002> | Finished scap: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] (duration: 07m 31s) | [production] | 
            
  | 23:37 | <samtar@deploy2002> | samtar: Continuing with sync | [production] | 
            
  | 23:37 | <samtar@deploy2002> | samtar: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 23:35 | <samtar@deploy2002> | Started scap: Backport for [[gerrit:975029|Revert "Disable drawer temporarily while erroring"]] | [production] | 
            
  | 23:34 | <samtar@deploy2002> | Sync cancelled. | [production] | 
            
  | 23:33 | <topranks> | Change VRRP IP for public1-a-codfw vlan on codfw CRs T347191 | [production] | 
            
  | 23:30 | <topranks> | Add gateway IP for public1-a-codfw Vlan to ssw in codfw T347191 | [production] | 
            
  | 23:30 | <jclark@cumin1001> | START - Cookbook sre.hosts.reimage for host an-worker1157.eqiad.wmnet with OS bullseye | [production] | 
            
  | 23:29 | <samtar@deploy2002> | jdlrobson and samtar: Backport for [[gerrit:975097|Disable drawer temporarily while erroring (T351362)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 23:29 | <jclark@cumin1001> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host an-worker1157.eqiad.wmnet with OS bullseye | [production] | 
            
  | 23:28 | <samtar@deploy2002> | Started scap: Backport for [[gerrit:975097|Disable drawer temporarily while erroring (T351362)]] | [production] | 
            
  | 23:28 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 23:28 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove old vlan 2001 entries - cmooney@cumin1001" | [production] | 
            
  | 23:27 | <cmooney@cumin1001> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove old vlan 2001 entries - cmooney@cumin1001" | [production] | 
            
  | 23:25 | <cmooney@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 23:10 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1:00:00 on cr[1-2]-codfw,cr[1-2]-codfw IPv6 with reason: Move public1-a-codfw vlan GW from codfw CR routers to ssw | [production] | 
            
  | 23:10 | <cmooney@cumin1001> | START - Cookbook sre.hosts.downtime for 1:00:00 on cr[1-2]-codfw,cr[1-2]-codfw IPv6 with reason: Move public1-a-codfw vlan GW from codfw CR routers to ssw | [production] | 
            
  | 22:39 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Depooling db1146:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53529 and previous config saved to /var/cache/conftool/dbconfig/20231116-223915-arnaudb.json | [production] | 
            
  | 22:39 | <arnaudb@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 22:38 | <arnaudb@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1146.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 22:36 | <mutante> | disabled puppet on miscweb*, netmon* and phab* hosts, deploying gerrit:974285, confirming noop | [production] | 
            
  | 22:31 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 22:31 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove old vlan 1117 entries - cmooney@cumin1001" | [production] | 
            
  | 22:30 | <cmooney@cumin1001> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "Triggered by cookbooks.sre.dns.netbox: Remove old vlan 1117 entries - cmooney@cumin1001" | [production] | 
            
  | 22:29 | <cmooney@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 22:09 | <jclark@cumin1001> | START - Cookbook sre.hosts.reimage for host an-worker1157.eqiad.wmnet with OS bullseye | [production] | 
            
  | 22:00 | <dr0ptp4kt@deploy2002> | Finished scap: Backport for [[gerrit:975028|Make the feed gracefully handle long snippets and urls (T347732 T351463)]] (duration: 09m 50s) | [production] | 
            
  | 21:59 | <jclark@cumin1001> | END (PASS) - Cookbook sre.hardware.upgrade-firmware (exit_code=0) upgrade firmware for hosts ['an-worker1157'] | [production] | 
            
  | 21:54 | <dr0ptp4kt@deploy2002> | dr0ptp4kt and soda: Continuing with sync | [production] | 
            
  | 21:53 | <jclark@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1157'] | [production] | 
            
  | 21:53 | <jclark@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['an-worker1157'] | [production] | 
            
  | 21:53 | <jclark@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1157'] | [production] | 
            
  | 21:52 | <jclark@cumin1001> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['an-worker1157'] | [production] | 
            
  | 21:52 | <jclark@cumin1001> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['an-worker1157'] | [production] | 
            
  | 21:51 | <dr0ptp4kt@deploy2002> | dr0ptp4kt and soda: Backport for [[gerrit:975028|Make the feed gracefully handle long snippets and urls (T347732 T351463)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] |