| 2023-05-09 |
    
  | 14:29 | <jclark@cumin1001> | START - Cookbook sre.network.configure-switch-interfaces for host backup1010 | [production] | 
            
  | 14:29 | <jclark@cumin1001> | END (PASS) - Cookbook sre.network.configure-switch-interfaces (exit_code=0) for host backup1011 | [production] | 
            
  | 14:29 | <jclark@cumin1001> | START - Cookbook sre.network.configure-switch-interfaces for host backup1011 | [production] | 
            
  | 14:27 | <jclark@cumin1001> | END (PASS) - Cookbook sre.dns.netbox (exit_code=0) | [production] | 
            
  | 14:25 | <jclark@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 14:24 | <pt1979@cumin2002> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['db2180'] | [production] | 
            
  | 14:23 | <pt1979@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['db2180'] | [production] | 
            
  | 14:20 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2153', diff saved to https://phabricator.wikimedia.org/P48018 and previous config saved to /var/cache/conftool/dbconfig/20230509-142044-ladsgroup.json | [production] | 
            
  | 14:15 | <sukhe> | set routing-options static route 208.80.153.240/28 next-hop 10.192.49.7 [move static route for high-traffic2 to lvs2010]: T335777 | [production] | 
            
  | 14:15 | <jmm@cumin2002> | START - Cookbook sre.ganeti.reimage for host testvm2005.codfw.wmnet with OS bookworm | [production] | 
            
  | 14:14 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1198', diff saved to https://phabricator.wikimedia.org/P48017 and previous config saved to /var/cache/conftool/dbconfig/20230509-141421-ladsgroup.json | [production] | 
            
  | 14:08 | <sukhe@deploy1002> | Locking from deployment [ALL REPOSITORIES]: LVS reimaging in codfw, blocking deploys T326767 | [production] | 
            
  | 14:05 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2153 (T335845)', diff saved to https://phabricator.wikimedia.org/P48016 and previous config saved to /var/cache/conftool/dbconfig/20230509-140535-ladsgroup.json | [production] | 
            
  | 13:59 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1198 (T335845)', diff saved to https://phabricator.wikimedia.org/P48015 and previous config saved to /var/cache/conftool/dbconfig/20230509-135915-ladsgroup.json | [production] | 
            
  | 13:58 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depooling db2153 (T335845)', diff saved to https://phabricator.wikimedia.org/P48014 and previous config saved to /var/cache/conftool/dbconfig/20230509-135815-ladsgroup.json | [production] | 
            
  | 13:58 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2153.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 13:57 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2153.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 13:57 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2146 (T335845)', diff saved to https://phabricator.wikimedia.org/P48013 and previous config saved to /var/cache/conftool/dbconfig/20230509-135750-ladsgroup.json | [production] | 
            
  | 13:49 | <taavi@deploy1002> | Finished scap: Backport for [[gerrit:910768|Add $wmgUseRealMe (T324535)]] (duration: 07m 51s) | [production] | 
            
  | 13:49 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depooling db1198 (T335845)', diff saved to https://phabricator.wikimedia.org/P48012 and previous config saved to /var/cache/conftool/dbconfig/20230509-134952-ladsgroup.json | [production] | 
            
  | 13:49 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db1198.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 13:49 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db1198.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 13:49 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1189 (T335845)', diff saved to https://phabricator.wikimedia.org/P48011 and previous config saved to /var/cache/conftool/dbconfig/20230509-134929-ladsgroup.json | [production] | 
            
  | 13:49 | <marostegui@cumin1001> | dbctl commit (dc=all): 'Depool db2180 T336031', diff saved to https://phabricator.wikimedia.org/P48010 and previous config saved to /var/cache/conftool/dbconfig/20230509-134921-root.json | [production] | 
            
  | 13:44 | <moritzm> | rearmed keyholder on netmon* post reboot | [production] | 
            
  | 13:43 | <taavi@deploy1002> | taavi: Backport for [[gerrit:910768|Add $wmgUseRealMe (T324535)]] synced to the testservers: mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug2001.codfw.wmnet, mwdebug1001.eqiad.wmnet | [production] | 
            
  | 13:42 | <sukhe> | sudo cumin -b1 -s1200 'A:cp and A:esams' 'varnish-frontend-restart': T253093 | [production] | 
            
  | 13:42 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2146', diff saved to https://phabricator.wikimedia.org/P48009 and previous config saved to /var/cache/conftool/dbconfig/20230509-134244-ladsgroup.json | [production] | 
            
  | 13:42 | <taavi@deploy1002> | Started scap: Backport for [[gerrit:910768|Add $wmgUseRealMe (T324535)]] | [production] | 
            
  | 13:38 | <taavi@deploy1002> | Finished scap: Backport for [[gerrit:910767|Add RealMe to extension-list (T324535)]] (duration: 35m 47s) | [production] | 
            
  | 13:34 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1189', diff saved to https://phabricator.wikimedia.org/P48008 and previous config saved to /var/cache/conftool/dbconfig/20230509-133416-ladsgroup.json | [production] | 
            
  | 13:28 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 4:00:00 on an-worker1088.eqiad.wmnet with reason: Replacing RAID controller battery | [production] | 
            
  | 13:28 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host an-test-client1001.eqiad.wmnet | [production] | 
            
  | 13:28 | <btullis@cumin1001> | START - Cookbook sre.hosts.downtime for 4:00:00 on an-worker1088.eqiad.wmnet with reason: Replacing RAID controller battery | [production] | 
            
  | 13:27 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2146', diff saved to https://phabricator.wikimedia.org/P48007 and previous config saved to /var/cache/conftool/dbconfig/20230509-132737-ladsgroup.json | [production] | 
            
  | 13:27 | <moritzm> | updated bookworm d-i image to 2023-05-09 daily build T330495 | [production] | 
            
  | 13:23 | <btullis@cumin1001> | START - Cookbook sre.hosts.reboot-single for host an-test-client1001.eqiad.wmnet | [production] | 
            
  | 13:23 | <taavi@deploy1002> | taavi: Backport for [[gerrit:910767|Add RealMe to extension-list (T324535)]] synced to the testservers: mwdebug2001.codfw.wmnet, mwdebug1002.eqiad.wmnet, mwdebug2002.codfw.wmnet, mwdebug1001.eqiad.wmnet | [production] | 
            
  | 13:23 | <btullis@cumin1001> | END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for an-worker1088.eqiad.wmnet | [production] | 
            
  | 13:23 | <btullis@cumin1001> | START - Cookbook sre.hosts.remove-downtime for an-worker1088.eqiad.wmnet | [production] | 
            
  | 13:19 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1189', diff saved to https://phabricator.wikimedia.org/P48006 and previous config saved to /var/cache/conftool/dbconfig/20230509-131910-ladsgroup.json | [production] | 
            
  | 13:12 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2146 (T335845)', diff saved to https://phabricator.wikimedia.org/P48005 and previous config saved to /var/cache/conftool/dbconfig/20230509-131231-ladsgroup.json | [production] | 
            
  | 13:05 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Depooling db2146 (T335845)', diff saved to https://phabricator.wikimedia.org/P48004 and previous config saved to /var/cache/conftool/dbconfig/20230509-130524-ladsgroup.json | [production] | 
            
  | 13:05 | <ladsgroup@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2146.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 13:05 | <ladsgroup@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2146.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 13:05 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2145 (T335845)', diff saved to https://phabricator.wikimedia.org/P48003 and previous config saved to /var/cache/conftool/dbconfig/20230509-130459-ladsgroup.json | [production] | 
            
  | 13:04 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.sync-netbox-hiera (exit_code=0) generate netbox hiera data: "sync after adding ldap-rw servers - jmm@cumin2002" | [production] | 
            
  | 13:04 | <ladsgroup@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db1189 (T335845)', diff saved to https://phabricator.wikimedia.org/P48002 and previous config saved to /var/cache/conftool/dbconfig/20230509-130404-ladsgroup.json | [production] | 
            
  | 13:02 | <taavi@deploy1002> | Started scap: Backport for [[gerrit:910767|Add RealMe to extension-list (T324535)]] | [production] | 
            
  | 13:01 | <jmm@cumin2002> | START - Cookbook sre.puppet.sync-netbox-hiera generate netbox hiera data: "sync after adding ldap-rw servers - jmm@cumin2002" | [production] |