| 2024-03-04
      
      ยง | 
    
  | 20:34 | <brett@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 3:00:00 on cp5025.eqsin.wmnet with reason: T355905 | [production] | 
            
  | 20:34 | <brett@cumin2002> | START - Cookbook sre.hosts.downtime for 3:00:00 on cp5025.eqsin.wmnet with reason: T355905 | [production] | 
            
  | 20:33 | <brett@puppetmaster1001> | conftool action : set/pooled=no; selector: name=cp5025.eqsin.wmnet | [production] | 
            
  | 20:33 | <logmsgbot> | @deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:33 | <logmsgbot> | @deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:31 | <logmsgbot> | @deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:31 | <logmsgbot> | @deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:29 | <logmsgbot> | @deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:28 | <logmsgbot> | @deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:25 | <logmsgbot> | @deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:25 | <logmsgbot> | @deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:18 | <bking@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2105.codfw.wmnet with reason: host reimage | [production] | 
            
  | 20:15 | <bking@cumin2002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on elastic2104.codfw.wmnet with reason: host reimage | [production] | 
            
  | 20:14 | <bking@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2105.codfw.wmnet with reason: host reimage | [production] | 
            
  | 20:12 | <bking@cumin2002> | START - Cookbook sre.hosts.downtime for 2:00:00 on elastic2104.codfw.wmnet with reason: host reimage | [production] | 
            
  | 20:08 | <logmsgbot> | @deploy2002 helmfile [eqiad] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:08 | <logmsgbot> | @deploy2002 helmfile [eqiad] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 20:02 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depooling db2116 (T357189)', diff saved to https://phabricator.wikimedia.org/P58401 and previous config saved to /var/cache/conftool/dbconfig/20240304-200143-arnaudb.json | [production] | 
            
  | 20:01 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2116.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 20:01 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 12:00:00 on db2116.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 20:01 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2103 (T357189)', diff saved to https://phabricator.wikimedia.org/P58400 and previous config saved to /var/cache/conftool/dbconfig/20240304-200121-arnaudb.json | [production] | 
            
  | 19:58 | <bking@cumin2002> | START - Cookbook sre.hosts.reimage for host elastic2105.codfw.wmnet with OS bullseye | [production] | 
            
  | 19:57 | <htriedman@deploy2002> | Finished deploy [airflow-dags/platform_eng@a076d5c]: (no justification provided) (duration: 00m 26s) | [production] | 
            
  | 19:56 | <htriedman@deploy2002> | Started deploy [airflow-dags/platform_eng@a076d5c]: (no justification provided) | [production] | 
            
  | 19:56 | <bking@cumin2002> | START - Cookbook sre.hosts.reimage for host elastic2104.codfw.wmnet with OS bullseye | [production] | 
            
  | 19:46 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2103', diff saved to https://phabricator.wikimedia.org/P58399 and previous config saved to /var/cache/conftool/dbconfig/20240304-194614-arnaudb.json | [production] | 
            
  | 19:31 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2103', diff saved to https://phabricator.wikimedia.org/P58398 and previous config saved to /var/cache/conftool/dbconfig/20240304-193108-arnaudb.json | [production] | 
            
  | 19:16 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db2103 (T357189)', diff saved to https://phabricator.wikimedia.org/P58396 and previous config saved to /var/cache/conftool/dbconfig/20240304-191601-arnaudb.json | [production] | 
            
  | 19:10 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Depooling db2103 (T357189)', diff saved to https://phabricator.wikimedia.org/P58395 and previous config saved to /var/cache/conftool/dbconfig/20240304-191028-arnaudb.json | [production] | 
            
  | 19:10 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2103.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 19:10 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 12:00:00 on db2103.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 19:06 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db2102.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 19:06 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 12:00:00 on db2102.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 19:03 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 19:03 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 12:00:00 on dbstore1008.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 19:00 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1240.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 19:00 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 12:00:00 on db1240.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 19:00 | <htriedman@deploy2002> | Finished deploy [airflow-dags/analytics_product@a076d5c]: (no justification provided) (duration: 00m 09s) | [production] | 
            
  | 19:00 | <htriedman@deploy2002> | Started deploy [airflow-dags/analytics_product@a076d5c]: (no justification provided) | [production] | 
            
  | 18:57 | <arnaudb@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1239.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 18:57 | <arnaudb@cumin1002> | START - Cookbook sre.hosts.downtime for 12:00:00 on db1239.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 18:57 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1234 (T357189)', diff saved to https://phabricator.wikimedia.org/P58394 and previous config saved to /var/cache/conftool/dbconfig/20240304-185740-arnaudb.json | [production] | 
            
  | 18:42 | <arnaudb@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1234', diff saved to https://phabricator.wikimedia.org/P58393 and previous config saved to /var/cache/conftool/dbconfig/20240304-184234-arnaudb.json | [production] | 
            
  | 18:40 | <akosiaris@cumin1002> | END (FAIL) - Cookbook sre.hosts.reimage (exit_code=99) for host parse1024.eqiad.wmnet with OS bullseye | [production] | 
            
  | 18:32 | <marostegui@cumin1002> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 12:00:00 on db1239.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 18:32 | <marostegui@cumin1002> | START - Cookbook sre.hosts.downtime for 12:00:00 on db1239.eqiad.wmnet with reason: Maintenance | [production] | 
            
  | 18:32 | <marostegui@cumin1002> | dbctl commit (dc=all): 'Repooling after maintenance db1235 (T354015)', diff saved to https://phabricator.wikimedia.org/P58392 and previous config saved to /var/cache/conftool/dbconfig/20240304-183212-marostegui.json | [production] | 
            
  | 18:29 | <jhancock@cumin2002> | END (FAIL) - Cookbook sre.hardware.upgrade-firmware (exit_code=99) upgrade firmware for hosts ['es2036'] | [production] | 
            
  | 18:29 | <jhancock@cumin2002> | START - Cookbook sre.hardware.upgrade-firmware upgrade firmware for hosts ['es2036'] | [production] | 
            
  | 18:27 | <jhancock@cumin2002> | END (FAIL) - Cookbook sre.hosts.provision (exit_code=99) for host dbprov2006.mgmt.codfw.wmnet with reboot policy FORCED | [production] |