| 2023-11-20 |
  | 21:06 | <catrope@deploy2002> | catrope and stjn: Continuing with sync | [production] | 
            
  | 21:05 | <catrope@deploy2002> | catrope and stjn: Backport for [[gerrit:973795|Enable action blocks in ruwiki (T351048)]] synced to the testservers (https://wikitech.wikimedia.org/wiki/Mwdebug) | [production] | 
            
  | 21:03 | <catrope@deploy2002> | Started scap: Backport for [[gerrit:973795|Enable action blocks in ruwiki (T351048)]] | [production] | 
            
  | 21:02 | <eevans@cumin1001> | END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for aqs1014.eqiad.wmnet | [production] | 
            
  | 21:02 | <eevans@cumin1001> | START - Cookbook sre.hosts.remove-downtime for aqs1014.eqiad.wmnet | [production] | 
            
  | 21:02 | <eevans@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aqs1014.eqiad.wmnet with OS bullseye | [production] | 
            
  | 20:40 | <eevans@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1014.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 20:37 | <eevans@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1014.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 20:34 | <arnaudb@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 1 day, 0:00:00 on db2139.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 20:33 | <arnaudb@cumin1001> | START - Cookbook sre.hosts.downtime for 1 day, 0:00:00 on db2139.codfw.wmnet with reason: Maintenance | [production] | 
            
  | 20:33 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2138:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53645 and previous config saved to /var/cache/conftool/dbconfig/20231120-203337-arnaudb.json | [production] | 
            
  | 20:21 | <eevans@cumin1001> | START - Cookbook sre.hosts.reimage for host aqs1014.eqiad.wmnet with OS bullseye | [production] | 
            
  | 20:21 | <eevans@cumin1001> | END (ERROR) - Cookbook sre.hosts.reimage (exit_code=97) for host aqs1014.eqiad.wmnet with OS bullseye | [production] | 
            
  | 20:18 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2138:3314', diff saved to https://phabricator.wikimedia.org/P53644 and previous config saved to /var/cache/conftool/dbconfig/20231120-201831-arnaudb.json | [production] | 
            
  | 20:10 | <eevans@cumin1001> | START - Cookbook sre.hosts.reimage for host aqs1014.eqiad.wmnet with OS bullseye | [production] | 
            
  | 20:08 | <eevans@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host aqs1013.eqiad.wmnet with OS bullseye | [production] | 
            
  | 20:03 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2138:3314', diff saved to https://phabricator.wikimedia.org/P53643 and previous config saved to /var/cache/conftool/dbconfig/20231120-200324-arnaudb.json | [production] | 
            
  | 19:59 | <brett@cumin2002> | END (PASS) - Cookbook sre.hosts.remove-downtime (exit_code=0) for acmechief2001.codfw.wmnet | [production] | 
            
  | 19:59 | <brett@cumin2002> | START - Cookbook sre.hosts.remove-downtime for acmechief2001.codfw.wmnet | [production] | 
            
  | 19:50 | <eevans@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on aqs1013.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 19:48 | <arnaudb@cumin1001> | dbctl commit (dc=all): 'Repooling after maintenance db2138:3314 (T348183)', diff saved to https://phabricator.wikimedia.org/P53642 and previous config saved to /var/cache/conftool/dbconfig/20231120-194818-arnaudb.json | [production] | 
            
  | 19:48 | <eevans@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on aqs1013.eqiad.wmnet with reason: host reimage | [production] | 
            
  | 19:36 | <eevans@cumin1001> | START - Cookbook sre.hosts.reimage for host aqs1013.eqiad.wmnet with OS bullseye | [production] | 
            
  | 19:21 | <sukhe> | pool cp4045.ulsfo.wmnet post reboot and puppet 7 upgrade | [production] | 
            
  | 19:16 | <sukhe@cumin2002> | END (PASS) - Cookbook sre.hosts.reboot-single (exit_code=0) for host cp4045.ulsfo.wmnet | [production] | 
            
  | 19:05 | <sukhe@cumin2002> | START - Cookbook sre.hosts.reboot-single for host cp4045.ulsfo.wmnet | [production] | 
            
  | 19:04 | <cmooney@cumin1001> | END (FAIL) - Cookbook sre.dns.netbox (exit_code=99) | [production] | 
            
  | 19:03 | <brett@cumin1001> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host acmechief2001.codfw.wmnet with OS bookworm | [production] | 
            
  | 19:03 | <cmooney@cumin1001> | START - Cookbook sre.dns.netbox | [production] | 
            
  | 19:02 | <sukhe> | depool cp4045 for reboot | [production] | 
            
  | 18:59 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox | [production] | 
            
  | 18:59 | <cmooney@cumin1001> | START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox | [production] | 
            
  | 18:59 | <cmooney@cumin1001> | END (PASS) - Cookbook sre.netbox.update-extras (exit_code=0) rolling restart_daemons on A:netbox-canary | [production] | 
            
  | 18:59 | <cmooney@cumin1001> | START - Cookbook sre.netbox.update-extras rolling restart_daemons on A:netbox-canary | [production] | 
            
  | 18:57 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.migrate-host (exit_code=0) for host cp4045.ulsfo.wmnet | [production] | 
            
  | 18:48 | <jmm@cumin2002> | START - Cookbook sre.puppet.migrate-host for host cp4045.ulsfo.wmnet | [production] | 
            
  | 18:44 | <brett@cumin1001> | END (PASS) - Cookbook sre.hosts.downtime (exit_code=0) for 2:00:00 on acmechief2001.codfw.wmnet with reason: host reimage | [production] | 
            
  | 18:41 | <brett@cumin1001> | START - Cookbook sre.hosts.downtime for 2:00:00 on acmechief2001.codfw.wmnet with reason: host reimage | [production] | 
            
  | 18:39 | <bking@cumin1001> | START - Cookbook sre.wdqs.data-reload | [production] | 
            
  | 18:38 | <bking@cumin1001> | END (ERROR) - Cookbook sre.wdqs.data-reload (exit_code=97) | [production] | 
            
  | 18:37 | <ebernhardson@deploy2002> | helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 18:37 | <ebernhardson@deploy2002> | helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 18:27 | <brett@cumin1001> | START - Cookbook sre.hosts.reimage for host acmechief2001.codfw.wmnet with OS bookworm | [production] | 
            
  | 18:25 | <jmm@cumin2002> | END (PASS) - Cookbook sre.puppet.migrate-role (exit_code=0) for role: wikidough | [production] | 
            
  | 18:18 | <volans> | installed spicerack v8.1.0 on the cumin hosts | [production] | 
            
  | 18:13 | <jmm@cumin2002> | START - Cookbook sre.puppet.migrate-role for role: wikidough | [production] | 
            
  | 18:08 | <ebernhardson> | start test backfill of 4 days of itwiki and frwiki edits to relforge from cirrus updater | [production] | 
            
  | 18:06 | <ebernhardson@deploy2002> | helmfile [staging] DONE helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 18:06 | <ebernhardson@deploy2002> | helmfile [staging] START helmfile.d/services/cirrus-streaming-updater: apply | [production] | 
            
  | 17:49 | <bking@cumin2002> | END (PASS) - Cookbook sre.hosts.reimage (exit_code=0) for host cloudelastic1010.wikimedia.org with OS bullseye | [production] |